引用本文:屈志坚,彭翔,王群峰,王汉林.配电网监测数据微批处理的血统链标记容错法[J].电力自动化设备,2019,39(4):
QU Zhijian,PENG Xiang,WANG Qunfeng,WANG Hanlin.Lineage chain mark fault-tolerant method for micro-batching of distribution network monitoring data[J].Electric Power Automation Equipment,2019,39(4):
【打印本页】   【HTML】   【下载PDF全文】   查看/发表评论  【EndNote】   【RefMan】   【BibTex】
←前一篇|后一篇→ 过刊浏览    高级检索
本文已被:浏览 4029次   下载 1724  
配电网监测数据微批处理的血统链标记容错法
屈志坚, 彭翔, 王群峰, 王汉林
华东交通大学 电气与自动化工程学院,江西 南昌 330013
摘要:
针对分布式配电自动化系统存在数据量井喷、海量监测数据缺乏高效的分布式故障容错机制的问题,提出一种血统链标记容错新方法。利用弹性分布式数据集、微批计算的记录级容错和血统链标记序列融合处理的设计技巧,实现了分布式数据容错中血统链的追溯和条件标记的自动选择。以铁路配电网监测采集的数据为算例,搭建了4机集群的调度监控平台进行容错测试。以发生频次最高的单数据节点故障为例,测试结果表明:对于包含3×106条监测数据记录的弹性分布式数据集,血统链标记容错模型的集群CPU平均占用率波动小于1.5%,磁盘占用率下降4.2%;当迭代次数达到600、800次时,迭代运算耗时分别降低24.3%和42.9%;所提方法实现500 ms流处理延时的同时,对集群资源的使用情况也具有较好的优化效果,验证了该方法对分布式集群容错的有效性。
关键词:  配电自动化系统  配电网监测数据  分布式集群  微批计算  血统链标记  流计算  容错
DOI:10.16081/j.issn.1006-6047.2019.04.002
分类号:TM73
基金项目:国家自然科学基金资助项目(51567008);江西省杰出青年人才计划项目(20162BCB23045);江西省自然科学基金资助项目(20171BAB206044);江西省科技厅应用培育项目(20181BBE58010)
Lineage chain mark fault-tolerant method for micro-batching of distribution network monitoring data
QU Zhijian, PENG Xiang, WANG Qunfeng, WANG Hanlin
School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China
Abstract:
Aiming at the existing problems of monitoring message blowout and lacking of efficient distributed fault-tolerant mechanism for huge amounts of monitoring data in the distributed distribution automation system, a novel lineage chain mark fault-tolerant method is proposed. The retrospection of lineage chain and the automatic selection of conditional marks in the distributed data fault-tolerant progress are realized by applying the design technique of fusing the resilient distributed data set, the record-level fault-tolerant of micro-batching computing and the lineage chain mark sequence. The monitoring data of railway distribution network are taken as the simulation example and the dispatch monitoring platform of a 4-generator cluster is built to carry out the fault-tolerant test. A single-node fault with the highest frequency is taken as an example, whose test results show that, for the resilient distributed data set including 3×106 monitoring data records, the average CPU utilization rate fluctuation is less than 1.5% and the disk utilization rate decreases by 4.2%;when the number of iteration increases to 600 and 800, the time-consuming decreases by 24.3% and 42.9% respectively; the proposed method not only realizes the stream processing delay of 500 ms, but also has a better optimization effect on the usage of cluster resources, verifying the effectiveness of the method for distributed cluster fault-tolerant.
Key words:  distribution automation system  distribution network monitoring data  distributed cluster  micro-batching computing  lineage chain mark  stream computing  fault-tolerant

用微信扫一扫

用微信扫一扫