《电力自动化设备》

引用本文:	屈志坚,彭翔,王群峰,王汉林.配电网监测数据微批处理的血统链标记容错法[J].电力自动化设备,2019,39(4):
	QU Zhijian,PENG Xiang,WANG Qunfeng,WANG Hanlin.Lineage chain mark fault-tolerant method for micro-batching of distribution network monitoring data[J].Electric Power Automation Equipment,2019,39(4):

【打印本页】【HTML】【下载PDF全文】【查看/发表评论】【EndNote】【RefMan】【BibTex】

←前一篇|后一篇→

过刊浏览高级检索

本文已被：浏览 4029次下载 1724次
字体:加大+\|默认\|缩小-
配电网监测数据微批处理的血统链标记容错法
屈志坚, 彭翔, 王群峰, 王汉林
华东交通大学电气与自动化工程学院，江西南昌 330013

摘要:

针对分布式配电自动化系统存在数据量井喷、海量监测数据缺乏高效的分布式故障容错机制的问题，提出一种血统链标记容错新方法。利用弹性分布式数据集、微批计算的记录级容错和血统链标记序列融合处理的设计技巧，实现了分布式数据容错中血统链的追溯和条件标记的自动选择。以铁路配电网监测采集的数据为算例，搭建了4机集群的调度监控平台进行容错测试。以发生频次最高的单数据节点故障为例，测试结果表明：对于包含3×106条监测数据记录的弹性分布式数据集，血统链标记容错模型的集群CPU平均占用率波动小于1.5%，磁盘占用率下降4.2%；当迭代次数达到600、800次时，迭代运算耗时分别降低24.3%和42.9%；所提方法实现500 ms流处理延时的同时，对集群资源的使用情况也具有较好的优化效果，验证了该方法对分布式集群容错的有效性。

关键词: 配电自动化系统配电网监测数据分布式集群微批计算血统链标记流计算容错

DOI：10.16081/j.issn.1006-6047.2019.04.002

分类号:TM73

基金项目:国家自然科学基金资助项目(51567008)；江西省杰出青年人才计划项目(20162BCB23045)；江西省自然科学基金资助项目(20171BAB206044)；江西省科技厅应用培育项目(20181BBE58010)

Lineage chain mark fault-tolerant method for micro-batching of distribution network monitoring data

QU Zhijian, PENG Xiang, WANG Qunfeng, WANG Hanlin

School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang 330013, China

Abstract:

Aiming at the existing problems of monitoring message blowout and lacking of efficient distributed fault-tolerant mechanism for huge amounts of monitoring data in the distributed distribution automation system, a novel lineage chain mark fault-tolerant method is proposed. The retrospection of lineage chain and the automatic selection of conditional marks in the distributed data fault-tolerant progress are realized by applying the design technique of fusing the resilient distributed data set, the record-level fault-tolerant of micro-batching computing and the lineage chain mark sequence. The monitoring data of railway distribution network are taken as the simulation example and the dispatch monitoring platform of a 4-generator cluster is built to carry out the fault-tolerant test. A single-node fault with the highest frequency is taken as an example, whose test results show that, for the resilient distributed data set including 3×106 monitoring data records, the average CPU utilization rate fluctuation is less than 1.5% and the disk utilization rate decreases by 4.2%;when the number of iteration increases to 600 and 800, the time-consuming decreases by 24.3% and 42.9% respectively; the proposed method not only realizes the stream processing delay of 500 ms, but also has a better optimization effect on the usage of cluster resources, verifying the effectiveness of the method for distributed cluster fault-tolerant.

Key words: distribution automation system distribution network monitoring data distributed cluster micro-batching computing lineage chain mark stream computing fault-tolerant

用微信扫一扫