
Air Quality Forecasting with Mutual Information and Random Forests Based on Big Data

【Authors】 YANG Zheng-li (杨正理); SHI Wen (史文); CHEN Hai-xia (陈海霞); WANG Chang-peng (王长鹏)

【Affiliation】 School of Mechanical and Electrical Engineering, Sanjiang University (三江学院)

【Abstract】 To forecast urban air quality accurately, this work addresses the characteristics of the big data involved in such forecasting: many varieties, large scale, high dimensionality, and high generation velocity. Building on a study of air quality evaluation indexes for different regions of a city, a subspace clustering analysis method for regional air quality is proposed to mine the air quality characteristics of each region. The regions are divided into groups, and a mutual information matrix is used to identify the factors associated with each region's air quality in terms of city function, terrain, meteorological conditions, and other aspects. A city air quality forecasting model based on the random forest algorithm is then built. The method can effectively identify the factors strongly associated with air quality in different regions of a city, and it avoids the adverse effect that differences in those factors have on forecasting. Simulation results show that the method is suitable for analyzing and processing big data and achieves high prediction accuracy.
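The abstract does not give implementation details, so the following is only a minimal sketch of the factor-screening idea: estimating the mutual information between each candidate factor and an air quality level, then ranking the factors. It assumes discretized variables and uses the standard estimate I(X;Y) = Σ p(x,y) log[p(x,y) / (p(x)p(y))]. The function names, factor names, and toy data are hypothetical and are not taken from the paper; the subspace clustering and random forest stages are not shown.

```python
from collections import Counter
from math import log


def mutual_information(xs, ys):
    """Estimate I(X;Y) in nats from paired samples of two discrete variables."""
    n = len(xs)
    count_x = Counter(xs)
    count_y = Counter(ys)
    count_xy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in count_xy.items():
        # p(x,y) * log(p(x,y) / (p(x) * p(y))), written in terms of raw counts
        mi += (c / n) * log(c * n / (count_x[x] * count_y[y]))
    return mi


def rank_factors(factors, target):
    """Return (factor name, MI score) pairs sorted from most to least informative."""
    scores = {name: mutual_information(values, target)
              for name, values in factors.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Made-up toy data for illustration only (not from the paper).
aqi_level = ["good", "good", "poor", "poor", "good", "poor", "good", "poor"]
factors = {
    # Hypothetical factor that tracks the air quality level exactly.
    "wind": ["high", "high", "low", "low", "high", "low", "high", "low"],
    # Hypothetical factor that is statistically independent of it.
    "weekday": ["mon", "tue", "mon", "tue", "mon", "tue", "tue", "mon"],
}

ranked = rank_factors(factors, aqi_level)
for name, score in ranked:
    print(f"{name}: {score:.3f} nats")
```

In this toy setup the hypothetical `wind` factor ranks first and `weekday` scores zero. In a pipeline like the one the abstract describes, the strongly associated factors found for each group of regions would then serve as inputs to the random forest forecasting model.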

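【Funding】 General Program of Natural Science Research of Jiangsu Higher Education Institutions (17KJB470011)
  • 【Source】 Environmental Engineering (环境工程), 2019, Issue 03
  • 【Classification No.】 X51
  • 【Citations】 8
  • 【Downloads】 575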