收藏设为首页 广告服务联系我们在线留言

干旱区地理 ›› 2026, Vol. 49 ›› Issue (2): 343-355.doi: 10.12118/j.issn.1000-6060.2025.100 cstr: 32274.14.ALG2025100

• 气候与水文 • 上一篇    下一篇

基于改进深度学习模型的干旱区城市PM2.5浓度预测与应用研究

王圣节(), 张庆红(), 桑铭键   

  1. 新疆财经大学统计与数据科学学院,新疆 乌鲁木齐 830012
  • 收稿日期:2025-02-27 修回日期:2025-04-23 出版日期:2026-02-25 发布日期:2026-02-27
  • 通讯作者: 张庆红(1973-),女,博士,教授,主要从事资源环境统计研究. E-mail: zhqh@xjufe.edu.cn
  • 作者简介:王圣节(1997-),男,博士研究生,主要从事干旱区环境统计研究. E-mail: wsj12252025@163.com
  • 基金资助:
    国家自然科学基金资助项目(72164034);国家社科基金项目(24XTJ003);新疆财经大学研究生科研创新项目(XJUFE2025B011)

Prediction and application of urban PM2.5 concentration in arid zones based on improved deep learning models

WANG Shengjie(), ZHANG Qinghong(), SANG Mingjian   

  1. School of Statistics and Data Science, Xinjiang University of Finance and Economics, Urumqi 830012, Xinjiang, China
  • Received:2025-02-27 Revised:2025-04-23 Published:2026-02-25 Online:2026-02-27

摘要:

随着全球城市化进程加速,干旱区城市PM2.5污染因其独特的地理气候条件,呈现强非平稳性与复杂时空特征,传统预测模型难以有效捕捉其动态规律。针对这一挑战,构建了“自适应噪声完备经验模态分解-常春藤优化算法-柯尔莫哥洛夫-阿诺德网络-双向长短期记忆神经网络”(CEEMDAN-IVY-KAN-BiLSTM)混合预测框架,以提升PM2.5浓度的预测精度。该框架通过降噪分解与参数优化联合提取多尺度特征,融合KAN-BiLSTM模型的强非线性拟合与双向时序建模能力,有效提升预测性能。结果表明:(1) 2021—2024年乌鲁木齐市PM2.5浓度呈显著季节性波动,冬季因燃煤供暖和逆温层影响均值达41.97 μg·m-3,夏季因大气对流增强,浓度降至全年低位附近,且整体呈逐年下降趋势。(2) 对数据进行重要性排序可知PM2.5与空气质量指数、PM10、CO、NO2呈强正相关,与气温、露点温度呈负相关,表明燃煤排放、机动车尾气及气象扩散条件是主要影响因素,且模型有效分离了数据中PM2.5序列的高频波动(如沙尘事件)与低频趋势(季节性变化),降低数据非平稳性影响。(3) 实验基于2021—2024年乌鲁木齐市逐日空气质量数据开展,结果显示本框架在决定系数、平均绝对误差与均方根误差指标上分别达到0.991、1.391和1.881,均显著优于传统机器学习和常见深度学习模型。验证了“分解-优化-集成”的深度学习框架在干旱区城市PM2.5预测中的适用性。

关键词: PM2.5浓度预测, CEEMDAN分解, IVY优化算法, KAN-BiLSTM模型, 深度学习, 干旱区城市PM2.5

Abstract:

With the acceleration of global urbanization, the severe PM2.5 pollution in arid zone cities, owing to their unique geographical and climatic conditions, exhibits strong non-stationarity and complex spatiotemporal characteristics, making it difficult for traditional prediction models to effectively capture its dynamic patterns. To address this challenge, this study proposes a hybrid prediction framework of an “adaptive noise complete ensemble empirical mode decomposition-Ivy optimization algorithm-Kolmogorov Arnold network-bidirectional long short-term memory neural network” (CEEMDAN-IVY-KAN-BiLSTM), aiming to enhance the prediction accuracy of PM2.5 concentrations. This framework jointly extracts multi-scale features through noise reduction decomposition as well as parameter optimization and integrates the strong nonlinear fitting and bidirectional time series modeling capabilities of the KAN-BiLSTM model, effectively improving the prediction performance. The results reveal that the PM2.5 concentration in Urumqi City from 2021 to 2024 shows significant seasonal fluctuations, with an average of 41.97 μg·m-3 in winter due to coal heating and the influence of the inversion layer and drops to 14.04 μg·m-3 in summer due to enhanced atmospheric convection. Moreover, it shows an overall decreasing trend annually. Moreover, the importance ranking of the data indicates that PM2.5 is significantly positively correlated with air quality index, PM10, CO, and NO2, and negatively correlated with temperature and dew point temperature, suggesting that coal emissions, vehicle exhaust, and meteorological diffusion conditions are the main influencing factors. Moreover, the model effectively separates the high-frequency fluctuations (such as sandstorm events) and low-frequency trends (seasonal changes) in the PM2.5 sequence, reducing the impact of data non-stationarity. Finally, the experiments were based on daily air quality data in Urumqi City from 2021 to 2024, results of which demonstrate that this model achieves the coefficient of determination, mean absolute error, and root mean square error values of 0.991, 1.391, and 1.881, respectively, significantly outperforming conventional machine learning and common deep learning models. This verifies the applicability of the “decomposition-optimization-integration” deep learning framework in the prediction of arid zone cities.

Key words: PM2.5 concentration prediction, CEEMDAN decomposition, IVY optimisation algorithm, KAN-BiLSTM model, deep learning, urban PM2.5 concentration in arid zones