Residential College | false |
Status | 已發表Published |
Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China | |
Wu, Boxi1,2,3; Wu, Cheng1,2,3![]() ![]() | |
2024-03-05 | |
Source Publication | Atmospheric Environment
![]() |
ISSN | 1352-2310 |
Volume | 321Pages:120347 |
Abstract | Long-term air pollution data are essential for formulating air quality management policies and assessing their corresponding impacts on public health. However, missing data are inevitably encountered during air pollution observations at different sites. This study proposed a machine learning approach that utilizes data from neighboring sites to reconstruct missing data. Hourly observation data from three neighboring sites in the Pearl River Delta (PRD) region in South China, were used for data retrieval, including the NC site (2006–2015), JXL site and PYZX site (2014–2022). The overlapped data (2014.05–2015.12) were used to train and evaluate the machine learning models. The performance of 11 algorithms (CatBoost, XGBoost, LightGBM, LightGBMXT, LightGBMLarge, RandomForestMSE, ExtraTreeMSE, NeuralNetTorch, NeuralNetFastAI, KNeighborsDist, and KNeighborsUnif) for the retrieval of major air pollutants, including O3, NO2, PM2.5, PM10 and SO2 was benchmarked by a set of evaluation metrics. CatBoost showed the best performance; thus, it was adopted for air pollutant data reconstruction in NC (2016–2022) and PYZX (2008–2014). Long-term data (2006–2022) at the NC were obtained by combining the observation and retrieval data. In the past 15 years, the O3 concentration of NC has increased by 72% at a rate of 0.83 ppb yr−1 (3.2% yr−1 ). On the contrary, substantial reductions were observed for NO2 (61%), PM2.5 (51%) and PM10 (42%) at the NC site, with the rates of −1.27 ppb yr−1 (−5.9% yr−1 ), −1.96 μg m−3 yr−1 (−5.8% yr−1 ) and −2.32 μg m−3 yr−1 (−5.2% yr−1 ), respectively. SO2 exhibits the most pronounced reduction (79%) among all species, with two distinct rates of −4.10 ppb yr−1 (−27.4% yr−1 ) and −0.40 ppb yr−1 (−6.2% yr−1), for 2008–2012 and 2012–2022, respectively. This study demonstrates the feasibility of machine learning in filling the data gap of air pollution monitoring network and highlights the importance of continuous long-term air pollution data in reviewing air quality management policies. |
Keyword | Air Pollution Data Gap Long-term Trend Machine Learning Monitoring Network |
DOI | 10.1016/j.atmosenv.2024.120347 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Environmental Sciences & Ecology ; Meteorology & Atmospheric Sciences |
WOS Subject | Environmental Sciences ; Meteorology & Atmospheric Sciences |
WOS ID | WOS:001166916500001 |
Publisher | PERGAMON-ELSEVIER SCIENCE LTD, THE BOULEVARD, LANGFORD LANE, KIDLINGTON, OXFORD OX5 1GB, ENGLAND |
Scopus ID | 2-s2.0-85182892646 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | Faculty of Science and Technology DEPARTMENT OF CIVIL AND ENVIRONMENTAL ENGINEERING DEPARTMENT OF OCEAN SCIENCE AND TECHNOLOGY |
Corresponding Author | Wu, Cheng |
Affiliation | 1.Institute of Mass Spectrometry and Atmospheric Environment, Guangdong Provincial Engineering Research Center for on-line Source Apportionment System of Air Pollution, Jinan University, Guangzhou, 510632, China 2.Guangdong-Hongkong-Macau Joint Laboratory of Collaborative Innovation for Environmental Quality, Guangzhou, 510632, China 3.Guangdong MS Institute of Scientific Instrument Innovation, Guangzhou, 510530, China 4.School of Atmospheric Sciences, Nanjing University of Information Science and Technology, Nanjing, 210044, China 5.Guangzhou Sub-branch of Guangdong Ecological and Environmental Monitoring Center, Guangzhou, 518049, China 6.Institute of Tropical and Marine Meteorology, CMA, Guangzhou, 510080, China 7.Department of Civil and Environmental Engineering, Centre for Regional Oceans, And Department of Ocean Science and Technology, Faculty of Science and Technology, University of Macau, Taipa, Macau, China 8.Department of Geography and Resource Management, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China 9.Institute of Traditional Chinese Medicine and Natural Products, College of Pharmacy, And JNU-HKUST Joint Laboratory for Neuroscience and Innovative Drug Research, Jinan University, Guangzhou, 510632, China |
Recommended Citation GB/T 7714 | Wu, Boxi,Wu, Cheng,Ye, Yuchen,et al. Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China[J]. Atmospheric Environment, 2024, 321, 120347. |
APA | Wu, Boxi., Wu, Cheng., Ye, Yuchen., Pei, Chenglei., Deng, Tao., Li, Yong Jie., Lu, Xingcheng., Wang, Lei., Hu, Bin., Li, Mei., & Wu, Dui (2024). Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China. Atmospheric Environment, 321, 120347. |
MLA | Wu, Boxi,et al."Long-term hourly air quality data bridging of neighboring sites using automated machine learning: A case study in the Greater Bay area of China".Atmospheric Environment 321(2024):120347. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment