TY - JOUR
T1 - Evaluation of Machine Learning Models in Air Pollution Prediction for a Case Study of Macau as an Effort to Comply with UN Sustainable Development Goals
AU - Lei, Thomas M.T.
AU - Cai, Jianxiu
AU - Molla, Altaf Hossain
AU - Kurniawan, Tonni Agustiono
AU - Kong, Steven Soon Kai
N1 - Publisher Copyright:
© 2024 by the authors.
PY - 2024/9
Y1 - 2024/9
N2 - To comply with the United Nations Sustainable Development Goals (UN SDGs), in particular with SDG 3, SDG 11, and SDG 13, a reliable air pollution prediction model must be developed to construct a sustainable, safe, and resilient city and mitigate climate change for a double win. Machine learning (ML) and deep learning (DL) models have been applied to datasets in Macau to predict the daily levels of roadside air pollution in the Macau peninsula, situated near the historical sites of Macau. Macau welcomed over 28 million tourists in 2023 as a popular tourism destination. Still, an accurate air quality forecast has not been in place for many years due to the lack of a reliable emission inventory. This work will develop a dependable air pollution prediction model for Macau, which is also the novelty of this study. The methods, including random forest (RF), support vector regression (SVR), artificial neural network (ANN), recurrent neural network (RNN), long short-term memory (LSTM), and gated recurrent unit (GRU), were applied and successful in the prediction of daily air pollution levels in Macau. The prediction model was trained using the air quality and meteorological data from 2013 to 2019 and validated using the data from 2020 to 2021. The model performance was evaluated based on the root mean square error (RMSE), mean absolute error (MAE), Pearson’s correlation coefficient (PCC), and Kendall’s tau coefficient (KTC). The RF model best predicted PM10, PM2.5, NO2, and CO concentrations with the highest PCC and KTC in a daily air pollution prediction. In addition, the SVR model had the best stability and repeatability compared to other models, with the lowest SD in RMSE, MAE, PCC, and KTC after five model runs. Therefore, the results of this study show that the RF model is more efficient and performs better than other models in the prediction of air pollution for the dataset of Macau.
AB - To comply with the United Nations Sustainable Development Goals (UN SDGs), in particular with SDG 3, SDG 11, and SDG 13, a reliable air pollution prediction model must be developed to construct a sustainable, safe, and resilient city and mitigate climate change for a double win. Machine learning (ML) and deep learning (DL) models have been applied to datasets in Macau to predict the daily levels of roadside air pollution in the Macau peninsula, situated near the historical sites of Macau. Macau welcomed over 28 million tourists in 2023 as a popular tourism destination. Still, an accurate air quality forecast has not been in place for many years due to the lack of a reliable emission inventory. This work will develop a dependable air pollution prediction model for Macau, which is also the novelty of this study. The methods, including random forest (RF), support vector regression (SVR), artificial neural network (ANN), recurrent neural network (RNN), long short-term memory (LSTM), and gated recurrent unit (GRU), were applied and successful in the prediction of daily air pollution levels in Macau. The prediction model was trained using the air quality and meteorological data from 2013 to 2019 and validated using the data from 2020 to 2021. The model performance was evaluated based on the root mean square error (RMSE), mean absolute error (MAE), Pearson’s correlation coefficient (PCC), and Kendall’s tau coefficient (KTC). The RF model best predicted PM10, PM2.5, NO2, and CO concentrations with the highest PCC and KTC in a daily air pollution prediction. In addition, the SVR model had the best stability and repeatability compared to other models, with the lowest SD in RMSE, MAE, PCC, and KTC after five model runs. Therefore, the results of this study show that the RF model is more efficient and performs better than other models in the prediction of air pollution for the dataset of Macau.
KW - air pollution prediction
KW - air quality
KW - deep learning
KW - machine learning
KW - sustainability
KW - sustainable development
UR - http://www.scopus.com/inward/record.url?scp=85203980859&partnerID=8YFLogxK
U2 - 10.3390/su16177477
DO - 10.3390/su16177477
M3 - Article
AN - SCOPUS:85203980859
SN - 2071-1050
VL - 16
JO - Sustainability
JF - Sustainability
IS - 17
M1 - 7477
ER -