TY - GEN
T1 - Hybrid Machine Learning for Forecasting and Monitoring Air Pollution in Surabaya
AU - Suhartono,
AU - Choiruddin, Achmad
AU - Prabowo, Hendri
AU - Lee, Muhammad Hisyam
N1 - Publisher Copyright:
© 2021, Springer Nature Singapore Pte Ltd.
PY - 2021
Y1 - 2021
N2 - This research aims to propose hybrid machine learnings for forecasting and monitoring air pollution in Surabaya. In particular, we introduce two hybrid machine learnings, i.e. hybrid Time Series Regression – Feedforward Neural Network (TSR-FFNN) and hybrid Time Series Regression – Long Short-Term Memory (TSR-LSTM). TSR is used to capture linear patterns from data, whereas FFNN or LSTM is used to capture non-linear patterns. Fifteen half-hourly series data, i.e. CO, NO2, O3, PM10, and SO2 in three SUF stations at Surabaya, are used as the case study. We compare the forecasting accuracy of these hybrid machine learnings with several individual methods (i.e. TSR, ARIMA, FFNN, and LSTM), and combined methods (i.e. TSR with AR error and TSR with ARMA error). The identification step showed that these air pollution data have double seasonal patterns, i.e. daily and weekly seasonality. The comparison results showed that no superior method that yields the most accurate forecast for all series data. Moreover, the results showed that hybrid methods gave more accurate forecast at 8 series data, whereas the individual methods yielded better results at 7 series data. It supported that methods that are more complex do not always produce better forecasts than simple methods, as shown by the first result of the M3 competition. Additionally, the results of the forecast of air pollution index for monitoring air pollution in Surabaya show that the air quality is in good and moderate air pollution levels for duration of 19.30 to 03.00 and 0.30 to 19.30, respectively.
AB - This research aims to propose hybrid machine learnings for forecasting and monitoring air pollution in Surabaya. In particular, we introduce two hybrid machine learnings, i.e. hybrid Time Series Regression – Feedforward Neural Network (TSR-FFNN) and hybrid Time Series Regression – Long Short-Term Memory (TSR-LSTM). TSR is used to capture linear patterns from data, whereas FFNN or LSTM is used to capture non-linear patterns. Fifteen half-hourly series data, i.e. CO, NO2, O3, PM10, and SO2 in three SUF stations at Surabaya, are used as the case study. We compare the forecasting accuracy of these hybrid machine learnings with several individual methods (i.e. TSR, ARIMA, FFNN, and LSTM), and combined methods (i.e. TSR with AR error and TSR with ARMA error). The identification step showed that these air pollution data have double seasonal patterns, i.e. daily and weekly seasonality. The comparison results showed that no superior method that yields the most accurate forecast for all series data. Moreover, the results showed that hybrid methods gave more accurate forecast at 8 series data, whereas the individual methods yielded better results at 7 series data. It supported that methods that are more complex do not always produce better forecasts than simple methods, as shown by the first result of the M3 competition. Additionally, the results of the forecast of air pollution index for monitoring air pollution in Surabaya show that the air quality is in good and moderate air pollution levels for duration of 19.30 to 03.00 and 0.30 to 19.30, respectively.
KW - Air pollution
KW - Forecasting
KW - Hybrid
KW - Machine learning
KW - Monitoring
UR - http://www.scopus.com/inward/record.url?scp=85119398039&partnerID=8YFLogxK
U2 - 10.1007/978-981-16-7334-4_27
DO - 10.1007/978-981-16-7334-4_27
M3 - Conference contribution
AN - SCOPUS:85119398039
SN - 9789811673337
T3 - Communications in Computer and Information Science
SP - 366
EP - 380
BT - Soft Computing in Data Science - 6th International Conference, SCDS 2021, Proceedings
A2 - Mohamed, Azlinah
A2 - Yap, Bee Wah
A2 - Zain, Jasni Mohamad
A2 - Berry, Michael W.
PB - Springer Science and Business Media Deutschland GmbH
T2 - 6th International Conference on Soft Computing in Data Science, SCDS 2021
Y2 - 2 November 2021 through 3 November 2021
ER -