TY - GEN
T1 - Adaptive Cyclic Learning Rate for Long-Term Time-Series Forecasting on Encoder-Decoder and Encoder-Only Deep Learning Model Architectures
AU - Mahhisa, Farrela Ranku
AU - Shiddiqi, Ary Mazharuddin
AU - Anggraini, Ratih Nur Esti
AU - Putra, Andika Laksana
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Long-term time-series forecasting remains a critical challenge, with deep learning models often facing persistent training instability. This study reveals a critical insight: the effectiveness of a dynamic optimizer is fundamentally dependent on the underlying model architecture. We find that the autoregressive decoding process in encoder-decoder models clashes with cyclical learning rate adjustments, while the direct, nonautoregressive structure of encoder-only models proves highly compatible. To demonstrate this, we investigate the Adaptive Cyclic Learning Rate (ACLR) across five representative models from both encoder-decoder (Transformer, Informer) and encoder-only (iTransformer, FRNet, PatchTST) families on three benchmark datasets. The results are statistically significant, showing that ACLR consistently enhances stability and prediction accuracy for encoder-only models while inducing instability and performance degradation in their encoder-decoder counterparts. These findings provide a new guiding principle for practitioners, emphasizing that architectural compatibility should be a primary consideration when selecting an optimization strategy for long-term time series forecasting.
AB - Long-term time-series forecasting remains a critical challenge, with deep learning models often facing persistent training instability. This study reveals a critical insight: the effectiveness of a dynamic optimizer is fundamentally dependent on the underlying model architecture. We find that the autoregressive decoding process in encoder-decoder models clashes with cyclical learning rate adjustments, while the direct, nonautoregressive structure of encoder-only models proves highly compatible. To demonstrate this, we investigate the Adaptive Cyclic Learning Rate (ACLR) across five representative models from both encoder-decoder (Transformer, Informer) and encoder-only (iTransformer, FRNet, PatchTST) families on three benchmark datasets. The results are statistically significant, showing that ACLR consistently enhances stability and prediction accuracy for encoder-only models while inducing instability and performance degradation in their encoder-decoder counterparts. These findings provide a new guiding principle for practitioners, emphasizing that architectural compatibility should be a primary consideration when selecting an optimization strategy for long-term time series forecasting.
KW - adaptive learning rate
KW - deep learning
KW - long-term forecasting
KW - model evaluation
KW - time series
UR - https://www.scopus.com/pages/publications/105034733782
U2 - 10.1109/ICTS67612.2025.11369704
DO - 10.1109/ICTS67612.2025.11369704
M3 - Conference contribution
AN - SCOPUS:105034733782
T3 - 2025 15th International Conference on Information and Communication Technology and System: AI for the Now and Next: Delivering Solutions and Driving Vision, ICTS 2025
BT - 2025 15th International Conference on Information and Communication Technology and System
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th International Conference on Information and Communication Technology and System, ICTS 2025
Y2 - 12 November 2025 through 13 November 2025
ER -