Fuzzy Systems and its Applications

Fuzzy Systems and its Applications

Investigating the effect of selecting hyperparameters in time series forecasting using LSTM neural networks

Document Type : Original Article

Authors
1 Computer Engineering Department, Faculty of Engineering, Zabol University, Zabol, Iran
2 Department of Electrical Engineering, Faculty of Engineering, University of Zabol, zabol, Iran.
10.22034/jfsa.2025.483155.1248
Abstract
“One of the important issues in the field of data mining is time series prediction, which has attracted the attention of many researchers in recent years.” Usually, the time series data are related to time and have specific patterns that by knowing these patterns and using them, it is possible to predict future events. One of the most common deep neural networks for predicting time series is the long-short-term memory (LSTM) neural network. The LSTM network is a recurrent neural network that can retain complex time-related information and understand complex patterns in time series. One of the important challenges of neural networks in predicting time series is determining the correct value of hyperparameters, which has a great impact on their efficiency and performance. Various hyperparameters can be considered. The hyperparameters considered in this paper are: the number of neurons in the hidden layer, the sparsity parameter, the weight reduction parameter in the cost function, and the sparsity penalty weight. The Bayesian optimization method is used to set the correct value of the hyperparameters to achieve the highest accuracy in the problem. Bayesian optimization method determines probability distribution for hyperparameters using probability theory. In order to evaluate the performance of the proposed method, a random data set has been used. The proposed method is applicable and generalizable to real data sets such as weather, environment and energy data. The obtained results show the efficiency and proper performance of the proposed method.
Keywords

[5]    Abbasimehr, H., & Paki, R. (2021) Improving time series forecasting using LSTM and attention models. Journal of Ambient Intelligence and Humanized Computing, 13, 673–691. Available: https://api.semanticscholar.org/CorpusID:234311110
[6]    Abbasimehr, H., Shabani, M., & Yousefi, M. (2020) An optimized model using LSTM network for demand forecasting. Computers & Industrial Engineering, 143, 106435. https://doi.org/10. 1016/j.cie.2020.106435
[7]    Baek, Y., & Kim, H. Y. (2018) ModAugNet: A new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module. Expert Systems with  Applications,  113,  457–480.  https://doi.org/10.1016/j.eswa.2018.07.019
[8]    Eid, M. M., Khodadadi, N., & Mirjalili, S. (2022) Meta-heuristic optimization of LSTM-based deep network for boosting the prediction of monkeypox cases. [Online], pp. 1–20.
[9]    ElSaid, A., El Jamiy, F., Higgins, J., Wild, B., & Desell, T. (2018) Optimizing long short-term memory recurrent neural networks using ant colony optimization to predict turbine engine vibration. Ap- plied Soft Computing, 73, 969–991. https://doi.org/10.1016/j.asoc.2018.09.013
[10]    Ghanbari, N., Zahiri, H., & Shahraki, H. (2022) Clustering of fuzzy data sets based on particle swarm optimization with fuzzy cluster centers. International Journal of Industrial Engineering & Production Research, 33(2), 1–12.
[11]    Ghanbari, N., Zahiri, S. H., & Shahraki, H. (2024) Clustering of triangular fuzzy data based on heuristic methods. Journal of Electrical and Computer Engineering Innovations, 12(1), 1–14. https://doi.org/10.22061/jecei.2023.9641.645
[12]    Hajizadeh Tahan, M., Ghasemzadeh, M., & Rezaeian, M. (2021) An evolutionary attention-based deep long short term memory for time series prediction. Computational Intelligence and Electrical Engineering, 11(4), 15–28.
[13]    Hochreiter, S., & Schmidhuber, J. (1997) Long short-term memory. Neural Computation, 9(8), 1735–1780.    https://doi.org/10.1162/neco.1997.9.8.1735
[14]    Izadi, M. (2013) Forecasting inflation rate in Kermanshah province with LSTM neural network.
[15]    Karevan, Z., & Suykens, J. A. K. (2020) Transductive LSTM for time-series prediction: An ap- plication to weather forecasting. Neural Networks, 125, 1–9. https://doi.org/10.1016/j. neunet.2019.12.030
[16]    Ma, R., Zhu, X., & Yan, L. (2020) A hybrid approach for clustering uncertain time series. Journal of Computing and Information Technology, 28(4), 255–267. https://doi.org/10.20532/cit. 2020.1004802
[17]    Moghar, A. (2020) A LSTM recurrent stock market prediction using neural network. Procedia Com- puter Science, 170, 1168–1173. https://doi.org/10.1016/j.procs.2020.03.049
[18]    Sagheer, A., & Kotb, M. (2019) Time series forecasting of petroleum production using deep LSTM recurrent networks. Neurocomputing, 323, 203–213. https://doi.org/10.1016/j. neucom.2018.09.082
[19]    Science, N., Phenomena, C., Kumar, V., Chimmula, R., & Zhang, L. (2020) Time series forecasting of COVID-19 transmission in Canada using LSTM networks. Chaos, Solitons and Fractals, 135.
[20]    Surakhi, O., et al. (2021) Time-lag selection for time-series forecasting using neural network and heuristic algorithm. pp. 1–22.
Volume 8, Issue 1 - Serial Number 16
Open Access Statement
June 2025
Pages 93-119

  • Receive Date 15 October 2024
  • Revise Date 29 June 2025
  • Accept Date 15 July 2025