Implementation of Multiple Linear Regression for Predicting Time Series Data in Infectious Diseases Using a Machine Learning Approach

  • Sri Ngudi Wahyuni Universitas Amikom Yogyakarta
Keywords: Multiple Regression Linier, Machine Learning, Prediksi, Runtun waktu


Prediction modeling is one way to get prediction results that are close to their true values. Prediction and machine learning have a relationship in the process-relational approach, where it is used to improve processes, data quality, and model quality. This study aims to implement a Multiple Linear Regression (MLR) model to predict time series data, especially COVID-19 infectious diseases in Indonesia using a Machine Learning approach. This research data was taken from March 2, 2020, to November 8, 2020,  and updated by the National Disaster Management Agency (BNPB). The predictive analysis uses parameters of the number of new cases, the number of recovered patients, and the number of deaths. The prediction is carried out over the next 4 days to see the short-term trend of adding new data on COVID-19 patients in Indonesia. The test results show that the R2 value in the MLR model is close to 100%, which is 4.161E+12. So that the Mean Square Error (MAE) value of the MLR model is 1,386E+12 so the MLR accuracy value is 4.1% and the accuracy value is 95.9%.


[1] S. Kumari, D. Kumar, and M. Mittal, “An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier,” International Journal of Cognitive Computing in Engineering, vol. 2, pp. 40–46, Jun. 2021, doi: 10.1016/J.IJCCE.2021.01.001.
[2] P. Wang, “Time series prediction for the epidemic trends of COVID-19 using the improved LSTM deep learning method: Case studies in Russia, Peru and Iran,” Chaos Solitons Fractals, vol. 140, 2020, doi: 10.1016/j.chaos.2020.110214.
[3] S. Ketu and P. K. Mishra, “Enhanced Gaussian process regression-based forecasting model for COVID-19 outbreak and significance of IoT for its detection,” Applied Intelligence, vol. 51, no. 3, pp. 1492–1512, Mar. 2021, doi: 10.1007/s10489-020-01889-9.
[4] Z. Malki, E. Atlam, A. Hassanien, G. D.- Chaos, & S., and undefined 2020, “Association between weather data and COVID-19 pandemic predicting mortality rate: Machine learning approaches,” Elsevier, Accessed: Jul. 06, 2022. [Online]. Available:
[5] S. N. Wahyuni, E. Sediyono, and I. Sembiring, “Indonesian Covid-19 Future Forecasting Based on Machine Learning Approach,” in 2021 3rd International Conference on Electronics Representation and Algorithm (ICERA), IEEE, 2021, pp. 104–108.
[6] S. N. Wahyuni, E. Sediono, I. Sembiring, and N. N. Khanom, “Comparative analysis of time series prediction model for forecasting COVID-19 trend,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 28, no. 1, pp. 600–610, Oct. 2022, doi: 10.11591/ijeecs.v28.i1.pp600-610.
[7] R. Davuluri and R. Rengaswamy, “A survey of different machine learning models for alzheimer disease prediction,” International Journal of Emerging Trends in Engineering Research, vol. 8, no. 7, pp. 3328–3337, Jul. 2020, doi: 10.30534/ijeter/2020/73872020.
[8] G. Battineni, G. G. Sagaro, N. Chinatalapudi, and F. Amenta, “Applications of machine learning predictive models in the chronic disease diagnosis,” Journal of Personalized Medicine, vol. 10, no. 2. MDPI AG, Jun. 01, 2020. doi: 10.3390/jpm10020021.
[9] A. U. Haq, J. P. Li, M. H. Memon, S. Nazir, R. Sun, and I. Garciá-Magarinõ, “A hybrid intelligent system framework for the prediction of heart disease using machine learning algorithms,” Mobile Information Systems, vol. 2018, 2018, doi: 10.1155/2018/3860146.
[10] R. K. Mojjada, A. Yadav, A. V. Prabhu, and Y. Natarajan, “Machine learning models for covid-19 future forecasting,” Mater Today Proc, Dec. 2020, doi: 10.1016/j.matpr.2020.10.962.
[11] K. Sujatha, K. V. Krishna Kishore, and B. S. Rao, “Machine learning models for forecasting confirmed, recovered and deceased COVID-19 cases in India,” International Journal of Control and Automation, vol. 13, no. 4, 2020.
[12] M. Nemati, J. Ansary, and N. Nemati, “Machine-Learning Approaches in COVID-19 Survival Analysis and Discharge-Time Likelihood Prediction Using Clinical Data,” Patterns, vol. 1, no. 5. Elsevier BV, p. 100074, 2020. doi: 10.1016/j.patter.2020.100074.
[13] F. W. Wibowo, “Prediction Modelling of COVID-19 Outbreak in Indonesia using a Logistic Regression Model,” vol. 2019.
[14] B. Bera, S. Bhattacharjee, N. Sengupta, and S. Saha, “PM2.5 concentration prediction during COVID-19 lockdown over Kolkata metropolitan city, India using MLR and ANN models,” Environmental Challenges, vol. 4, Aug. 2021, doi: 10.1016/j.envc.2021.100155.
[15] C. Zhu, J. Wang, and J. Tian, “Grain Consumption Forecasting: One Modified MLR Model Combined with Time Series Forecasting Theory,” Int J Math Math Sci, vol. 2020, 2020, doi: 10.1155/2020/3956717.
[16] E. Bas, E. Egrioglu, U. Yolcu, and C. Grosan, “Type 1 fuzzy function approach based on ridge regression for forecasting,” Granular Computing, vol. 4, no. 4, 2019, doi: 10.1007/s41066-018-0115-4.
[17] M. S.-A. Intelligence and undefined 2021, “Forecasting COVID-19 outbreak progression using hybrid polynomial-Bayesian ridge regression model,” Springer, Accessed: Jul. 06, 2022. [Online]. Available:
[18] Z. Y. Algamal, M. R. Abonazel, and A. F. Lukman, “Modified Jackknife Ridge Estimator for Beta Regression Model With Application to Chemical Data,” International Journal of Mathematics, Statistics, and Computer Science, vol. 1, pp. 15–24, Feb. 2023, doi: 10.59543/ijmscs.v1i.7713.
[19] M. Saqib, “Forecasting COVID-19 outbreak progression using hybrid polynomial-Bayesian ridge regression model,” Applied Intelligence, vol. 51, no. 5, pp. 2703–2713, May 2021, doi: 10.1007/S10489-020-01942-7.
[20] D. Chicco, M. J. Warrens, and G. Jurman, “The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation,” PeerJ Comput Sci, vol. 7, pp. 1–24, 2021, doi: 10.7717/PEERJ-CS.623.
[21] F. Rustam et al., “COVID-19 Future Forecasting Using Supervised Machine Learning Models,” IEEE Access, vol. 8, pp. 101489–101499, 2020, doi: 10.1109/ACCESS.2020.2997311.