Investigation of Statistical Machine Learning Models for COVID-19 Epidemic Process Simulation: Random Forest, K-Nearest Neighbors, Gradient Boosting

Thumbnail Image

Date

2022-05-30

Authors

Chumachenko, Dmytro
Meniailov, Ievgen
Bazilevych, Kseniia
Chumachenko, Tetyana
Чумаченко, Тетяна Олександрівна
Чумаченко, Татьяна Александровна
Yakovlev, Sergey

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

COVID-19 has become the largest pandemic in recent history to sweep the world. This study is dev oted to developing and investigating three models of the COVID-19 epidemic process based on statistical machine learning and the evaluation of the results of their forecasting. The models developed are based on Random Forest, K-Nearest Neighbors, and Gradient Boosting methods. The models were studied for the adequacy and accuracy of predictive incidence for 3, 7, 10, 14, 21, and 30 days. The study used data on new cases of COVID-19 in Germany, Japan, South Korea, and Ukraine. These countries are selected because they have different dynamics of the COVID-19 epidemic process, and their governments have applied various control measures to contain the pandemic. The simulation results showed sufficient accuracy for practical use in the K-Nearest Neighbors and Gradient Boosting models. Public health agencies can use the models and their predictions to address various pandemic containment challenges. Such challenges are investigated depending on the duration of the constructed forecast.

Description

Keywords

epidemic model, epidemic process, machine learning, COVID-19, K-Nearest neighbors method, gradient boosting, random forest

Citation

Investigation of Statistical Machine Learning Models for COVID-19 Epidemic Process Simulation: Random Forest, K-Nearest Neighbors, Gradient Boosting / D. Chumachenko, I. Meniailov, K. Bazilevych, T. Chumachenko, S. Yakovlev // Computation. – 2022. – Vol. 10, issue 86, on 30 May 2022. – P. 1–22. – DOI: https://doi.org/10.3390/computation10060086.