Published 2021-12-15
license
Article

Big Data technology in the analysis of the state of the covid-19 pandemic in Colombia

DOI: https://doi.org/10.22490/25394088.5612
Jorge Luis Quintero López
Andrés Arismendi Ramírez
Ángela Liceth Pérez Rendón

At the present time of the pandemic, there is a need to process large volumes of information generated by reported positive cases, in order to identify patterns that lead to facing the emergency with timely contingency measures. In the present study, the treatment of a data set of the general population of Colombia is proposed, with information from the month of March and April 2021, in order to characterize, georeference and predict to give value to the data, in search of an understanding of the dynamics of the virus, for which three Naive Bayes, Random Forest and J-48 tree models were used, seeking to identify the virus with greater precision; When using the Weka application, it is concluded that the model that best fits the prediction is the J-48 tree classification algorithm with a classification level of correct instances of 99.24%, with a Kappa value of 0.9266 reporting that there is close to 100% concordance in class classification, with an amount, for this case, of study of 221,583 classes and the prediction with 30 classes taken from the original base consisting of approximately 2,774,465 data. By applying statistical tests, it is possible to identify the correlation between the attributes, which leads to guaranteeing the correct modeling for the prediction. This process becomes a potential input to support the management processes of society and that benefits the decisions that are made in terms of public health.

keywords: prediction, machine learning, Sars-Cov-2, quarantine
license

Copyright (c) 2022 Publicaciones e Investigación

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

When the Publicaciones e Investigaciones Journal receives an original study or article from its author(s), whether by email, postal service, or the platforms available for said purpose, know that it may be published in physical or electronic formats in national or international archives, databases, or SIRES. As such, Publications and Research authorizes the reproduction and citation of said material, provided that the description of information is carried out in conformity with bibliographic norms, and mention the corresponding names, authors, article, issue, and pages. Publications and Research, in advance, expresses that the information, concepts, and methods are the responsibility of the author(s). As such, the UNAD does not have any influence whatsoever over that expressed in the manuscript.

How to Cite
Quintero López, J. L. ., Arismendi Ramírez, A. ., & Pérez Rendón, Ángela L. . (2021). Big Data technology in the analysis of the state of the covid-19 pandemic in Colombia. Publicaciones E Investigación, 15(4). https://doi.org/10.22490/25394088.5612
Almétricas
Metrics
File downloads
153
Jan 2022Jul 2022Jan 2023Jul 2023Jan 2024Jul 2024Jan 2025Jul 2025Jan 20267.0
|

PRIVACY STATEMENT: In accordance with the Personal Data Protection Law (Law 1581 of 2012), the names and email addresses managed by Publicaciones e Investigación will be used exclusively for the purposes stated by this journal and will not be made available for any other purpose or to any other individual. Manuscripts submitted to the publication are only accessible to the editorial team and external peer reviewers. 

Design and implemented by