Assessing the Nationwide COVID-19 Risk in Mexico through the Lens of Comorbidity by an XGBoost-Based Logistic Regression Model
Article
Overview
abstract
The outbreak of COVID-19 is a serious health problem that has affected a large part of the world population, especially older adults and people with a pre-existing comorbidity. In this work, we propose a classifier model that decides whether or not a patient is likely to have COVID-19, based on spatio-temporal variables, physical characteristics of the patient and the presence of previous diseases. We used XGBoost to maximize the likelihood function of the multivariate logistic regression model. The estimated and observed percentages of case occurrence were very similar, indicating that the proposed model is suitable for predicting new cases (AUC = (Formula presented.)). The main results revealed that patients without comorbidities are less likely to be COVID-19 positive than people with diabetes, obesity or pneumonia. The distribution function by age group showed that, during the first and second waves of COVID-19, young people aged (Formula presented.) were the least affected by the pandemic, while the most affected were people between 20 and 40 years, followed by adults older than 40 years. In the third and fourth waves, the risk increased for young individuals (under 20 years), while the chance of infection decreased for adults over 40 years. Estimates of positive COVID-19 cases from both the XGBoost-LR model and the multivariate logistic regression model were used to create maps of the spatial distribution of positive cases across the country. Spatial analysis was carried out to identify, from the data, the main geographical areas where the greatest number of positive cases occurred. The results showed that the areas most affected by COVID-19 were in the central and northern regions of Mexico. © 2022 by the authors.
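The core idea in the abstract, fitting a logistic regression by maximizing its likelihood and judging it by AUC, can be illustrated with a minimal sketch. This is not the authors' XGBoost-LR implementation: it uses plain gradient ascent from the Python standard library in place of XGBoost (whose `binary:logistic` objective optimizes the same log-likelihood), and the data, the three comorbidity indicators chosen as features, the "true" coefficients, and all function names are hypothetical assumptions made only for illustration.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def log_likelihood(weights, rows, labels):
    # Bernoulli log-likelihood of a logistic regression model.
    total = 0.0
    for x, y in zip(rows, labels):
        p = sigmoid(sum(w * v for w, v in zip(weights, x)))
        p = min(max(p, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total += y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total


def fit_logistic(rows, labels, lr=0.5, steps=300):
    # Gradient ascent on the mean log-likelihood (stand-in for XGBoost).
    weights = [0.0] * len(rows[0])
    n = len(rows)
    for _ in range(steps):
        grad = [0.0] * len(weights)
        for x, y in zip(rows, labels):
            err = y - sigmoid(sum(w * v for w, v in zip(weights, x)))
            for j, v in enumerate(x):
                grad[j] += err * v
        weights = [w + lr * g / n for w, g in zip(weights, grad)]
    return weights


def auc(scores, labels):
    # AUC as the probability that a positive outranks a negative (ties = 0.5).
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def make_synthetic(n=400, seed=0):
    # Hypothetical data: intercept plus diabetes, obesity, pneumonia flags.
    rng = random.Random(seed)
    true_w = [-1.0, 0.8, 0.6, 1.5]  # assumed coefficients, not from the paper
    rows, labels = [], []
    for _ in range(n):
        x = [1.0] + [float(rng.random() < 0.3) for _ in range(3)]
        p = sigmoid(sum(w * v for w, v in zip(true_w, x)))
        rows.append(x)
        labels.append(1 if rng.random() < p else 0)
    return rows, labels


rows, labels = make_synthetic()
weights = fit_logistic(rows, labels)
scores = [sigmoid(sum(w * v for w, v in zip(weights, x))) for x in rows]
model_auc = auc(scores, labels)
print("fitted weights:", [round(w, 3) for w in weights])
print("in-sample AUC:", round(model_auc, 3))
```

Positive fitted coefficients for the comorbidity flags correspond to a higher estimated probability of a positive case, which is the kind of comparison the abstract draws between patients with and without comorbidities. The AUC here is computed in-sample on synthetic data and says nothing about the value reported in the paper.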
Research
keywords
comorbidity; coronavirus; logistic regression; ROC curve; spatial analysis; COVID-19; diabetes; obesity; pneumonia; spatial distribution; adult; age distribution; algorithm; Article; clinical feature; coronavirus disease 2019; diabetes mellitus; female; geographic distribution; human; infection risk; logistic regression analysis; major clinical study; male; mathematical computing; mathematical model; Mexico; population distribution; risk assessment; spatiotemporal analysis; adolescent; aged; epidemiology; pandemic; statistical model; Federal District [Mexico]; Mexico City; Mexico [North America]; Humans; Logistic Models; Pandemics