✅ In this study the severity of COVID-19 disease in kidney patients was measured using variables including age, diabetes mellitus, blood pressure, CKD stage, etc. The results showed that high levels of kidney disease can lead to severe COVID-19.
The COVID-19 pandemic is a phenomenon that has plagued and killed many people worldwide (1). The transmission rate of the disease is so high that it has disrupted the normal lives of people. The disease occurs with various intensities in different people. Several studies have shown that underlying diseases such as diabetes, heart failure, ischemic stroke, and kidney failure can adversely affect COVID-19 severity (2-3). Studies have demonstrated that CKD is associated with severe COVID-19 (4). Various studies have examined the relationship between these two diseases.
Cheng examined the relationship between kidney disease and hospital deaths in COVID-19 patients. In a cohort study, he evaluated 701 kidney patients with COVID-19. The results showed that 16% of deaths occurred in the hospital (5). In a case report, Fu reported a patient with end-stage kidney disease (ESRD) who had been admitted to a hospital with COVID-19 infection. The study case was a 75-year-old man with ESRD who had contracted COVID-19. The results showed that patients with chronic kidney disease (CKD) were more likely to get more severe COVID-19 infections due to their weakened immune systems (6). Flythe examined the hospital mortality of dialysis patients with pneumonia due to COVID-19. The results showed that there is a shorter time between the onset of symptoms and ICU admission in dialysis patients. The mortality rate is very high in these patients, and they should take more care to avoid COVID-19 infection (7). Marco demonstrated that simultaneous underlying kidney disease and diabetes mellitus in patients infected with COVID-19 weaken the immune system and may pose higher risk of death (8).
Due to the widespread prevalence of COVID-19, assessing its severity in CKD patients is very important, and it may be useful in decreasing their mortality rate. This study aimed to predict COVID-19 infection severity in CKD patients by combining feature selection and classification methods.
This study was conducted between March 2021 and September 2021 in Isfahan University of medical sciences.
We used RapidMiner software Version 9.1 for statistical analysis.
A. Feature selection method
Due to the multiplicity of attributes, it was necessary to use attribute selection methods to reduce the number of attributes. The proposed method was the ant colony method (9). In this method, different combinations of traits are considered. There is a total of 2 to the power of 35 states. Each of the compounds is considered a node in the graph. Each node represents an ant with a pheromone content. The ants then begin to move and choose paths that are more efficient and less distant. Then the traits that receive less pheromone will be eliminated from the set of traits. By applying the ant colony algorithm, the number of selected traits was reduced from 35 to 18 traits, including age, diabetes mellitus, hypertension, insulin, ACEI/ ARB use, statin use, cough, shortness of breath, myalgia, nausea, fever, intubation, interval from onset of symptoms to recovery or death, hospital stay duration, duration of stay in ICU, antiviral, antibiotic, and corticosteroid use, CKD stage, and severity of COVID-19.
B - CHAID method
CHAID is a tree-based decision-making approach. In this method, data space is divided into different subcategories in different performances. This method starts from the root node and creates two or more nodes in each division. This algorithm uses chi-square statistical test and, based on this test, decides which subset should be used to identify child nodes (10). The algorithm values the values of Statistics are homogeneous or similar depending on the target variable and retain all heterogeneous values. Then the best attribute is selected to form the first branch in the decision tree in a way that each node is made up of a group of homogeneous values of the selected target variable. This reversal process continues until the tree is fully formed. The flowchart of the proposed method is presented in Figure 1.
Figure 1. Flowchart of the proposed method
Evaluation Measures
This study used test data sets to evaluate the methods. First, the complexity matrix, which includes various criteria, such as true positive (TP), true negative (TN), false positive, and false negative (FN), is calculated. The target is a three-objective class of mild, moderate, and severe. Thus, a 3×3 complexity matrix will be created. The rows correspond to each of the predicted values and the columns correspond to the actual values.
One of these criteria is accuracy (11); the closer the accuracy is to 1, the better the result. This criterion is calculated using the following formula:
Other criteria are sensitivity, specificity (12), and confidence (13); the closer they are to 1, the better the performance. The confidence value indicates how reliable this rule is. The higher the value, the more likely the head items will occur in a group (13). Sensitivity can determine the true positive rate, and specificity can determine the false positive rate.
Eighteen attributes were extracted. People were divided into five groups based on their age. The number of people aged between 56 and 70 was 126 and the number of those aged between 71 and 98 was 187. Two hundred and eleven people had diabetes mellitus, 95 people used insulin, and 89 people had CKD stage 4.
Table 1. Attributes, values, and count of kidney disease dataset
Value (count) | Attribute |
10–25 (8), 26–40 (23), 41–55 (58), 56–70 (126), 71–98 (187) | Age |
Yes (247), No (157) | Myalgia |
Yes (211), No (194) | Diabetes Mellitus |
Yes (216), No (187) | Hypertension |
Used (95), Not used (310) | Insulin |
Used (125), Not used (280) | ACEI/ARB |
Yes (127), No (276) | Statin |
Yes (197), No (205) | Cough |
Yes (229), No (174) | Shortness of breath |
Used (80), Not used (324) | Intubation |
Yes (76), No (328) | Nausea |
Yes (136), No (269) | Fever |
Yes (46), No (359) | Antiviral |
Yes (313), No (92) | Antibiotics |
Yes (275), No (130) | Corticosteroids |
One day to one week (80) One week two weeks (141) two weeks or more (182) |
Interval from onset of symptoms to recovery or death |
2 (11), 3a (43), 3b (69), 4 (89), 5 (4), 5nd (47) | CKD stage |
0 days (243), one day to one week (89), more than one week (64) | Duration of stay in ICU |
To investigate the effectiveness of the ant colony algorithm in reducing the number of traits from 35 to 18, the CHAID method was used once on the 18 traits and once on the 35 traits. Therefore, the proposed method is the combination of ant colony theory and CHAID. This combined method was compared to the CHAID method. The accuracy is shown in Table 2. The results of Table 2 show that the use of the ACO method improved the accuracy of the CHAID method by up to 17%.
Table 2. The accuracy of CHAID and CHAID+ACO.
CHAID+ACO | CHAID |
87.65% | 70.37% |
The sensitivity and specificity of the methods are listed in Table 3. The CHAID+ACO method detects 86% of severe cases of COVID-19 correctly. It was able to identify 92% of the moderate cases of COVID-19.
Table 3. The sensitivity and specificity of CHAID and CHAID+ACO.
CHAID+ACO | CHAID | |||
Specificity | Sensitivity | Specificity | Sensitivity | Class |
97.00% | 80.00% | 82.00% | 65.00% | Mild |
87.76% | 92.16% | 70.46% | 77.46% | Moderate |
94.00% | 86.93% | 78.24% | 69.33% | Severe |
A total of 22 rules were extracted, of which 6 rules with a confidence higher than 60% were introduced as the selected rules, which are listed in Table 4. These rules have been approved by kidney disease experts.
Rule | Condition | Result | Confidence |
Rule 1 | If the length of hospital stay in the ICU is more than 7 days, age is between 71 and 98, and the person has nausea |
Severe COVID-19 | 85% |
Rule 2 | If CKD stage is 3b and age is between 41 and 55 | Severe COVID-19 | 66% |
Rule 3 | If CKD stage is 5ND and the person has shortness of breath | Severe COVID-19 | 81% |
Rule 4 | If CKD stage is 2 | Moderate COVID-19 | 63% |
Rule 5 | If CKD stage is 3b and the person has diabetes mellitus | Moderate COVID-19 | 66% |
Rule 6 | If the interval from the onset of symptoms to recovery or death is two weeks and the person has shortness of breath |
Mild COVID-19 | 68% |
Discussion
The recent transition from traditional medicine to evidence-based medicine has highlighted the need for data mining (14). One of the issues with COVID-19 is the effect of underlying diseases on its severity. Research has shown that underlying diseases such as kidney disease or diabetes mellitus can exacerbate COVID-19 or even lead to death (7). Therefore, this group of patients have a higher priority over other patients in receiving vaccines (15). In the rules extracted in this study, the effect of kidney disease level on the severity of COVID-19 is evident. If a person has 5ND kidney disease level and also has shortness of breath, in 81% of cases, COVID-19 will be severe. Also, if the CKD stage is 3b and the age is between 41 and 55 years old, the type of COVID-19 disease will be severe in 66% of cases. Ronco stated that kidney involvement in COVID-19 is high and will lead to hospital admission in 40% of cases (16). Also, if the CKD stage is 2, COVID-19 severity will be moderate in 63% of cases. In fact, the lowest level of kidney disease corresponds to moderate COVID-19 severity. This highlights the importance of monitoring CKD patients and their health during the COVID-19 pandemic. Thus, these patients may deal with more serious COVID-19 infections even if their CKD disease level is not very high. Kapanova stated that chronic kidney disease has negative impact on COVID-19 (17). Note that in another rule, even non-diabetes mellitus patients with CKD stage 3b experience moderate COVID-19 severity in 66% of cases.
According to the study findings, the use of ant colony improves the performance of the CHAID method. Therefore, it can be said that basing prediction on some traits does not improve the performance of the method and may even reduce the accuracy of the method. In fact, the ant colony theory method can prevent the destructive effects of confounding variables and improve the performance of the proposed method.
Conclusion
In this study, the COVID-19 severity in CKD patients was measured using various variables including age, diabetes mellitus, hypertension, CKD stage, etc. The results showed that ACO improved the performance of the CHAID method in predicting disease severity in CKD patients with COVID-19. Thus, this combination can be considered in future machine learning studies. Moreover, kidney disease can lead to more severe COVID-19 infections.
Measures should be taken to monitor the health of these patients. Moreover, the importance of kidney disease in prioritizing these patients in hospitals across the country should be considered.
Acknowledgements
None.
Conflicts of Interest
The authors declare that they have no conflict of interest.
Rights and permissions | |
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. |