Background and Objectives: Diabetic patients are always at risk of hypertension. In this paper, the main goal was to design a native cost sensitive model for the diagnosis of hypertension among diabetics considering the prior probabilities.
Methods: In this paper, we tried to design a cost sensitive model for the diagnosis of hypertension in diabetic patients, considering the distribution of the disease in the general population. Among the data mining algorithms, Decision Tree, Artificial Neural Network, K-Nearest Neighbors, Support Vector Machine, and Logistic Regression were used. The data set belonged to Azarbayjan-e-Sharqi, Iran.
Results: For people with diabetes, a systolic blood pressure more than 130 mm Hg increased the risk of hypertension. In the non-cost-sensitive scenario, Youden's index was around 68%. On the other hand, in the cost-sensitive scenario, the highest Youden's index (47.11%) was for Neural Network. However, in the cost-sensitive scenario, the value of the imposed cost was important, and Decision Tree and Logistic Regression show better performances.
Conclusion: When diagnosing a disease, the cost of miss-classifications and also prior probabilities are the most important factors rather than only minimizing the error of classification on the data set.
Rights and permissions | |
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. |