Comparison of machine learning methods for a diabetes prediction information system

Date
2021
Authors
Shmatko, O. V.
Korol, O.
Tkachov, A.
Otenko, V.
Шматко, О. В.
Publisher
CEUR Workshop Proceedings
Abstract
Diabetes is a disease for which there is no permanent cure; therefore, methods and information systems are required for its early detection. This paper proposes an information system for predicting diabetes based on data mining methods and machine learning (ML) algorithms. The paper discusses several machine learning methods: decision trees (DT), logistic regression (LR), and k-Nearest Neighbors (k-NN). For our research, we used the Pima Indian Diabetes (PID) dataset obtained from the UCI machine learning repository. The dataset contains information about 768 patients, each described by nine attributes. We investigated whether the Recursive Feature Elimination method improves prediction quality. We found that the logistic regression (LR) model performed well in predicting diabetes. We have shown that, for the created model to predict the likelihood of diabetes mellitus with an accuracy of 78%, it is necessary and sufficient to use the following indicators of the patient's health status: the number of pregnancies, the plasma glucose concentration measured in the oral glucose tolerance test, the body mass index (BMI), and the value of the heredity function "DiabetesPedigreeFunction".
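The feature-selection step described in the abstract can be illustrated with a minimal, dependency-free sketch of recursive feature elimination around a logistic regression model. This is not the authors' implementation: the data are synthetic stand-ins (the real PID records are not reproduced here), the label-generating weights, learning rate, epoch count, and train/hold-out split are assumptions, and the helper names `make_synthetic`, `fit_logreg`, `accuracy`, and `rfe` are hypothetical. Only the column names follow the PID dataset.

```python
import math
import random

# Column names of the Pima Indians Diabetes dataset (Outcome excluded).
FEATURES = ["Pregnancies", "Glucose", "BloodPressure", "SkinThickness",
            "Insulin", "BMI", "DiabetesPedigreeFunction", "Age"]


def make_synthetic(n=300, seed=0):
    """Synthetic stand-in data (NOT the real PID records): standardized
    features, with the label driven by an assumed set of four columns."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        row = [rng.gauss(0.0, 1.0) for _ in FEATURES]
        # Assumed signal: Glucose, BMI, Pregnancies, DiabetesPedigreeFunction.
        logit = 1.5 * row[1] + 1.0 * row[5] + 0.5 * row[0] + 0.5 * row[6]
        p = 1.0 / (1.0 + math.exp(-logit))
        X.append(row)
        y.append(1 if rng.random() < p else 0)
    return X, y


def sigmoid(z):
    # Branch on the sign of z to avoid overflow in exp().
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logreg(X, y, cols, lr=0.1, epochs=200):
    """Fit logistic regression on the given columns by batch gradient descent."""
    w, b, n = [0.0] * len(cols), 0.0, len(X)
    for _ in range(epochs):
        gw, gb = [0.0] * len(cols), 0.0
        for row, target in zip(X, y):
            z = b + sum(wj * row[c] for wj, c in zip(w, cols))
            err = sigmoid(z) - target
            for j, c in enumerate(cols):
                gw[j] += err * row[c]
            gb += err
        w = [wj - lr * g / n for wj, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b


def accuracy(X, y, cols, w, b):
    """Share of rows whose predicted class (threshold 0.5) matches the label."""
    correct = 0
    for row, target in zip(X, y):
        z = b + sum(wj * row[c] for wj, c in zip(w, cols))
        correct += int((z >= 0) == (target == 1))
    return correct / len(X)


def rfe(X, y, n_keep):
    """Recursive feature elimination: refit, drop the feature with the
    smallest absolute coefficient, repeat until n_keep features remain."""
    cols = list(range(len(FEATURES)))
    while len(cols) > n_keep:
        w, _ = fit_logreg(X, y, cols)
        weakest = min(range(len(cols)), key=lambda j: abs(w[j]))
        del cols[weakest]
    return cols


X, y = make_synthetic()
X_train, y_train = X[:225], y[:225]   # assumed 75/25 split
X_test, y_test = X[225:], y[225:]

kept = rfe(X_train, y_train, n_keep=4)
w, b = fit_logreg(X_train, y_train, kept)
selected = [FEATURES[c] for c in kept]
test_acc = accuracy(X_test, y_test, kept, w, b)

print("selected features:", selected)
print("hold-out accuracy:", round(test_acc, 3))
```

Ranking features by absolute coefficient is only meaningful when the inputs share a common scale, which is why the synthetic features are generated standardized; on the real dataset the columns would need to be scaled first. Any accuracy printed by this sketch reflects the synthetic data and says nothing about the 78% figure reported in the abstract.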
Keywords
machine learning, data mining, neural network, diabetes prediction information system, knn, logistic regression, decision tree
Bibliographic description
Shmatko, O., Korol, O., Tkachov, A., & Otenko, V. (2021). Comparison of machine learning methods for a diabetes prediction information system. Intellectual Systems and Information Technologies (ISIT 2021) : short Paper Proceedings of the 2nd International Conference. CEUR Workshop Proceedings, 3126, 192–197.
Shmatko O., Korol O., Tkachov A., Otenko V. Comparison of machine learning methods for a diabetes prediction information system. Intellectual Systems and Information Technologies (ISIT 2021) : short Paper Proceedings of the 2nd International Conference. CEUR Workshop Proceedings. 2021. Vol. 3126. P. 192–197.