Diabetes Detection Using Gradient Boosting Classifier (XGBOOST)
Keywords:
Diabetes, Machine, Learning, Prediction, DatasetAbstract
Diabetes results from elevated glucose levels in humans and should not be overlooked if left untreated, as it can lead to significant health issues, including heart complications, kidney disorders, hypertension, and eye damage, as well as impact other organs. Early detection of diabetes can help manage the condition effectively. To accomplish this, we aim to predict diabetes in individuals with high accuracy by utilizing various machine learning techniques. These techniques enhance prediction outcomes by developing models from patient data. In this research, we applied machine learning classification and ensemble methods to a dataset for diabetes prediction. The techniques used include K-Nearest Neighbor (KNN), Logistic Regression (LR), Support Vector Machine (SVM), Gradient Boosting (XGBOOST), LightGradientBoosting (LightGBM) and Random Forest (RF). Each model demonstrated varying levels of accuracy when compared to one another. This project identifies a model with superior accuracy, indicating its effectiveness in predicting diabetes. Our findings reveal that the Gradient Boosting Classifier (XGBOOST) method achieved greater accuracy than the other machine learning techniques
Downloads
References
Al-Zebari, A., & Sengur, A. (2019).” Performance Comparison of Machine Learning Techniques on Diabetes Disease Detection.” 1–4. https://doi.org/10.1109/ubmyk48245.2019.8965542
V. Jithendra, B. Jagadeesh, S. Kusuma, M. Madhusudhan, and R. M. Sai Mohit, “Diabetes Prediction using Machine Learning Techniques,” Journal of Artificial Intelligence and Capsule Networks, vol. 5, no. 2, pp. 190–206, Jun. 2023, doi: 10.36548/jaicn.2023.2.008.
A. Choudhury and D. Gupta, “A Survey on Medical Diagnosis of Diabetes Using Machine Learning Techniques,” springer singapore, 2018, pp. 67–78. doi: 10.1007/978-981-13-1280-9_6
S. Mishra, P. Chaudhury, B. K. Mishra, and H. K. Tripathy, “An implementation of Feature ranking using Machine learning techniques for Diabetes disease prediction,” Mar. 2016, vol. 2, pp. 1–3. doi: 10.1145/2905055.2905100.
M. J. Uddin et al., “A Comparison of Machine Learning Techniques for the Detection of Type-2 Diabetes Mellitus: Experiences from Bangladesh,” Information, vol. 14, no. 7, p. 376, Jul. 2023, doi: 10.3390/info14070376.
M. Phongying and S. Hiriote, “Diabetes Classification Using Machine Learning Techniques,” Computation, vol. 11, no. 5, p. 96, May 2023, doi: 10.3390/computation11050096.
M. A. Sarwar, M. A. Shah, N. Kamal, and W. Hamid, “Prediction of Diabetes Using Machine Learning Algorithms in Healthcare,” Sep. 2018, pp. 1–6. doi: 10.23919/iconac.2018.8748992.
A. Juneja, V. Kumar, S. Kaur, and S. Juneja, “Predicting Diabetes Mellitus With Machine Learning Techniques Using Multi-Criteria Decision Making,” International Journal of Information Retrieval Research, vol. 11, no. 2, pp. 38–52, Apr. 2021, doi: 10.4018/ijirr.2021040103.
D. Y. Shin, J. K. Hyun, B. Lee, J. W. Park, and W. S. Yoo, “Prediction of Diabetic Sensorimotor Polyneuropathy Using Machine Learning Techniques.,” Journal of Clinical Medicine, vol. 10, no. 19, p. 4576, Oct. 2021, doi: 10.3390/jcm10194576.
A. García-Domínguez et al., “Diabetes Detection Models in Mexican Patients by Combining Machine Learning Algorithms and Feature Selection Techniques for Clinical and Paraclinical Attributes: A Comparative Evaluation.,” Journal of Diabetes Research, vol. 2023, pp. 1–19, Jun. 2023, doi: 10.1155/2023/9713905.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.