A Novel Application of K-means Cluster Prediction Model for Diabetes Early  Identification  using Dimensionality Reduction Techniques

Krishna B., Vamshi; K., Raguru Jaya; A. P., Bhuvaneswari; H. L., Gururaj; Ravi, Vinayakumar; Almeshari, Meshari; Alzamil, Yasser

RESEARCH ARTICLE

A Novel Application of K-means Cluster Prediction Model for Diabetes Early Identification using Dimensionality Reduction Techniques

Vamshi Krishna B.¹ Raguru Jaya K.¹ Bhuvaneswari A. P.² Gururaj H. L.³^{, *} Vinayakumar Ravi⁴^{, *} Meshari Almeshari⁵ Yasser Alzamil⁵ Authors Info & Affiliations

The Open Bioinformatics Journal • 31 Aug 2023 • RESEARCH ARTICLE • DOI: 10.2174/18750362-v16-230825-2023-18

Purpose:

Diabetes is a condition where the body cannot utilize insulin properly. Maintenance of the levels of insulin in the body is mandatory, otherwise it will lead to several disorders of kidney failure, heart attack, nervous weakness, blindness, etc. Among the 10 majority diseases, diabetes is occupying the second role by covering 34.2 million individuals as for the National Diabetes Statistics report. According to the World Health Organization, diabetes is playing the 7th role in cause of death. Thus early identification of diabetes can overcome these severe damages.

Methods:

Accurate predictions require a lot of data, which is introducing the curse of dimensionality. In the present research, PIMA Indians diabetes data set is considered and different classification models viz., K-means clustering with logistic regression, SVM (Support Vector Machine), Random Forest, etc. are implemented in predicting the accuracy of diabetes.

Results:

The accuracies for diabetes prediction are ranging from 0.9875 to 1.0. KCPM (K-means cluster prediction model) and has shown an increase in accuracy of 0.67% for the combined K -means clustering and different classification algorithms. In KCPM, firstly, the data is clustered using k-means into patients with and without diabetes, and then the clustered results are compared with the target variable and then filtered, followed by applying the different supervised classification algorithms for predicting the disease.

Conclusion:

The results show that KCPM predicts diabetes with a higher accuracy of 0.67% compared with other existing methods. By KCPM-based automated diabetes analysis system, early prediction of the disease may protect patients from facing severe disorders in life.

Keywords: Clustering, Classification, Curse of dimensionality, Diabetes, Prediction, Classifiers, Accuracy.

Fulltext HTML PDF ePub

A Novel Application of K-means Cluster Prediction Model for Diabetes Early Identification using Dimensionality Reduction Techniques

Abstract

Purpose:

Methods:

Results:

Conclusion:

Bentham Is Proud To Announce Collaboration With Elsevier

Three Journals Receive Impact Factors

The Nursing Journal Directory Indexes Bentham Journal, The Open Public Health Journal

Follow Us

Authors & Information

Authors

Affiliations

Information

Published In

Article Information

Cite As

Article History

Copyright

ACKNOWLEDGEMENTS

Download

Download1

Download

Citations & Metrics

Citations

Cite As

Export Citation

Metrics

Article Usage (Last 30 Days)

Article Usage (Demographic)

Copyright & License

Copyright & License

© 2023 Krishna B .et al

Media

Figures

Tables

Abstract

Purpose:

Methods:

Results:

Conclusion:

Bentham Is Proud To Announce Collaboration With Elsevier

Three Journals Receive Impact Factors

The Nursing Journal Directory Indexes Bentham Journal, The Open Public Health Journal

Authors

Affiliations

Information

Published In

Article Information

Cite As

Article History

Copyright

ACKNOWLEDGEMENTS

Download1

Download

Citations

Cite As

Export Citation

Metrics

Article Usage (Last 30 Days)

Article Usage (Demographic)

Copyright & License

© 2023 Krishna B .et al

Figures

Share

Share article link

Share on social media