Application of Machine Learning Techniques for Predicting Stroke Disease
DOI:
https://doi.org/10.21015/vtcs.v12i2.1906Abstract
Stroke is a cerebrovascular illness caused by a sudden halt in blood flow to the brain, resulting in neurological impairment. Stroke is a major public health problem worldwide, affecting millions of people. It is a significant source of illness and mortality, imposing a significant socio-economic burden. A thorough awareness of the current global situation is required for effective treatments and preventive actions. This research compares data mining techniques for the prediction of stroke illness. Using a dataset obtained from Mayo Hospital, Lahore, that had 2326 instances, each with 11 attributes, we compared the performance of Support Vector Machine (SVM), Random Forest, Neural Network, and K-Nearest Neighbors (KNN) approaches. Orange Data Mining Software was applied to evaluate the data and execute machine learning techniques. The results show that Naïve Bayes is the best method for predicting the prevalence of Stroke disease. The proposed model demonstrates an Area Under the Curve (AUC) of 88.3 %, an accuracy of 80.8%, and notable metrics including an F1-Score and precision.
References
World Health Organization. (2020, December 9). The top 10 causes of death. Retrieved June 4, 2023, from https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death
Warlow, C. P. (1998). Epidemiology of stroke. The Lancet, 352, S1-S4.
Mathers, C. D., Lopez, A. D., & b Murray, C. J. (2006). The burden of disease and mortality by condition: Data, methods, and results for 2001. In Global Burden of Disease and Risk Factors (Vol. 45). Oxford University Press: New York, NY, USA; The World Bank: Washington, DC, USA.
Mandeep Kaur, Sachin R. Sakhare, Kirti Wanjale, Farzana Akter, "Early Stroke Prediction Methods for Prevention of Strokes", Behavioural Neurology, vol. 2022, Article ID 7725597, 9 pages, 2022. https://doi.org/10.1155/2022/7725597
Xia, X., Yue, W., Chao, B., et al. (2019). Prevalence and risk factors of stroke in the elderly in Northern China: Data from the National Stroke Screening Survey. Journal of Neurology, 266(6), 1449–1458. https://doi.org/10.1007/s00415-019-09281-5
Lecouturier, J., Murtagh, M.J., Thomson, R.G. et al. Response to symptoms of stroke in the UK: a systematic review. BMC Health Serv Res 10, 157 (2010). https://doi.org/10.1186/1472-6963-10-157
Kaur, M., Sakhare, S. R., Wanjale, K., & Akter, F. (2022). Early stroke prediction methods for prevention of strokes. Behavioural Neurology, 2022, 7725597.
Roger, V. L., Go, A. S., Lloyd-Jones, D. M., Adams, R. J., Berry, J. D., Brown, T. M., ... et al. (2011). Heart disease and stroke statistics—2011 update: A report from the American Heart Association. Circulation, 123(4), e18–e209.
Katan, M., & Luft, A. (2018). Global burden of stroke. In Seminars in Neurology (Vol. 38, pp. 208–211). Thieme Medical Publishers: New York, NY, USA.
Pandian, J. D., Gall, S. L., Kate, M. P., Silva, G. S., Akinyemi, R. O., Ovbiagele, B. I., ... Thrift, A. G. (2018). Prevention of stroke: A global perspective. The Lancet, 392, 1269–1278.
Gibson, L., & Whiteley, W. (2013). The differential diagnosis of suspected stroke: A systematic review. Journal of the Royal College of Physicians of Edinburgh, 43, 114–118.
Huang, R., Liu, J., Wan, T. K., Siriwanna, D., Woo, Y. M. P., Vodencarevic, A., & Chan, K. H. K. (2023). Stroke mortality prediction based on ensemble learning and the combination of structured and textual data. Computers in Biology and Medicine, 155, 106176.
Thanka, M. R., Ram, K. S., Gandu, S. P., Edwin, E. B., Ebenezer, V., & Joy, P. (2023). Comparing resampling techniques in stroke prediction with machine and deep learning. In Proceedings of the 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS) (pp. 1415–1420). IEEE: Piscataway, NJ, USA.
Singh, M. S., Choudhary, P., & Thongam, K. (2020). A comparative analysis for various stroke prediction techniques. In Proceedings of the Computer Vision and Image Processing: 4th International Conference, CVIP 2019, Jaipur, India, 27–29 September 2019; Revised Selected Papers, Part II.
Wang, W., Chakraborty, G., & Chakraborty, B. (2020). Predicting the risk of chronic kidney disease (CKD) using machine learning algorithm. Applied Sciences, 11, 202.
Speiser, J. L., Karvellas, C. J., Wolf, B. J., Chung, D., Koch, D. G., & Durkalski, V. L. (2019). Predicting daily outcomes in acetaminophen-induced acute liver failure patients with machine learning techniques. Computer Methods and Programs in Biomedicine, 175, 111–120.
Li, X., Bian, D., Yu, J., Li, M., & Zhao, D. (2019). Using machine learning models to improve stroke risk level classification methods of China national stroke screening. BMC Medical Informatics and Decision Making, 19, 1–7.
Kwekha-Rashid, A. S., Abduljabbar, H. N., & Alhayani, B. (2021). Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Applied Nanoscience, 2021, 1–13.
Dev, S., Wang, H., Nwosu, C. S., Jain, N., Veeravalli, B., & John, D. (2022). A predictive analytics approach for stroke prediction using machine learning and neural networks. Healthcare Analytics, 2, 100032.
Cheon, S., Kim, J., & Lim, J. (2019). The use of deep learning to predict stroke patient mortality. International Journal of Environmental Research and Public Health, 16, 1876.
Wang, W., Chakraborty, G., & Chakraborty, B. (2020). Predicting the risk of chronic kidney disease (CKD) using machine learning algorithm. Applied Sciences, 11, 202.
Chin, C. L., Lin, B. J., Wu, G. R., Weng, T. C., Yang, C. S., Su, R. C., & Pan, Y. J. (2017). An automated early ischemic stroke detection system using CNN deep learning algorithm. In Proceedings of the 2017 IEEE 8th International Conference on Awareness Science and Technology (iCAST), Taichung, Taiwan, 8–10 November 2017.
Nwosu, C. S., Dev, S., Bhardwaj, P., Veeravalli, B., & John, D. (2019). Predicting stroke from electronic health records. In 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp. 5704–5707). IEEE.
Pathan, M. S., Jianbiao, Z., John, D., Nag, A., & Dev, S. (2020). Identifying stroke indicators using rough sets. IEEE Access, 8, 210318–210327.
Teoh, D. (2018). Towards stroke prediction using electronic health records. BMC Medical Informatics and Decision Making, 18(1), 1–11.
Singh, M. S., & Choudhary, P. (2017). Stroke prediction using artificial intelligence. In 2017 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON) (pp. 158–161). IEEE.
Shoily, T. I., Islam, T., Jannat, S., Tanna, S. A., Alif, T. M., & Ema, R. R. (2019). Detection of stroke disease using machine learning algorithms. In Proceedings of the 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Kanpur, India, 6–8 July 2019 (pp. 1–6). IEEE: Piscataway, NJ, USA.
Hung, C.-Y., Lin, C.-H., Lan, T.-H., Peng, G.-S., & Lee, C.-C. (2019). Development of an intelligent decision support system for ischemic stroke risk assessment in a population-based electronic health record database. PLoS One, 14(3), e0213007.
Dritsas, E., Alexiou, S., & Moustakas, K. (2022). Cardiovascular disease risk prediction with supervised machine learning techniques. In Proceedings of the 8th International Conference on Information and Communication Technologies for Ageing Well and e-Health—ICT4AWE, INSTICC, Online, 22–24 April 2022 (pp. 315–321). SciTePress: Setúbal, Portugal.
Maldonado, S., López, J., & Vairetti, C. (2019). An alternative SMOTE oversampling strategy for high-dimensional datasets. Applied Soft Computing, 76, 380–389.
J M, S.L., P, S. Unveiling the potential of machine learning approaches in predicting the emergence of stroke at its onset: a predicting framework. Sci Rep 14, 20053 (2024). https://doi.org/10.1038/s41598-024-70354-1
Peker, M., Özkaraca, O., & Sasar, A. (2018). Use of Orange Data Mining Toolbox for Data Analysis in Clinical Decision Making: The Diagnosis of Diabetes Disease. In Expert System Techniques in Biomedical Science Practice (pp. 143–167). Available online: [URL] (accessed on 3 February 2021).
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC-By) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
This work is licensed under a Creative Commons Attribution License CC BY