Application of Machine Learning Techniques for Predicting Stroke Disease

Authors

DOI:

https://doi.org/10.21015/vtcs.v12i2.1906

Abstract

Stroke is a cerebrovascular illness caused by a sudden halt in blood flow to the brain, resulting in neurological impairment. Stroke is a major public health problem worldwide, affecting millions of people. It is a significant source of illness and mortality, imposing a significant socio-economic burden. A thorough awareness of the current global situation is required for effective treatments and preventive actions. This research compares data mining techniques for the prediction of stroke illness. Using a dataset obtained from Mayo Hospital, Lahore, that had 2326 instances, each with 11 attributes, we compared the performance of Support Vector Machine (SVM), Random Forest, Neural Network, and K-Nearest Neighbors (KNN) approaches. Orange Data Mining Software was applied to evaluate the data and execute machine learning techniques. The results show that Naïve Bayes is the best method for predicting the prevalence of Stroke disease. The proposed model demonstrates an Area Under the Curve (AUC) of 88.3 %, an accuracy of 80.8%, and notable metrics including an F1-Score and precision.  

References

World Health Organization. (2020, December 9). The top 10 causes of death. Retrieved June 4, 2023, from https://www.who.int/news-room/fact-sheets/detail/the-top-10-causes-of-death

Warlow, C. P. (1998). Epidemiology of stroke. The Lancet, 352, S1-S4.

Mathers, C. D., Lopez, A. D., & b Murray, C. J. (2006). The burden of disease and mortality by condition: Data, methods, and results for 2001. In Global Burden of Disease and Risk Factors (Vol. 45). Oxford University Press: New York, NY, USA; The World Bank: Washington, DC, USA.

Mandeep Kaur, Sachin R. Sakhare, Kirti Wanjale, Farzana Akter, "Early Stroke Prediction Methods for Prevention of Strokes", Behavioural Neurology, vol. 2022, Article ID 7725597, 9 pages, 2022. https://doi.org/10.1155/2022/7725597

Xia, X., Yue, W., Chao, B., et al. (2019). Prevalence and risk factors of stroke in the elderly in Northern China: Data from the National Stroke Screening Survey. Journal of Neurology, 266(6), 1449–1458. https://doi.org/10.1007/s00415-019-09281-5

Lecouturier, J., Murtagh, M.J., Thomson, R.G. et al. Response to symptoms of stroke in the UK: a systematic review. BMC Health Serv Res 10, 157 (2010). https://doi.org/10.1186/1472-6963-10-157

Kaur, M., Sakhare, S. R., Wanjale, K., & Akter, F. (2022). Early stroke prediction methods for prevention of strokes. Behavioural Neurology, 2022, 7725597.

Roger, V. L., Go, A. S., Lloyd-Jones, D. M., Adams, R. J., Berry, J. D., Brown, T. M., ... et al. (2011). Heart disease and stroke statistics—2011 update: A report from the American Heart Association. Circulation, 123(4), e18–e209.

Katan, M., & Luft, A. (2018). Global burden of stroke. In Seminars in Neurology (Vol. 38, pp. 208–211). Thieme Medical Publishers: New York, NY, USA.

Pandian, J. D., Gall, S. L., Kate, M. P., Silva, G. S., Akinyemi, R. O., Ovbiagele, B. I., ... Thrift, A. G. (2018). Prevention of stroke: A global perspective. The Lancet, 392, 1269–1278.

Gibson, L., & Whiteley, W. (2013). The differential diagnosis of suspected stroke: A systematic review. Journal of the Royal College of Physicians of Edinburgh, 43, 114–118.

Huang, R., Liu, J., Wan, T. K., Siriwanna, D., Woo, Y. M. P., Vodencarevic, A., & Chan, K. H. K. (2023). Stroke mortality prediction based on ensemble learning and the combination of structured and textual data. Computers in Biology and Medicine, 155, 106176.

Thanka, M. R., Ram, K. S., Gandu, S. P., Edwin, E. B., Ebenezer, V., & Joy, P. (2023). Comparing resampling techniques in stroke prediction with machine and deep learning. In Proceedings of the 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS) (pp. 1415–1420). IEEE: Piscataway, NJ, USA.

Singh, M. S., Choudhary, P., & Thongam, K. (2020). A comparative analysis for various stroke prediction techniques. In Proceedings of the Computer Vision and Image Processing: 4th International Conference, CVIP 2019, Jaipur, India, 27–29 September 2019; Revised Selected Papers, Part II.

Wang, W., Chakraborty, G., & Chakraborty, B. (2020). Predicting the risk of chronic kidney disease (CKD) using machine learning algorithm. Applied Sciences, 11, 202.

Speiser, J. L., Karvellas, C. J., Wolf, B. J., Chung, D., Koch, D. G., & Durkalski, V. L. (2019). Predicting daily outcomes in acetaminophen-induced acute liver failure patients with machine learning techniques. Computer Methods and Programs in Biomedicine, 175, 111–120.

Li, X., Bian, D., Yu, J., Li, M., & Zhao, D. (2019). Using machine learning models to improve stroke risk level classification methods of China national stroke screening. BMC Medical Informatics and Decision Making, 19, 1–7.

Kwekha-Rashid, A. S., Abduljabbar, H. N., & Alhayani, B. (2021). Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Applied Nanoscience, 2021, 1–13.

Dev, S., Wang, H., Nwosu, C. S., Jain, N., Veeravalli, B., & John, D. (2022). A predictive analytics approach for stroke prediction using machine learning and neural networks. Healthcare Analytics, 2, 100032.

Cheon, S., Kim, J., & Lim, J. (2019). The use of deep learning to predict stroke patient mortality. International Journal of Environmental Research and Public Health, 16, 1876.

Wang, W., Chakraborty, G., & Chakraborty, B. (2020). Predicting the risk of chronic kidney disease (CKD) using machine learning algorithm. Applied Sciences, 11, 202.

Chin, C. L., Lin, B. J., Wu, G. R., Weng, T. C., Yang, C. S., Su, R. C., & Pan, Y. J. (2017). An automated early ischemic stroke detection system using CNN deep learning algorithm. In Proceedings of the 2017 IEEE 8th International Conference on Awareness Science and Technology (iCAST), Taichung, Taiwan, 8–10 November 2017.

Nwosu, C. S., Dev, S., Bhardwaj, P., Veeravalli, B., & John, D. (2019). Predicting stroke from electronic health records. In 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp. 5704–5707). IEEE.

Pathan, M. S., Jianbiao, Z., John, D., Nag, A., & Dev, S. (2020). Identifying stroke indicators using rough sets. IEEE Access, 8, 210318–210327.

Teoh, D. (2018). Towards stroke prediction using electronic health records. BMC Medical Informatics and Decision Making, 18(1), 1–11.

Singh, M. S., & Choudhary, P. (2017). Stroke prediction using artificial intelligence. In 2017 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON) (pp. 158–161). IEEE.

Shoily, T. I., Islam, T., Jannat, S., Tanna, S. A., Alif, T. M., & Ema, R. R. (2019). Detection of stroke disease using machine learning algorithms. In Proceedings of the 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Kanpur, India, 6–8 July 2019 (pp. 1–6). IEEE: Piscataway, NJ, USA.

Hung, C.-Y., Lin, C.-H., Lan, T.-H., Peng, G.-S., & Lee, C.-C. (2019). Development of an intelligent decision support system for ischemic stroke risk assessment in a population-based electronic health record database. PLoS One, 14(3), e0213007.

Dritsas, E., Alexiou, S., & Moustakas, K. (2022). Cardiovascular disease risk prediction with supervised machine learning techniques. In Proceedings of the 8th International Conference on Information and Communication Technologies for Ageing Well and e-Health—ICT4AWE, INSTICC, Online, 22–24 April 2022 (pp. 315–321). SciTePress: Setúbal, Portugal.

Maldonado, S., López, J., & Vairetti, C. (2019). An alternative SMOTE oversampling strategy for high-dimensional datasets. Applied Soft Computing, 76, 380–389.

J M, S.L., P, S. Unveiling the potential of machine learning approaches in predicting the emergence of stroke at its onset: a predicting framework. Sci Rep 14, 20053 (2024). https://doi.org/10.1038/s41598-024-70354-1

Peker, M., Özkaraca, O., & Sasar, A. (2018). Use of Orange Data Mining Toolbox for Data Analysis in Clinical Decision Making: The Diagnosis of Diabetes Disease. In Expert System Techniques in Biomedical Science Practice (pp. 143–167). Available online: [URL] (accessed on 3 February 2021).

Downloads

Published

2024-11-19

How to Cite

Rafiq, M. Y., Nazeer, A., & Gilani, A. (2024). Application of Machine Learning Techniques for Predicting Stroke Disease. VAWKUM Transactions on Computer Sciences, 12(2), 123–136. https://doi.org/10.21015/vtcs.v12i2.1906