An ensembling approach to predict hepatitis in patients with liver disease using machine learning
DOI:
https://doi.org/10.21015/vtse.v11i3.1598Abstract
With a 3.5% mortality rate, liver disease is one of the worst diseases in existence. Pakistan is targeting this major health issue from several perspectives, to improve prevention, diagnosis, and treatment due to having the highest incidence of liver disorders in the world. For liver problem disease, also known as HEP C, Pakistan is now the second most prevalent country in the world. This is due to the rapid progression of HEP C, which can only be stopped by early diagnosis. If not, it progresses to the last stage of HEP C cirrhosis, which has no other treatment options besides liver transplantation. One and only machine learning algorithms like logistic regression, random forest, KNN, K-Means, and XGBoost can be used to predict liver illness utilizing modern methods like artificial intelligence. Data is gathered from Kaggle and subjected to several machine learning algorithms after pre-processing in order to quickly diagnose liver disease. Additionally, to improve accuracy, all of these algorithms are ensemble, and accuracy is 78.96%, along with precision, recall, and F1 score. In this work, liver disease is predicted early on using pre-processing, feature extraction, and classification techniques. Recall, precision, and f1score metrics are used to compare the accuracy of the six algorithms, and these algorithms are then combined to provide the most accurate diagnosis of liver disease.
References
Arif, M., Aslam, M. A., Rehman, H. U., Abbas, M., and Bukhari, S. (2022). Laboratory diagnostic pathways using machine learning. VFAST Transactions on Software Engineering, 10(1):78–85.
Battineni, G., Sagaro, G. G., Chinatalapudi, N., and Amenta, F. (2020). Applications of machine learning predictive models in the chronic disease diagnosis. Journal of personalized medicine, 10(2):21.
Bihter, D. (2020). A comparative study on the performance of classification algorithms for effective diagnosis of liver diseases. Sakarya University Journal of Computer and Information Sciences, 3(3):366–375.
Deekshatulu, B., Chandra, P., et al. (2013). Classification of heart disease using k-nearest neighbor and genetic algorithm. Procedia technology, 10:85–94. DOI: https://doi.org/10.1016/j.protcy.2013.12.340
Fatima, M., Pasha, M., et al. (2017). Survey of machine learning algorithms for disease diagnostic. Journal of Intelligent Learning Systems and Applications, 9(01):1. DOI: https://doi.org/10.4236/jilsa.2017.91001
Ibrahim, I. and Abdulazeez, A. (2021). The role of machine learning algorithms for diagnosing diseases. Journal of Applied Science and Technology Trends, 2(01):10–19.
Islam, M., Wu, C.-C., Poly, T. N., Yang, H.-C., Li, Y.-C. J., et al. (2018). Applications of machine learning in fatty live disease prediction. In Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth, pages 166–170. IOS Press.
Jun-Shan, T., Wei, H., and Yan, Q. (2009). Application of genetic algorithm in data mining. In 2009 First International Workshop on Education Technology and Computer Science, volume 2, pages 353–356. IEEE. DOI: https://doi.org/10.1109/ETCS.2009.340
Kumar, N., Narayan Das, N., Gupta, D., Gupta, K., and Bindra, J. (2021). Efficient automated disease diagnosis using machine learning models. Journal of Healthcare Engineering, 2021.
Mujumdar, A. and Vaidehi, V. (2019). Diabetes prediction using machine learning algorithms. Procedia Computer Science, 165:292–299.
Nahar, N. and Ara, F. (2018). Liver disease prediction by using different decision tree techniques. International Journal of Data Mining & Knowledge Management Process, 8(2):01–09. DOI: https://doi.org/10.5121/ijdkp.2018.8201
Nilashi, M., Ahmadi, N., Samad, S., Shahmoradi, L., Ahmadi, H., Ibrahim, O., Asadi, S., Abdullah, R., Abumalloh, R. A., and Yadegaridehkordi, E. (2020). Disease diagnosis using machine learning techniques: A review and classification. Journal of Soft Computing and Decision Support Systems, 7(1):19–30.
Priya, M. B., Juliet, P. L., and Tamilselvi, P. (2018). Performance analysis of liver disease prediction using machine learning algorithms. International Research Journal of Engineering and Technology (IRJET), 5(1):206–211.
Rahman, A. S., Shamrat, F. J. M., Tasnim, Z., Roy, J., and Hossain, S. A. (2019). A comparative study on liver disease prediction using supervised machine learning algorithms. International Journal of Scientific & Technology Research, 8(11):419–422.
Rairikar, A., Kulkarni, V., Sabale, V., Kale, H., and Lamgunde, A. (2017). Heart disease prediction using data mining techniques. In 2017 International conference on intelligent computing and control (I2C2), pages 1–8. IEEE. DOI: https://doi.org/10.1109/I2C2.2017.8321771
Senturk, Z. K. (2020). Early diagnosis of parkinson’s disease using machine learning algorithms. Medical hypotheses, 138:109603.
Shaheamlung, G. and Kaur, H. (2021). The diagnosis of chronic liver disease using machine learning techniques. INFORMATION TECHNOLOGY IN INDUSTRY, 9(2):554–564.
Tigga, N. P. and Garg, S. (2020). Prediction of type 2 diabetes using machine learning classification methods. Procedia Computer Science, 167:706–716.
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC-By) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
This work is licensed under a Creative Commons Attribution License CC BY