Adaptive Bug Localization Framework for Precision-Driven Bug Localization in Software Engineering

Authors

DOI:

https://doi.org/10.21015/vtse.v12i3.1832

Abstract

Software development always looks for automated methods to improve productivity and accuracy in issue detection. The paper conducts a comparative examination of several machine-learning techniques to tackle the bug localization difficulty. Our study compared the performance of Logistic Regression (LR), Random Forest Classifier (RFC), Support Vector Machine (SVM), Gradient Boosting Classifier (GBC), and Adaptive Bug Localization System (ABLS) on five dataset versions. The results demonstrate the superior performance of ensemble learning methods. The ABLS model regularly beats other models regarding F1 score, accuracy, and recall, indicating its strong potential for precise problem localization. The study highlights the necessity of continuously adapting models to tackle idea drift in dynamic datasets. Our research suggests a path for future endeavours involving improving feature engineering and integrating real-time online learning to sustain high performance in bug localization activities.

Author Biographies

Waqas Ali, School of Information Engineering, Yangzhou University, China

Student

Shamshad Lakho, Department of Computer Science, Quaid-e-Awam University of Engineering , Science and Technology, Nawabshah,Pakistan

Assistant Professor at Department of Computer Science, Quaid-e-Awam University of Engineering , Science and Technology, Nawabshah

Nadeem Naeem Bhatti, Department of Electronic Engineering, Quaid-e-Awam University of Engineering, Science and Technology, Nawabshah,Pakistan

Associate Professor at Department of Electronic Engineering, Quaid-e-Awam University of Engineering, Science and Technology, Nawabshah

Imran Ali Memon, Department of Information Technology, Shaheed Benazir Bhutto University, Shaheed Benazirabad,Pakistan

Lecturer at Department of Information Technology, Shaheed Benazir Bhutto University, Shaheed Benazirabad

References

S. Singh and V. Kumar, “Heart disease prediction using machine learning algorithms,” Materials Today: Proceedings, vol. 49, pp. 1822–1827, 2022.

G. Choudhary, “Heart disease prediction using machine learning,” Journal of Computer Science and Technology Studies, vol. 5, no. 2, pp. 12–19, 2022.

R. Paul, A. Bandopadhyay, A. Karmakar, and P. Gupta, “Machine learning approach for prediction of heart disease,” in Proceedings of the 2022 IEEE 5th International Conference on Computing Communication and Automation (ICCCA), pp. 265–270, 2022.

P. Kaur and M. Kaur, “Heart disease prediction using hybrid machine learning model,” in Proceedings of the 2022 3rd International Conference on Computational Techniques in Information and Communication Technologies (ICCTICT), pp. 170–175, 2022.

D. Kumar, G. V. Soni, and M. Yadav, “Heart disease prediction using data mining and machine learning algorithms: A review,” Journal of Intelligent Systems, vol. 12, no. 1, pp. 33–45, 2022.

M. Al-Habsi and M. Al-Maashari, “Prediction of heart disease using machine learning techniques: A review,” International Journal of Computer Applications, vol. 183, no. 30, pp. 23–29, 2021.

K. A. Patel and P. Shah, “Heart disease prediction using deep learning techniques,” Journal of Artificial Intelligence Research, vol. 8, no. 1, pp. 200–206, 2022.

S. B. Naik and V. Y. Suryawanshi, “Heart disease prediction system using data mining techniques,” International Journal of Computer Science and Information Technologies, vol. 10, no. 3, pp. 45–49, 2022.

A. Agarwal and P. Sharma, “Prediction of heart disease using data mining techniques,” Materials Today: Proceedings, vol. 50, pp. 1741–1747, 2022.

S. Gupta and R. K. Garg, “Heart disease prediction using various machine learning algorithms,” Journal of Computing and Information Science in Engineering, vol. 6, no. 2, pp. 177–186, 2022.

S. Suneja, Y. Zhuang, Y. Zheng, J. Laredo, A. Morari, and U. Khurana, “Code vulnerability detection via signal-aware learning,” in 2023 IEEE 8th European Symposium on Security and Privacy (EuroS&P), pp. 506–523, 2023.

T. D. LaToza, G. Venolia, and R. DeLine, “Maintaining mental models: a study of developer work habits,” in Proceedings of the 28th international conference on Software engineering, pp. 492–501, 2006.

J. A. Jones, M. J. Harrold, and J. Stasko, “Visualization of test information to assist fault localization,” in Proceedings of the 24th international conference on Software engineering, pp. 467–477, 2002.

W. E. Wong, Y. Qi, L. Zhao, and K.-Y. Cai, “Effective fault localization using code coverage,” in 31st Annual International Computer Software and Applications Conference (COMPSAC 2007), vol. 1, pp. 449–456, 2007.

M. Renieres and S. P. Reiss, “Fault localization with nearest neighbor queries,” in 18th IEEE International Conference on Automated Software Engineering, 2003. Proceedings., pp. 30–39, 2003.

M. Chen, X. Qiu, and X. Li, “Automatic test case generation for UML activity diagrams,” in Proceedings of the 2006 international workshop on Automation of software test, pp. 2–8, 2006.

C. Liu, X. Yan, L. Fei, J. Han, and S. P. Midkiff, “SOBER: statistical model-based bug localization,” ACM SIGSOFT Software Engineering Notes, vol. 30, no. 5, pp. 286–295, 2005.

P. Chatterjee, M. Kong, and L. Pollock, “Finding help with programming errors: An exploratory study of novice software engineers’ focus in stack overflow posts,” Journal of Systems and Software, vol. 159, p. 110454, 2020.

S. Kim, K. Pan, and J. E. Whitehead Jr., “Memories of bug fixes,” in Proceedings of the 14th ACM SIGSOFT international symposium on Foundations of software engineering, pp. 35–45, 2006.

H. Washizaki et al., “Systematic literature review of security pattern research,” Information, vol. 12, no. 1, p. 36, 2021.

A. Schr{"o}ter, T. Zimmermann, and A. Zeller, “Predicting component failures at design time,” in Proceedings of the 2006 ACM/IEEE international symposium on Empirical software engineering, pp. 18–27, 2006.

T. Zimmermann, N. Nagappan, H. Gall, E. Giger, and B. Murphy, “Cross-project defect prediction: a large scale experiment on data vs. domain vs. process,” in Proceedings of the 7th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering, pp. 91–100, 2009.

X. Xia, D. Lo, S. McIntosh, E. Shihab, and A. E. Hassan, “Cross-project build co-change prediction,” in IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER), pp. 311–320, 2015.

S. Lessmann, B. Baesens, C. Mues, and S. Pietsch, “Benchmarking classification models for software defect prediction: A proposed framework and novel findings,” IEEE transactions on software engineering, vol. 34, no. 4, pp. 485–496, 2008.

K. O. Elish and M. O. Elish, “Predicting defect-prone software modules using support vector machines,” Journal of Systems and Software, vol. 81, no. 5, pp. 649–660, 2008.

H. Zhang, X. Zhang, and M. Gu, “Predicting defective software components from code complexity measures,” in 13th Pacific Rim International Symposium on Dependable Computing (PRDC 2007), pp. 93–96, 2007.

R. Malhotra, “A systematic review of machine learning techniques for software fault prediction,” Applied Soft Computing, vol. 27, pp. 504–518, 2015.

T. Menzies and A. Marcus, “Automated severity assessment of software defect reports,” in IEEE International Conference on Software Maintenance, pp. 346–355, 2008.

D. Opitz and R. Maclin, “Popular ensemble methods: An empirical study,” Journal of artificial intelligence research, vol. 11, pp. 169–198, 1999.

Y. Liu, Y. Wang, and J. Zhang, “New machine learning algorithm: Random forest,” in Information Computing and Applications: Third International Conference, ICICA 2012, Chengde, China, September 14-16, 2012. Proceedings 3, pp. 246–252, 2012.

J. H. Friedman, “Greedy function approximation: a gradient boosting machine,” Annals of statistics, pp. 1189–1232, 2001.

S. Kim, T. Zimmermann, K. Pan, and J. E. Whitehead Jr., “Automatic identification of bug-introducing changes,” in 21st IEEE/ACM international conference on automated software engineering (ASE'06), pp. 81–90, 2006.

Y. Freund and R. E. Schapire, “A decision-theoretic generalization of on-line learning and an application to boosting,” Journal of computer and system sciences, vol. 55, no. 1, pp. 119–139, 1997.

M. White, C. Vendome, M. Linares-V{'a}squez, and D. Poshyvanyk, “Toward deep learning software repositories,” in IEEE/ACM 12th Working Conference on Mining Software Repositories, pp. 334–345, 2015.

Q. Le and T. Mikolov, “Distributed representations of sentences and documents,” in International conference on machine learning, pp. 1188–1196, 2014.

Downloads

Published

2024-09-30

How to Cite

Ali, W., soomro, S. siraj, Lakho, S., Bhatti, N. N., & Memon, I. A. (2024). Adaptive Bug Localization Framework for Precision-Driven Bug Localization in Software Engineering. VFAST Transactions on Software Engineering, 12(3), 230–242. https://doi.org/10.21015/vtse.v12i3.1832