AOPs-XGBoost: Machine learning Model for the prediction of Antioxidant Proteins properties of peptides

Authors

  • Sikander Rahu School of Computer Science and Technology, Xidian University, Xi’an 710071, China
  • Ali Ghulam Information Technology Centre, Sindh Agriculture University, Sindh, Pakistan
  • Zar Nawab Khan Swati Department Of Computer Sciences, Karakoram International University Gilgit-Baltistan, Pakistan
  • Jawad Usman Arshed Department of computer science, University of Baltistan Skardu, Pakistan
  • Muhammad Shahid Malik Department Of Computer Sciences, Karakoram International University Gilgit-Baltistan, Pakistan
  • Nauman khan COMSATS University, Abbottabad, Pakistan

DOI:

https://doi.org/10.21015/vtse.v10i4.1318

Abstract

Abstract Antioxidant proteins are essential for protecting cells from free radicals. The accurate identification of antioxidant proteins via biological tests is difficult because of the high time and financial investment required. The potential of peptides produced from natural proteins is demonstrated by the fact that they are generally regarded as secure and may have additional advantageous bioactivities. Antioxidative peptides are typically discovered by analyzing numerous peptides created when a variety of proteases hydrolysis proteins. The eXtreme Gradient Boosting (XGBoost) technique was used to create a novel model for the current study, which was then compared to the most popular machine learning models. We suggested a machine-learning model that we named AOPs-XGBoost, built on sequence features and Extreme Gradient Boosting (XGBoost). We used 10-fold cross-validation testing was performed on a testing dataset using the propose. AOPs-XGBoost classifier, and the results showed a sensitivity of 67.56%, specificity of 93.87%, average accuracy of 80.72%, mean cross-validation (MCC) of 66.29%), and area under the receiver operating characteristic curve (AUC) of 88.01%. The outcomes demonstrated that the XGBoost model outperformed the other models with accuracy of 80.72% and area under the receiver operating characteristic curve of 88.01% which were better than the other models. Experimental results demonstrate that AOPs-XGBoost is a useful classifier that advances the study of antioxidant proteins.

References

S. B. Nimse and D. Pal, “Free radicals, natural antioxidants, and their reaction mechanisms,” RSC Adv., vol. 5, no. 35, pp. 27986–28006, 2015. DOI: https://doi.org/10.1039/C4RA13315C

V. Lobo, A. Patil, A. Phatak, and N. Chandra, “Free radicals, antioxidants and functional foods: Impact onhuman health,” Pharmacogn. Rev, vol. 4, 2010.

M. Valko, D. Leibfritz, J. Moncol, M. T. Cronin, M. Mazur, and J. Telser, “Free radicals and antioxidants in normalphysiological functions and human disease,” Int. J. Biochem. Cell Biol, vol. 39, pp. 44–84, 2007.

M. Valko, D. Leibfritz, J. Moncol, M. T. D. Cronin, M. Mazur, and J. Telser, “Free radicals and antioxidants in normal physiological functions and human disease,” Int. J. Biochem. Cell Biol., vol. 39, no. 1, pp. 44–84, 2007. DOI: https://doi.org/10.1016/j.biocel.2006.07.001

S. Sen, R. Chakraborty, C. Sridhar, Y. Reddy, and B. De, “Free radicals, antioxidants,diseases and phytomedicines: current status and future prospect,” Int J Pharm Sci Rev Res, vol. 3, pp. 91–100, 2010.

J. K. Miller, E. Brzezinska-Slebodzinska, and F. C. Madsen, “Oxidative stress, antioxidants,and animal function,” J Dairy Sci, vol. 76, pp. 2812–2823, 1993. DOI: https://doi.org/10.3168/jds.S0022-0302(93)77620-1

N. F. Santos-Sánchez, Institute of Agroindustries, Technological University of the Mixteca, Mexico, R. Salas-Coronado, R. Valadez-Blanco, B. Hernández-Carlos, and P. C. Guadarrama-Mendoza, “Natural antioxidant extracts as food preservatives [pdf],” Acta Sci. Pol. Technol. Aliment., vol. 16, no. 4, pp. 361–370, 2017.

A. H. Butt, N. Rasool, and Y. D. Khan, “Prediction of antioxidant proteins by incorporating statistical moments based features into Chou’s PseAAC,” J. Theor. Biol., vol. 473, pp. 1–8, 2019.

V. Lobo, A. Patil, A. Phatak, and N. Chandra, “Free radicals, antioxidants and functional foods: Impact on human health,” Pharmacogn. Rev., vol. 4, no. 8, pp. 118–126, 2010. DOI: https://doi.org/10.4103/0973-7847.70902

S. Reuter, S. C. Gupta, M. M. Chaturvedi, and B. B. Aggarwal, “Oxidative stress, inflammation, and cancer: How are they linked? Free Radic,” Biol. Med, vol. 49, pp. 1603–1616, 2010. DOI: https://doi.org/10.1016/j.freeradbiomed.2010.09.006

Y. Gilgun-Sherki, E. Melamed, and D. Offen, The role of oxidative stress in thepathogenesis of multiple sclerosis: The need for effectiveantioxidant therapy. .

Y. El Masri et al., “Role of oxidative stress in cardiac remodeling induced by the HFD diet and cigarette smoke exposure,” Arch. Cardiovasc. Dis. Suppl., vol. 14, no. 2, pp. 190–191, 2022.

P. Feng and L. Feng, “Recent advances on antioxidant identification based on machine learning methods,” Curr. Drug Metab., vol. 21, no. 10, pp. 804–809, 2020. DOI: https://doi.org/10.2174/1389200221666200719001449

P.-M. Feng, H. Lin, and W. Chen, “Identification of antioxidants from sequence information is using naïve Bayes,” Comput. Math. Methods Med., vol. 2013, p. 567529, 2013. DOI: https://doi.org/10.1155/2013/567529

P. Feng, W. Chen, and H. Lin, “Identifying antioxidant proteins by using optimal dipeptide compositions,” Interdiscip. Sci., vol. 8, no. 2, pp. 186–191, 2016. DOI: https://doi.org/10.1007/s12539-015-0124-9

R. Sikander, A. Ghulam, and F. Ali, “XGB-DrugPred: computational prediction of druggable proteins using eXtreme gradient boosting and optimized features set,” Sci. Rep., vol. 12, no. 1, p. 5505, 2022.

S. Rahu et al., “UBI-XGB: Identification of Ubiquitin Proteins Using Machine Learning Model,” Journal of Mountain Area Research, vol. 8, pp. 14–26, 2022.

G. Wang and R. L. Dunbrack Jr, “PISCES: a protein sequence culling server,” Bioinformatics, vol. 19, no. 12, pp. 1589–1591, 2003. DOI: https://doi.org/10.1093/bioinformatics/btg224

E. Fernández-Blanco, V. Aguiar-Pulido, C. R. Munteanu, and J. Dorado, “Random Forest classification based on star graph topological indices for antioxidant proteins,” J. Theor. Biol., vol. 317, pp. 331–337, 2013. DOI: https://doi.org/10.1016/j.jtbi.2012.10.006

L. Fu, B. Niu, Z. Zhu, S. Wu, and W. Li, “CD-HIT: accelerated for clustering the next-generation sequencing data,” Bioinformatics, vol. 28, no. 23, pp. 3150–3152, 2012. DOI: https://doi.org/10.1093/bioinformatics/bts565

M. Wang et al., “DeepMal: Accurate prediction of protein malonylation sites by deep neural networks,” Chemometr. Intell. Lab. Syst., vol. 207, no. 104175, p. 104175, 2020.

“Correction to: Prediction of transcript structure and concentration using RNA-Seq data,” Brief. Bioinform., 2023.

X.-G. Chen, W. Zhang, X. Yang, C. Li, and H. Chen, “ACP-DA: Improving the prediction of anticancer peptides using Data Augmentation,” Front. Genet., vol. 12, p. 698477, 2021.

“Classification of large-scale datasets of Landsat-8 satellite image based on LIBLINEAR library,” ajest, 2022.

D. Chakraborty and H. Elzarka, “Advanced machine learning techniques for building performance simulation: a comparative analysis,” J. Build. Perform. Simul., vol. 12, no. 2, pp. 193–207, 2019.

L. Zou, C. Nan, and F. Hu, “Accurate prediction of bacterial type IV secreted effectors using amino acid composition and PSSM profiles,” Bioinformatics, vol. 29, no. 24, pp. 3135–3142, 2013. DOI: https://doi.org/10.1093/bioinformatics/btt554

A. Ghulam et al., “Identifying Molecular Functions of Dynein Motor Proteins Using Extreme Gradient Boosting Algorithm with Machine Learning,” Journal of Mountain Area Research, vol. 8, pp. 1–13, 2022.

Downloads

Published

2022-12-31

How to Cite

Rahu, S., Ghulam, A., Khan Swati, Z. N., Arshed, J. U., Malik, M. S., & khan, N. (2022). AOPs-XGBoost: Machine learning Model for the prediction of Antioxidant Proteins properties of peptides. VAWKUM Transactions on Computer Sciences, 10(2), 73–82. https://doi.org/10.21015/vtse.v10i4.1318