Protein Carbonylation Sites Prediction using Biomarkers of Oxidative Stress in Various Human Diseases: A Systematic Literature Review

Adeel Ashraf, Anam Shahzadi, Muhammad Sohaib Akram


Protein carbonylation is a non-enzymatic, irreversible, post translational modification (PTM). Carbonylation basically occurs due to the ROS, these species cause the oxidation of proteins and it will lead towards post translational modification of proteins known as carbonylation. In this ROS induces the carbonyl groups into the side chain of amino acid lysine (K), Proline (P), Arginine (R), Threonine (T). Carbonylation is known as a major hall mark or oxidative stress and leads to various diseases like age and age-related diseases. Different techniques and tools have been presented for detection of protein carbonylation, yet still there is no accurate result. In this systematic literature review I try to provide deep understanding of protein carbonylation sites, various techniques, comparison of tools relative to the techniques and its role in different diseases.

Full Text:



Rao, R. S. P, Møller and I.M, "Pattern of occurrence and occupancy of carbonylation sites in proteins. Proteomics," Proteomics, vol. 11, no. 21, pp. 4166-4173, 2011.

Augustyniak, E, Adam, A, Wojdyla, K, Rogowska-Wrzesinska, A, Willetts, R, Korkmaz, A, Griffiths and H. R, "Validation of protein carbonyl measurement: a multi-centre study," Redox biology, vol. 4, pp. 149-157, 2015.

Fedorova, M, Bollineni, R.C, Hoffmann and R, "Protein carbonylation as a major hallmark of oxidative damage: update of analytical strategieS," Mass spectrometry reviews, vol. 33, no. 2, pp. 79-97, 2014.

Weng, S.L, Huang, K.Y, Kaunang, F.J, Huang, C.H, Kao, H.J, Chang, T.H, Lee and T.Y, "Investigation and identification of protein carbonylation sites based on position-specific amino acid composition and physicochemical features," BMC bioinformatics, vol. 18, no. 3, pp. 125-141, 2017.

Xu, Y, Wang, X, Wang, Y, Tian, Y, Shao, X, Wu, L.Y, Deng and N, "Prediction of posttranslational modification sites from amino acid sequences with kernel methods," Journal of theoretical biology, vol. 344, pp. 78-87, 2014.

Coffey, C.M, Gronert and S, "A cleavable biotin tagging reagent that enables the enrichment and identification of carbonylation sites in proteins," Analytical and bioanalytical chemistry, vol. 408, no. 3, pp. 865-874, 2016.

sJia, J, Liu, Z, Xiao, X, Liu, B, Chou and K.C, "iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC," Oncotarget, vol. 7, no. 23, pp. 34558-34570, 2016.

Rogowska-Wrzesinska, A, Wojdyla, K, Nedić, O, Baron, C, P, Griffiths and H.R, "Analysis of protein carbonylation—pitfalls and promise in commonly used methods," Free radical research, vol. 48, no. 10, pp. 1145-1162, 2014.

Joshi, S, Kumari, S, Rathore and A.S, "Identification and characterization of carbonylation sites in trastuzumab biosimilars," International Journal of Biological Macromolecules, 2020.

Lv, H, Han, J, Liu, J, Zheng, J, Liu, R, Zhong and D, "CarSPred: a computational tool for predicting carbonylation sites of human proteins," PloS one, vol. 9, no. 10, p. e111478, 2014.

Jia, J, Liu, Z, Xiao, X, Liu, B, Chou and K.C, "iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC," Oncotarget, vol. 7, no. 23, pp. 34558-34570, 2017.

Kao, H.J, Weng, S.L, Huang, K.Y, Kaunang, F.J, Hsu, J, B. K, Huang, C.H, Lee and T.Y, "MDD-carb: a combinatorial model for the identification of protein carbonylation sites with substrate motifs," BMC systems biology, vol. 11, no. 7, pp. 127-140, 2017.

Zuo, Y, Jia and C.Z, " CarSite: identifying carbonylated sites of human proteins based on a one-sided selection resampling method," Molecular BioSystems, vol. 13, no. 11, pp. 2362-2369, 2017.

Hasan, M.A.M, Li, J, Ahmad, S, Molla and M.K.I, "predCar-site: Carbonylation sites prediction in proteins using support vector machine with resolving data imbalanced issue," Analytical biochemistry, vol. 525, pp. 107-113, 2017.

Zhang, D, Xu, Z.C, Su, W, Yang, Y.H, Lv, H, Yang, H, Lin and H, "iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features," Bioinformatics, 2020.

Lyu, H, Hao, L, Zheng, J, Liu, C, Liu, Y, Shang and Y, "Predicting Carbonylation Sites of Human Proteins with a New Max-Significance and Min-Redundancy Feature Selection Criterion," In 2018 9th International Conference on Information Technology in Medicine and Education (ITME), pp. 265-272, 2018.

Rao, R.S.P, Zhang, N, Xu, D, Møller and I.M, "CarbonylDB: a curated data-resource of protein carbonylation sites," Bioinformatics, vol. 34, no. 14, pp. 2518-2520, 2018.

Madian, A.G, Regnier and F.E, "Proteomic identification of carbonylated proteins and their oxidation sites," Journal of proteome research, vol. 9, no. 8, pp. 3766-3780, 2010.

Møller, I.M, Rogowska-Wrzesinska, A, Rao and R.S.P, "Protein carbonylation and metal-catalyzed protein oxidation in a cellular perspective," Journal of proteomics, vol. 74, no. 11, pp. 2228-2242, 2011.

Verrastro, I, Pasha, S, Jensen, K.T, Pitt, A.R, Spickett and C.M, "Mass spectrometry-based methods for identifying oxidized proteins in disease: advances and challenges," Biomolecules, vol. 5, no. 2, pp. 378-411, 2015.

Artemenko, K, Mi, J, Bergquist and J, "Mass-spectrometry-based characterization of oxidations in proteins," Free radical research, vol. 49, no. 5, pp. 477-493, 2015.

Santos, A.L, Lindner and A.B, "Protein posttranslational modifications: roles in aging and age-related disease," Oxidative Medicine and Cellular Longevity, vol. 2017, 2017.

Mittal, A, Rath and P.C, "Protein Structure and Function in Aging and Age-Related Diseases. In Models," Molecules and Mechanisms in Biogerontology, pp. 3-26, 2020.

Krisko, A, Radman and M, "Protein damage, ageing and age-related diseases," Open biology, vol. 9, no. 3, p. 180249, 2019.

Cabiscol, E, Tamarit, J, R. and J, "Protein carbonylation: proteomics, specificity and relevance to aging. Mass spectrometry reviews," Mass spectrometry reviews, vol. 33, no. 1, pp. 21-48, 2014.

Langert and L, "PLASMA PROTEIN CARBONYLATION AS BIOMARKER FOR ESTIMATE OF AGING," Doctoral dissertation, University of Split. School of Medicine. Public health, 2018.

Hecker, M, Wagner and A.H, "Role of protein carbonylation in diabetes," Journal of inherited metabolic disease, vol. 41, no. 1, pp. 29-38, 2018.

Saeed, S.; Mahmood, M. K.; Khan, Y. D., An exposition of facial expression recognition techniques. Neural Computing and Applications 2018, 29 (9), 425-443.

Butt, A. H.; Khan, Y. D., CanLect-Pred: A cancer therapeutics tool for prediction of target cancerlectins using experiential annotated proteomic sequences. IEEE Access 2019, 8, 9520-9531.

Amanat, S.; Ashraf, A.; Hussain, W.; Rasool, N.; Khan, Y. D., Identification of lysine carboxylation sites in proteins by integrating statistical moments and position relative features via general PseAAC. Current Bioinformatics 2020, 15 (5), 396-407.

Ilyas, S., Hussain, W., Ashraf, A., Khan, Y. D., Khan, S. A., & Chou, K. C. (2019). iMethylK-PseAAC: Improving accuracy of lysine methylation sites identification by incorporating statistical moments and position relative features into general PseAAC via Chou’s 5-steps rule. Current Genomics, 20(4), 275-292.

Hussain, W.; Rasool, N.; Khan, Y. D., A Sequence-Based Predictor of Zika Virus Proteins Developed by Integration of PseAAC and Statistical Moments. Combinatorial chemistry & high throughput screening 2020, 23 (8), 797-804.

Khan, Y. D.; Alzahrani, E.; Alghamdi, W.; Ullah, M. Z., Sequence-based Identification of Allergen Proteins Developed by Integration of PseAAC and Statistical Moments via 5-Step Rule. Current Bioinformatics 2020, 15 (9), 1046-1055.

Mahmood, M. K.; Ehsan, A.; Khan, Y. D.; Chou, K.-C., iHyd-LysSite (EPSV): Identifying Hydroxylysine Sites in Protein Using Statistical Formulation by Extracting Enhanced Position and Sequence Variant Feature Technique. Current Genomics 2020, 21 (7), 536-545.

Naseer, S.; Hussain, W.; Khan, Y. D.; Rasool, N., IPhosS (Deep)-PseAAC: Identify phosphoserine sites in proteins using deep learning on general pseudo amino acid compositions via modified 5-Steps rule. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2020.

Naseer, S.; Hussain, W.; Khan, Y. D.; Rasool, N., Sequence-based identification of arginine amidation sites in proteins using deep representations of proteins and PseAAC. Current Bioinformatics 2020, 15 (8), 937-948.

Shah, A. A.; Khan, Y. D., Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification. Scientific Reports 2020, 10 (1), 1-10.

Awais, M.; Hussain, W.; Rasool, N.; Khan, Y. D., iTSP-PseAAC: Identifying Tumor Suppressor Proteins by Using Fully Connected Neural Network and PseAAC. Current Bioinformatics 2021, 16 (5), 700-709.

Hussain, W.; Rasool, N.; Khan, Y. D., Insights into Machine Learning-based approaches for Virtual Screening in Drug Discovery: Existing strategies and streamlining through FP-CADD. Current Drug Discovery Technologies 2021, 18 (4), 463-472.

Khan, Y. D.; Khan, N. S.; Naseer, S.; Butt, A. H., iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou’s PseAAC. PeerJ 2021, 9, e11581.

Malebary, S. J.; Khan, R.; Khan, Y. D., ProtoPred: Advancing Oncological Research Through Identification of Proto-Oncogene Proteins. IEEE Access 2021, 9, 68788-68797.

Malebary, S. J.; Khan, Y. D., Evaluating machine learning methodologies for identification of cancer driver genes. Scientific reports 2021, 11 (1), 1-13.

Malebary, S. J.; Khan, Y. D., Identification of Antimicrobial Peptides Using Chou's 5 Step Rule. CMC-COMPUTERS MATERIALS & CONTINUA 2021, 67 (3), 2863-2881.

Naseer, S.; Ali, R. F.; Khan, Y. D.; Dominic, P., iGluK-Deep: computational identification of lysine glutarylation sites using deep neural networks with general pseudo amino acid compositions. Journal of Biomolecular Structure and Dynamics 2021, 1-14.

Naseer, S.; Hussain, W.; Khan, Y. D.; Rasool, N., NPalmitoylDeep-PseAAC: A Predictor of N-Palmitoylation Sites in Proteins Using Deep Representations of Proteins and PseAAC via Modified 5-Steps Rule. Current Bioinformatics 2021, 16 (2), 294-305.

Naseer, S.; Hussain, W.; Khan, Y. D.; Rasool, N., Optimization of serine phosphorylation prediction in proteins by comparing human engineered features and deep representations. Analytical Biochemistry 2021, 615, 114069.

Khanum, S., Ashraf, M. A., Karim, A., Shoaib, B., Khan, M. A., Naqvi, R. A., ... & Alswaitti, M. Gly-LysPred: Identification of Lysine Glycation Sites in Protein Using Position Relative Features and Statistical Moments via Chou’s 5 Step Rule.

Lv, H., Dao, F. Y., Zhang, D., Yang, H., & Lin, H. (2021). Advances in mapping the epigenetic modifications of 5‐methylcytosine (5mC), N6‐methyladenine (6mA), and N4‐methylcytosine (4mC). Biotechnology and Bioengineering.

Zulfiqar, H., Sun, Z. J., Huang, Q. L., Yuan, S. S., Lv, H., Dao, F. Y., ... & Li, Y. W. (2021). Deep-4mCW2V: A sequence-based predictor to identify N4-methylcytosine sites in Escherichia coli. Methods.

Liu, Y., Wang, X., & Liu, B. (2019). A comprehensive review and comparison of existing computational methods for intrinsically disordered protein and region prediction. Briefings in bioinformatics, 20(1), 330-346.

Zhang, D., Xu, Z. C., Su, W., Yang, Y. H., Lv, H., Yang, H., & Lin, H. (2021). iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features. Bioinformatics, 37(2), 171-177.



  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.