GlycoMinestruct: a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
- PMID: 27708373
- PMCID: PMC5052564
- DOI: 10.1038/srep34595
GlycoMinestruct: a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
Abstract
Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural features to glycosylation prediction. Here, we propose a novel bioinformatics method called GlycoMinestruct(http://glycomine.erc.monash.edu/Lab/GlycoMine_Struct/) for improved prediction of human N- and O-linked glycosylation sites by combining sequence and structural features in an integrated computational framework with a two-step feature-selection strategy. Experiments indicated that GlycoMinestruct outperformed NGlycPred, the only predictor that incorporated both sequence and structure features, achieving AUC values of 0.941 and 0.922 for N- and O-linked glycosylation, respectively, on an independent test dataset. We applied GlycoMinestruct to screen the human structural proteome and obtained high-confidence predictions for N- and O-linked glycosylation sites. GlycoMinestruct can be used as a powerful tool to expedite the discovery of glycosylation events and substrates to facilitate hypothesis-driven experimental studies.
Conflict of interest statement
The authors declare no competing financial interests.
Figures
Similar articles
-
GlycoMine: a machine learning-based approach for predicting N-, C- and O-linked glycosylation in the human proteome.Bioinformatics. 2015 May 1;31(9):1411-9. doi: 10.1093/bioinformatics/btu852. Epub 2015 Jan 6. Bioinformatics. 2015. PMID: 25568279
-
Positive-unlabelled learning of glycosylation sites in the human proteome.BMC Bioinformatics. 2019 Mar 6;20(1):112. doi: 10.1186/s12859-019-2700-1. BMC Bioinformatics. 2019. PMID: 30841845 Free PMC article.
-
GlycoDomainViewer: a bioinformatics tool for contextual exploration of glycoproteomes.Glycobiology. 2018 Mar 1;28(3):131-136. doi: 10.1093/glycob/cwx104. Glycobiology. 2018. PMID: 29267884
-
Decoding of O-Linked Glycosylation by Mass Spectrometry.Biochemistry. 2017 Mar 7;56(9):1218-1226. doi: 10.1021/acs.biochem.6b01244. Epub 2017 Feb 27. Biochemistry. 2017. PMID: 28196325 Review.
-
Advances in mass spectrometry driven O-glycoproteomics.Biochim Biophys Acta. 2015 Jan;1850(1):33-42. doi: 10.1016/j.bbagen.2014.09.026. Epub 2014 Oct 2. Biochim Biophys Acta. 2015. PMID: 25284204 Review.
Cited by
-
Broadly neutralizing antibodies for HIV prevention: a comprehensive review and future perspectives.Clin Microbiol Rev. 2024 Jun 13;37(2):e0015222. doi: 10.1128/cmr.00152-22. Epub 2024 Apr 30. Clin Microbiol Rev. 2024. PMID: 38687039 Review.
-
EMNGly: predicting N-linked glycosylation sites using the language models for feature extraction.Bioinformatics. 2023 Nov 1;39(11):btad650. doi: 10.1093/bioinformatics/btad650. Bioinformatics. 2023. PMID: 37930896 Free PMC article.
-
Glycoproteomics Landscape of Asymptomatic and Symptomatic Human Alzheimer's Disease Brain.Mol Cell Proteomics. 2022 Dec;21(12):100433. doi: 10.1016/j.mcpro.2022.100433. Epub 2022 Oct 27. Mol Cell Proteomics. 2022. PMID: 36309312 Free PMC article.
-
A systematic review of recent trends in research on therapeutically significant L-asparaginase and acute lymphoblastic leukemia.Mol Biol Rep. 2022 Dec;49(12):11281-11287. doi: 10.1007/s11033-022-07688-4. Epub 2022 Jul 10. Mol Biol Rep. 2022. PMID: 35816224
-
Functions of Glycosylation and Related Web Resources for Its Prediction.Methods Mol Biol. 2022;2499:135-144. doi: 10.1007/978-1-0716-2317-6_6. Methods Mol Biol. 2022. PMID: 35696078
References
-
- Spiro R. G. Protein glycosylation: nature, distribution, enzymatic formation, and disease implications of glycopeptide bonds. Glycobiology 12, 43R–56R (2002). - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous