References
Bagdonaite, I. et al. Glycoproteomics. Nat. Rev. Methods Primers 2, 48 (2022).
Bern, M., Kil, Y. J. & Becker, C. Byonic: advanced peptide and protein identification software. Curr. Protoc. Bioinform. 40, 13 (2012).
Zeng, W.-F., Cao, W.-Q., Liu, M.-Q., He, S.-M. & Yang, P.-Y. Precise, fast and comprehensive analysis of intact glycopeptides and modified glycans with pGlyco3. Nat. Methods 18, 1515–1523 (2021).
Polasky, D. A., Yu, F., Teo, G. C. & Nesvizhskii, A. I. Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat. Methods 17, 1125–1132 (2020).
Lu, L., Riley, N. M., Shortreed, M. R., Bertozzi, C. R. & Smith, L. M. O-pair search with MetaMorpheus for O-glycopeptide characterization. Nat. Methods 17, 1133–1138 (2020).
Fang, Z. et al. Glyco-Decipher enables glycan database-independent peptide matching and in-depth characterization of site-specific N-glycosylation. Nat. Commun. 13, 1900 (2022).
Xiao, K. & Tian, Z. GPSeeker enables quantitative structural N-glycoproteomics for site- and structure-specific characterization of differentially expressed N-glycosylation in hepatocellular carcinoma. J. Proteome Res. 18, 2885–2895 (2019).
Sun, W. et al. Glycopeptide database search and de novo sequencing with PEAKS GlycanFinder enable highly sensitive glycoproteomics. Nat. Commun. 14, 4046 (2023).
Shen, J. et al. StrucGP: de novo structural sequencing of site-specific N-glycan on glycoproteins using a modularization strategy. Nat. Methods 18, 921–929 (2021).
Toghi Eshghi, S., Shah, P., Yang, W., Li, X. & Zhang, H. GPQuest: a spectral library matching algorithm for site-specific assignment of tandem mass spectra to intact N-glycopeptides. Anal. Chem. 87, 5181–5188 (2015).
Li, S., Zhu, J., Lubman, D. M., Zhou, H. & Tang, H. GlycoSLASH: concurrent glycopeptide identification from multiple related LC-MS/MS data sets by using spectral clustering and library searching. J. Proteome Res. 22, 1501–1509 (2023).
Ye, Z., Mao, Y., Clausen, H. & Vakhrushev, S. Y. Glyco-DIA: a method for quantitative O-glycoproteomics with in silico-boosted glycopeptide libraries. Nat. Methods 16, 902–910 (2019).
Yang, Y. et al. GproDIA enables data-independent acquisition glycoproteomics with comprehensive statistical control. Nat. Commun. 12, 6073 (2021).
Searle, B. C. et al. Generating high quality libraries for DIA MS with empirically corrected peptide predictions. Nat. Commun. 11, 1548 (2020).
Yang, Y. et al. In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics. Nat. Commun. 11, 146 (2020).
Demichev, V. et al. dia-PASEF data analysis using FragPipe and DIA-NN for deep proteomics of low sample amounts. Nat. Commun. 13, 3944 (2022).
Yu, Y. & Li, M. Towards highly sensitive deep learning-based end-to-end database search for tandem mass spectrometry. Nat. Mach. Intell. 7, 85–95 (2025).
Bouwmeester, R., Gabriels, R., Hulstaert, N., Martens, L. & Degroeve, S. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat. Methods 18, 1363–1369 (2021).
Zeng, W.-F. et al. AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics. Nat. Commun. 13, 7238 (2022).
Tiwary, S. et al. High-quality MS/MS spectrum prediction for data-dependent and data-independent acquisition data analysis. Nat. Methods 16, 519–525 (2019).
Tarn, C. & Zeng, W. F. pDeep3: toward more accurate spectrum prediction with fast few-shot learning. Anal. Chem. 93, 5815–5822 (2021).
Gessulat, S. et al. Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat. Methods 16, 509–518 (2019).
Zong, Y. et al. DeepFLR facilitates false localization rate control in phosphoproteomics. Nat. Commun. 14, 2269 (2023).
Li, K., Jain, A., Malovannaya, A., Wen, B. & Zhang, B. DeepRescore: leveraging deep learning to improve peptide identification in immunopeptidomics. Proteomics 20, e1900334 (2020).
Yang, K. L. et al. MSBooster: improving peptide identification rates using deep learning-based features. Nat. Commun. 14, 4539 (2023).
Zhou, W.-J., Wei, Z.-H., He, S.-M. & Chi, H. pValid 2: a deep learning based validation method for peptide identification in shotgun proteomics with increased discriminating power. J. Proteom. 251, 104414 (2022).
Wilhelm, M. et al. Deep learning boosts sensitivity of mass spectrometry-based immunopeptidomics. Nat. Commun. 12, 3346 (2021).
Ma, C. et al. Improved peptide retention time prediction in liquid chromatography through deep learning. Anal. Chem. 90, 10881–10888 (2018).
Cox, J. Prediction of peptide mass spectral libraries with machine learning. Nat. Biotechnol. 41, 33–43 (2023).
Zong, Y., Wang, Y., Qiu, X., Huang, X. & Qiao, L. Deep learning prediction of glycopeptide tandem mass spectra powers glycoproteomics. Nat. Mach. Intell. 6, 950–961 (2024).
Yang, Y. & Fang, Q. Prediction of glycopeptide fragment mass spectra by deep learning. Nat. Commun. 15, 2448 (2024).
Vaswani, A. et al. Attention is all you need. In Proc. Advances in Neural Information Processing Systems Vol. 30 (eds Guyon, I. et al.) (Curran Associates, 2017).
Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M. & Monfardini, G. The graph neural network model. IEEE Trans. Neural Netw. 20, 61–80 (2009).
Tai, K. S., Socher, R. & Manning, C. D. Improved semantic representations from tree-structured long short-term memory networks. Preprint at https://arxiv.org/abs/1503.00075 (2015).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Zhang, S. Spectrum and Retention Time Prediction for N-Glycopeptides Using Deep Learning. MSc thesis, Univ. Waterloo (2023).
Klein, J., Carvalho, L. & Zaia, J. Expanding N-glycopeptide identifications by modeling fragmentation, elution, and glycome connectivity. Nat. Commun. 15, 6168 (2024).
Zhang, Z. & Shah, B. Prediction of collision-induced dissociation spectra of common N-glycopeptides for glycoform identification. Anal. Chem. 82, 10194–10202 (2010).
Ying, C. et al. Do transformers really perform badly for graph representation? In Proc. Advances in Neural Information Processing Systems Vol. 34 (eds Ranzato, M. et al.) 28877–28888 (Curran Associates, 2021).
Young, A., Röst, H. & Wang, B. Tandem mass spectrum prediction for small molecules using graph transformers. Nat. Mach. Intell. 6, 404–416 (2024).
Dang, L. et al. Recognition of bisecting N-glycans on intact glycopeptides by two characteristic ions in tandem mass spectra. Anal. Chem. 91, 5478–5482 (2019).
Liu, M.-Q. et al. pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification. Nat. Commun. 8, 438 (2017).
Chen, Z. et al. Recognition of core-fucosylated glycopeptides based on the Y1+Fuc/Y1 ratio in low-energy HCD spectra. Anal. Chem. 94, 17349–17353 (2022).
Xin, M. et al. Precision glycoproteomics reveals distinctive N-glycosylation in human spermatozoa. Mol. Cell. Proteomics 21, 100214 (2022).
Xin, M. et al. Precision structural interpretation of site-specific N-glycans in seminal plasma. J. Proteome Res. 21, 1664–1674 (2022).
Abramson, J. et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).
Bekker-Jensen, D. B. et al. An optimized shotgun strategy for the rapid generation of comprehensive human proteomes. Cell Syst. 4, 587–599 (2017).
Shen, J. & Sun, S. StrucGP: a software for structural interpretation of N-glycans on intact glycopeptides using tandem mass spectrometry data. Zenodo https://doi.org/10.5281/zenodo.4925441 (2021).
The, M., MacCoss, M. J., Noble, W. S. & Käll, L. Fast and accurate protein false discovery rates on large-scale proteomics data sets with Percolator 3.0. J. Am. Soc. Mass Spectrom. 27, 1719–1727 (2016).
Chen, T. et al. iProX in 2021: connecting proteomics data sharing with big data. Nucleic Acids Res. 50, D1522–D1527 (2022).
Wang, X., Song, R., Feng, Z. & Sun, S. SpecGP as a transformer-based model for predicting energy-adaptable structural spectra of glycopeptides. Code Ocean https://doi.org/10.24433/CO.3765457.v1 (2026).
Wang, X., Song, R., Feng, Z. & Sun, S. SpecGP as a transformer-based model for predicting energy-adaptable structural spectra of glycopeptides. Zenodo https://doi.org/10.5281/zenodo.19388598 (2026).
