GlycoSiteMiner FAQ
The frequently asked questions are a collection of user questions related to the GlyGen GlycoSiteMiner tool.
What is GlycoSiteMiner?
GlycoSiteMiner (GSM) is an automated literature-mining pipeline designed to extract experimentally verified, protein sequence–specific glycosylation sites from PubMed abstracts. Using advanced ML/AI algorithms, it filters out false positives and ensures data accuracy. Applied to over 33 million PubMed abstracts, GlycoSiteMiner has uncovered 1,118 new sequence-specific glycosylation sites that were not previously available in the GlyGen resource.
How to identify glycosylation sites predicted from GSM on GlyGen?
You can check if a protein has predicted glycosylation sites by visiting its protein details page and selecting the Glycosylation tab from the left-hand menu. This will take you to the Glycosylation section, where several sub-tabs are available. Predicted site information is located in the Text Mining tab, which lists all predicted glycosylation sites along with details about the tool used to generate each prediction.