Protein details/Glycosylation: Difference between revisions

From GlyGen Wiki
Jump to navigation Jump to search
No edit summary
Line 33: Line 33:
The collected data is processed and stored in '''[https://data.glygen.org data.glygen.org]''' in following datasets.
The collected data is processed and stored in '''[https://data.glygen.org data.glygen.org]''' in following datasets.


*Human Glycosylation Sites (UniProtKB)
Homo Sapiens Datasets
*Mouse Glycosylation Sites (UniProtKB)
 
*Glycosylation Sites - UniCarbKB [Human proteins]
Hepatitis C Virus Datasets
*Glycosylation sites - UniCarbKB [Mouse proteins]
 
*Human Glycosylation Sites (RCSB PDB)
SARS Coronavirus Datasets
*Mouse Glycosylation Sites (RCSB PDB)
 
*Glycosylation sites - UniCarbKB [Rat proteins]
<br />
*Rat Glycosylation Sites (UniProtKB)
 
*Rat Glycosylation Sites (RCSB PDB)
*Human Glycosylation Sites (UniProtKB; [https://data.glygen.org/GLY_000038 GLY_000038])
*Human Glycosylation Sites (GlyConnect)
*Mouse Glycosylation Sites (UniProtKB; [https://data.glygen.org/GLY_000039 GLY_000039])
*Mouse Glycosylation Sites (GlyConnect)
*Glycosylation Sites (UniCarbKB [Human proteins]; [https://data.glygen.org/GLY_000040 GLY_000040])
*Rat Glycosylation Sites (GlyConnect)
*Glycosylation sites (UniCarbKB [Mouse proteins]; [https://data.glygen.org/GLY_000041 GLY_000041])
*HCV1a Glycosylation Sites (Literature + UniCarbKB)
*Human Glycosylation Sites (RCSB PDB; [https://data.glygen.org/GLY_000042 GLY_000042])
*HCV1a Glycosylation Sites (UniProtKB)
*Mouse Glycosylation Sites (RCSB PDB; [https://data.glygen.org/GLY_000043 GLY_000043])
*HCV1b Glycosylation Sites (UniProtKB)
*Glycosylation sites (UniCarbKB [Rat proteins]; [https://data.glygen.org/GLY_000221 GLY_000221])
*SARS-CoV2 Glycosylation Sites (UniProtKB)
*Rat Glycosylation Sites (UniProtKB; [https://data.glygen.org/GLY_000224 GLY_000224])
*Glycosylation Sites - UniCarbKB [SARS CoV 2 proteins]
*Rat Glycosylation Sites (RCSB PDB; [https://data.glygen.org/GLY_000226 GLY_000226])
*Human Glycosylation Sites [GPTwiki]
*Human Glycosylation Sites (GlyConnect; [https://data.glygen.org/GLY_000329 GLY_000329])
*Human Glycosylation Sites [Automatic Literature Mining] [Automatically verified]
*Mouse Glycosylation Sites (GlyConnect; [https://data.glygen.org/GLY_000330 GLY_000330])
*SARS-CoV1 Glycosylation Sites (UniProtKB)
*Rat Glycosylation Sites (GlyConnect; [https://data.glygen.org/GLY_000331 GLY_000331])
*Human O-GlcNAc Glycosylation Sites (MCW)
*HCV1a Glycosylation Sites (Literature + UniCarbKB; [https://data.glygen.org/GLY_000335 GLY_000335])
*SARS-CoV2 Glycosylation sites
*HCV1a Glycosylation Sites (UniProtKB; [https://data.glygen.org/GLY_000382 GLY_000382])
*Human Glycosylation Sites UniCarbKB Glycomics Study
*HCV1b Glycosylation Sites (UniProtKB; [https://data.glygen.org/GLY_000383 GLY_000383])
*SARS-CoV1 Glycosylation Sites (Literature)
*SARS-CoV2 Glycosylation Sites (UniProtKB; [https://data.glygen.org/GLY_000473 GLY_000473])
*Glycosylation Sites (UniCarbKB [SARS CoV 2 proteins]; [https://data.glygen.org/GLY_000479 GLY_000479])
*Human Glycosylation Sites ([GPTwiki]; [https://data.glygen.org/GLY_000480 GLY_000480])
*Human Glycosylation Sites ([Automatic Literature Mining] [Automatically verified]; [https://data.glygen.org/GLY_000481 GLY_000481])
*SARS-CoV1 Glycosylation Sites (UniProtKB; [https://data.glygen.org/GLY_000495 GLY_000495])
*Human O-GlcNAc Glycosylation Sites (MCW; [https://data.glygen.org/GLY_000517 GLY_000517])
*SARS-CoV2 Glycosylation sites (UniprotKB; [https://data.glygen.org/GLY_000473 GLY_000473])
*Human Glycosylation Sites UniCarbKB Glycomics Study ([https://data.glygen.org/GLY_000611 GLY_000611])
*SARS-CoV1 Glycosylation Sites (Literature; [https://data.glygen.org/GLY_000612 GLY_000612])


==Data harmonization==
==Data harmonization==

Revision as of 21:12, 11 November 2021

The glycosylation section of the Protein information page in GlyGen provides the detailed list of glycosylation sites, reported glycans attached to the protein and predicted sites.

Glycosylation

Screenshot of the Glycosylation section on the Protein information page in GlyGen.

Glycosylation is presented in 4 tabs on the Protein information page in GlyGen. The Glycosylation summary provides the overall information about the section like the total number of sites (O,N,S,C linked sites), total number of N linked and O-linked sites with the total number of glycan annotations (structures) observed on the protein. Eg. 18 site(s) total, 169 N-linked annotation(s) at 17 site(s), 1 O-linked annotation(s) at 1 site(s)

Reported Sites with Glycan

This tab shows a table with the following columns:

  • Source - GlyGen evidence linking to the databases and papers that provided the glycosylation information
  • Type - Type of glycosylation. Eg. N-linked, O-linked
  • GlyTouCan ID - Unique accession assigned to the registered glycan structure in GlyTouCan database. Eg. G01543ZX
  • Glycan Image - Image of the glycan in SNFG format
  • Residue - Amino acid residue of the given protein along with its position.
  • Note - Additional information about the entry such as curation notes, O-glycosylation subtype, remarks, etc.

Reported Sites

Source of information

The Glycosylation data is collected and integrated from the resources such as UniProtKB, Glyconnect, UniCarbKB, RCSB PDB, The O-GlcNAc Database.

  • UniProtKB - only reported sites information and predicted information is downloaded from UniProtKB
  • Glyconnect - reported sites with glycan information on known and unknown residues is downloaded from Glyconnect
  • UniCarbKB - reported sites with glycan information on known and unknown residues is downloaded from UniCarbKB
  • RCSB PDB - only reported sites information is downloaded from RCSB PDB
  • The O-GlcNAc Database - Only O-GlcNAcylation reported sites with glycan information on known and unknown residues is downloaded from The O-GlcNAc database

Data access

The collected data is processed and stored in data.glygen.org in following datasets.

Homo Sapiens Datasets

Hepatitis C Virus Datasets

SARS Coronavirus Datasets


  • Human Glycosylation Sites (UniProtKB; GLY_000038)
  • Mouse Glycosylation Sites (UniProtKB; GLY_000039)
  • Glycosylation Sites (UniCarbKB [Human proteins]; GLY_000040)
  • Glycosylation sites (UniCarbKB [Mouse proteins]; GLY_000041)
  • Human Glycosylation Sites (RCSB PDB; GLY_000042)
  • Mouse Glycosylation Sites (RCSB PDB; GLY_000043)
  • Glycosylation sites (UniCarbKB [Rat proteins]; GLY_000221)
  • Rat Glycosylation Sites (UniProtKB; GLY_000224)
  • Rat Glycosylation Sites (RCSB PDB; GLY_000226)
  • Human Glycosylation Sites (GlyConnect; GLY_000329)
  • Mouse Glycosylation Sites (GlyConnect; GLY_000330)
  • Rat Glycosylation Sites (GlyConnect; GLY_000331)
  • HCV1a Glycosylation Sites (Literature + UniCarbKB; GLY_000335)
  • HCV1a Glycosylation Sites (UniProtKB; GLY_000382)
  • HCV1b Glycosylation Sites (UniProtKB; GLY_000383)
  • SARS-CoV2 Glycosylation Sites (UniProtKB; GLY_000473)
  • Glycosylation Sites (UniCarbKB [SARS CoV 2 proteins]; GLY_000479)
  • Human Glycosylation Sites ([GPTwiki]; GLY_000480)
  • Human Glycosylation Sites ([Automatic Literature Mining] [Automatically verified]; GLY_000481)
  • SARS-CoV1 Glycosylation Sites (UniProtKB; GLY_000495)
  • Human O-GlcNAc Glycosylation Sites (MCW; GLY_000517)
  • SARS-CoV2 Glycosylation sites (UniprotKB; GLY_000473)
  • Human Glycosylation Sites UniCarbKB Glycomics Study (GLY_000611)
  • SARS-CoV1 Glycosylation Sites (Literature; GLY_000612)

Data harmonization

Data filtering