Protein details/Names

From GlyGen Wiki

The Names section of the Protein details page in GlyGen provides information about the protein and gene name(s) and synonym(s).

Names

Screenshot of the Names section on the Protein details page in GlyGen.

This section provides the following information:

  • Gene Name (Recommended): Official gene name/symbol assigned to the gene. Human gene names/symbols are approved by the HUGO Gene Nomenclature Committee (HGNC), mouse gene names/symbols by Mouse Genome Informatics (MGI), and rat gene names/symbols by the Rat Genome and Nomenclature Committee (RGNC). E.g. HGF
  • Gene Name (Synonyms): Synonyms for the given gene, assigned by UniProtKB and NCBI RefSeq. E.g. HPTA; DFNB39; F-TCF; HGFB; HPTA; SF; HGF
  • Protein Name (Recommended): Protein name assigned by UniProtKB, following international protein nomenclature guidelines. E.g. Hepatocyte growth factor
  • Protein Name (Synonyms): Synonyms for the given protein, assigned by UniProtKB, together with the NCBI RefSeq name with official nomenclature when available. E.g. Hepatocyte growth factor isoform 6
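
The four fields listed above can be pictured as a small record. The following is a minimal Python sketch using the HGF example values from this page; the class name `ProteinNames`, its field names, and the helper `split_synonyms` are hypothetical and chosen only for illustration. They do not describe how GlyGen stores or serves this data.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ProteinNames:
    """Hypothetical container mirroring the four fields of the Names section."""
    gene_name_recommended: str
    protein_name_recommended: str
    gene_name_synonyms: List[str] = field(default_factory=list)
    protein_name_synonyms: List[str] = field(default_factory=list)


def split_synonyms(raw: str) -> List[str]:
    """Split a semicolon-separated synonym string.

    Blank items and repeated names are dropped; first-seen order is kept.
    """
    seen: List[str] = []
    for part in raw.split(";"):
        name = part.strip()
        if name and name not in seen:
            seen.append(name)
    return seen


# Example values copied from the list above (HGF).
hgf = ProteinNames(
    gene_name_recommended="HGF",
    protein_name_recommended="Hepatocyte growth factor",
    gene_name_synonyms=split_synonyms("HPTA; DFNB39; F-TCF; HGFB; HPTA; SF; HGF"),
    protein_name_synonyms=["Hepatocyte growth factor isoform 6"],
)

print(hgf.gene_name_synonyms)
```

Removing repeats in `split_synonyms` is a design choice of this sketch, made because the same synonym may be contributed by more than one source database.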

Source of information

The Names data is collected and integrated from the UniProtKB and NCBI RefSeq databases.

  • UniProtKB - alternative names taken from UniProtKB protein accessions
  • NCBI RefSeq - primary and other alternative/synonym protein names taken from the NCBI RefSeq database

Data access

The collected data is processed and stored at data.glygen.org in the following datasets:

Homo sapiens (Human) Datasets

Hepatitis C Virus Datasets

  • HCV1a Protein Recommended Names (UniProtKB; GLY_000357)
  • HCV1a Protein Alternative Name (UniProtKB; GLY_000359)
  • HCV1b Protein Recommended Names (UniProtKB; GLY_000358)
  • HCV1b Protein Alternative Name (UniProtKB; GLY_000360)

SARS Coronavirus Datasets

  • SARS-CoV1 Protein Alternative Name (UniProtKB; GLY_000413)
  • SARS-CoV1 Gene Symbols (UniProtKB; GLY_000425)
  • SARS-CoV1 Protein Names (NCBI RefSeq; GLY_000437)
  • SARS-CoV1 Protein Recommended Names (UniProtKB; GLY_000438)
  • SARS-CoV2 Protein Names (NCBI RefSeq; GLY_000613)
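
Each dataset entry above follows the pattern "title (source; GLY_ identifier)". As a rough sketch, such entries could be split into their parts in Python as shown below. The names `DatasetEntry`, `parse_entry`, and `dataset_url` are hypothetical, and the URL layout (identifier appended directly to the data.glygen.org host) is an assumption that should be checked against the site before use.

```python
import re
from typing import NamedTuple, Optional


class DatasetEntry(NamedTuple):
    """Hypothetical record for one dataset line on this page."""
    title: str
    source: str
    dataset_id: str


# Matches "<title> (<source>; GLY_<6 digits>)", with an optional leading bullet.
ENTRY_RE = re.compile(
    r"^\s*(?:•\s*)?(?P<title>.+?)\s*"
    r"\((?P<source>[^;()]+);\s*(?P<id>GLY_\d{6})\)\s*$"
)


def parse_entry(line: str) -> Optional[DatasetEntry]:
    """Return the parsed entry, or None for lines that are not dataset entries."""
    m = ENTRY_RE.match(line)
    if m is None:
        return None
    return DatasetEntry(m.group("title"), m.group("source").strip(), m.group("id"))


def dataset_url(dataset_id: str) -> str:
    """Build a dataset page URL.

    Assumption: the identifier is appended directly to the host name.
    """
    return f"https://data.glygen.org/{dataset_id}"


entry = parse_entry("  • SARS-CoV1 Gene Symbols (UniProtKB; GLY_000425)")
print(entry)
```

Section headings such as "Hepatitis C Virus Datasets" do not match the pattern, so `parse_entry` returns `None` for them and they can be skipped when scanning the page.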

Mus musculus (Mouse) Datasets

Rattus norvegicus (Rat) Datasets

Data harmonization

Data filtering