Protein details: Difference between revisions

From GlyGen Wiki
Jump to navigation Jump to search
No edit summary
Line 32: Line 32:


===Mutagenesis===
===Mutagenesis===
{{Main|Protein details/Mutagenesis|l1 = Mutagenesis}}
{{Main|Protein details/Mutagenesis|l1 = Mutagenesis}}The Mutagenesis section of the Protein Details page in GlyGen provides a detailed list of variation start and end positions showing the reference and altered base including site annotations from corresponding publications. This information varies from resources in GlyGen such as The Orthologs Data Base (OrthoDB) which is a comprehensive catalog and resource of orthologs, ie descendants from a single gene of the last common ancestor of a specific phylogeny radiation, specific sites from integrated resource for protein post-translational modification network (iPTMnet). {{Expand section|small=no}}
{{Expand section|small=no}}


===GO Annotations===
===GO Annotations===

Revision as of 15:31, 27 May 2022

In the GlyGen portal the collected information of a protein is presented in the protein information or protein details page. This webpage is subdivided into sections presenting different pieces of information, such as glycosylation data, protein function, protein expression, references to other databases for a protein and publications about this protein.

Information sections

The collected information is presented in different section using mainly table representation or text. However there are also interactive sections that can be manipulated by users.

General

The General section of the Protein Details page contains identifying information about the protein from reference databases and sources. The Gene Name and Gene Location describe the gene that encodes the protein and the location with a link to the chromosome visualizer in Ensembl Gene. Additional fields retrieved from UniProtKB and NCBI RefSeq include UniProtKB ID, UniProtKB Accession, Protein Length, UniProtKB Entry Name, Chemical Mass, RefSeq Accession, RefSeq Name and Organism.

Glycosylation

Glycosylation is presented in GlyGen portal in tables in four different tabs of the Glycosylation section. The first tab "Reported Sites with Glycan" shows a list of glycosylation sites with glycan structures. These glycans can either be defined glycan structures, structures with missing information or compositions. The second tab "Reported Sites" list all glycosylation sites that have been extracted from other databases. These sites have been reported to be glycosylated but the glycan structure has not been identified or reported. The third tab "Predicted Only" shows the sites that have been predicted using different tools. These sites as well do not have glycan structures. The last tab "Text Mining" shows data derived from text mining by automatically extracting site information from PubMed abstracts.

Phosphorylation

The Phosphorylation section of the Protein Details page provides a list of phosphorylation sites that have been extracted from other databases. The Kinase Protein and Kinase Gene columns list the enzymes responsible for phosphorylation, if available. Annotations for experimentally determined phosphorylation sites are retrieved from UniProtKB.

Glycation

The Glycation section of the Protein Details page in GlyGen provides a detailed list of sites with non-enzymatic, covalently linked glucose residues. This section contains information about the type of attachment and the site of glycation. Annotations for experimentally determined glycation sites are retrieved from UniProtKB.

Names

The Names section of the Protein Details page in GlyGen lists the recommended full name of the gene and protein from the UniProtKB database. All other names that are used to represent the gene or protein are listed as synonyms.

Function

The Function section of the Protein Details page in GlyGen lists information about the biological function of the protein. This information is retrieved from the UniProtKB database and from the Gene Summary and GeneRIF sections of the NCBI RefSeq database.

Sequence

Single Nucleotide Variation

Single nucleotide variation is presented in GlyGen portal in tables in two different tabs of the Single Nucleotide Variation section. These are most commonly nonsynonymous mutations which cause a different amino acid to be produced at a given position. The first tab "Disease associated Mutations" shows a list of mutations in the genetic sequence that result in a disease. The second tab "Non-disease associated Mutations" lists all mutations in the genetic sequence that are not associated with a disease. This information is retrieved from the BioMuta database and from EBI-EMBL-UniProtKB.

Mutagenesis

The Mutagenesis section of the Protein Details page in GlyGen provides a detailed list of variation start and end positions showing the reference and altered base including site annotations from corresponding publications. This information varies from resources in GlyGen such as The Orthologs Data Base (OrthoDB) which is a comprehensive catalog and resource of orthologs, ie descendants from a single gene of the last common ancestor of a specific phylogeny radiation, specific sites from integrated resource for protein post-translational modification network (iPTMnet).

GO Annotations

Glycan Ligands

PTM Annotation

Proteoform Annotation

Pathway

Synthesized Glycans

Isoforms

Homologs

Disease

Expression Tissue

Expression Disease

Cross References

History

Publications

URL pattern

Programmatic access

Download options