Data release notes

From GlyGen Wiki
Revision as of 17:02, 27 July 2020 by Xiying (talk | contribs)
Jump to navigation Jump to search

The GlyGen datasets collection is updataed every 2 weeks.

Version 1.5.36

Main article: Data release notes/1.5.36

Related API version: 1.5.43

This version was released at July 20th, 2020

  1. Added O-GlcNAc data extracted from the literature by Stephanie Olivier’s group (GLYDS000518)
  2. Added germline and somatic variation data that has effect on glycosite (loss of glycosylation site and gain of glycosylation sequon) to the mutation section
  3. Added the literature extracted glycosite for SARS-CoV1 M protein
  4. Added glycosylation subtypes to the *_proteoform_glycosylation_sites_uniprotkb.csv
  5. Added species annotation via subsumption (for human, mouse). Rat and HCV to follow in the next release.
  6. Added updated GlyConnect data. (additional o-GlcNAc sites)
  7. Added glycosylation sites through text mining (first iteration).

Version 1.5.18

Main article: Data release notes/1.5.18

Related API version: 1.5.26

This version was released at April 15th, 2020

  1. Added UniProtKB Gene synonyms (search and details)
  2. Added RefSeq Gene names and synonyms
  3. Added RefSeq Protein synonyms
  4. Added UniProtKB Protein synonyms
  5. Updated the Fasta headers of the protein sequences that now resembles the fasta header of UniProtKB sequences
  6. Updated BioMuta data with addition of comments that shows which filters were passed
  7. Created new datasets:mutation literature mining dataset, dbSNP somatic and germline mutations datasets.
  8. Added GlyGen to Pharos Xref in the protein detail cross-reference section
  9. Added HCV 1a and 1b, SARS-CoV1 and 2 proteomes
  10. Added the MIM disease name where DO names were not available
  11. Updated glycan species annotations.
  12. Added Human, Mouse, Rat glycosylation data from GlyConnect.
  13. Added HCV1a glycosylation data from 1 publication.
  14. Added human glycosylation data from 2 publications.
  15. Added glycan x-refs to MatrixDB, GlycoEpitope
  16. Added MatrixDB Protein-GAGs interaction data. (at GlyGen Data)
  17. Included GlyTouCan-composition accessions.
  18. Added protein x-refs to GlycoProtDB
  19. Retired GlycO and GlycomeDB xrefs .
  20. Added SNFG glycans (at GlyGen Data)
  21. Added animated GIF and .mp4 video of 3D model of SARS-CoV-2 spike glycoprotein. (at GlyGen Data)
  22. Updated the synthesized glycan list from Dr. Boons group.
  23. Included additional 2 FAQs: How do I find a GlyTouCan Boons accession for my glycan composition? and How can I convert my glycan sequence to different formats (e.g IUPAC, WURCS, GlycoCT, LinearCode, etc.)?

Version 1.0

Main article: Data release notes/1.0

This version was released at Nov 22, 2019

  • Isoform Alignment.
  • Homolog Alignment.
  • New Usecase added in the quick search.
  • Composition Search.
  • Go ID search.
  • PMID search.
  • Batch search on advanced protein and glycan search page.
  • Multi-select option for amino acids on advanced glycoprotein search.
  • Multi-select option for organisms on advanced glycan search.
  • Integrate subsumption browser.

External links

https://data.glygen.org/history