NCBI RefSeq: reference sequence standards through 25 years of curation and annotation

Reference sequences and annotations serve as the foundation for many lines of research today, from organism and sequence identification to providing a core description of the genes, transcripts and proteins found in an organism's genome. Interpretation of data including transcriptomics, proteom...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Nucleic acids research Ročník 53; číslo D1; s. D243 - D257
Hlavní autoři: Goldfarb, Tamara, Kodali, Vamsi K, Pujar, Shashikant, Brover, Vyacheslav, Robbertse, Barbara, Farrell, Catherine M, Oh, Dong-Ha, Astashyn, Alexander, Ermolaeva, Olga, Haddad, Diana, Hlavina, Wratko, Hoffman, Jinna, Jackson, John D, Joardar, Vinita S, Kristensen, David, Masterson, Patrick, McGarvey, Kelly M, McVeigh, Richard, Mozes, Eyal, Murphy, Michael R, Schafer, Susan S, Souvorov, Alexander, Spurrier, Brett, Strope, Pooja K, Sun, Hanzhen, Vatsan, Anjana R, Wallin, Craig, Webb, David, Brister, J Rodney, Hatcher, Eneida, Kimchi, Avi, Klimke, William, Marchler-Bauer, Aron, Pruitt, Kim D, Thibaud-Nissen, Françoise, Murphy, Terence D
Médium: Journal Article
Jazyk:angličtina
Vydáno: England Oxford University Press 06.01.2025
Témata:
ISSN:0305-1048, 1362-4962, 1362-4962
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Reference sequences and annotations serve as the foundation for many lines of research today, from organism and sequence identification to providing a core description of the genes, transcripts and proteins found in an organism's genome. Interpretation of data including transcriptomics, proteomics, sequence variation and comparative analyses based on reference gene annotations informs our understanding of gene function and possible disease mechanisms, leading to new biomedical discoveries. The Reference Sequence (RefSeq) resource created at the National Center for Biotechnology Information (NCBI) leverages both automatic processes and expert curation to create a robust set of reference sequences of genomic, transcript and protein data spanning the tree of life. RefSeq continues to refine its annotation and quality control processes and utilize better quality genomes resulting from advances in sequencing technologies as well as RNA-Seq data to produce high-quality annotated genomes, ortholog predictions across more organisms and other products that are easily accessible through multiple NCBI resources. This report summarizes the current status of the eukaryotic, prokaryotic and viral RefSeq resources, with a focus on eukaryotic annotation, the increase in taxonomic representation and the effect it will have on comparative genomics. The RefSeq resource is publicly accessible at https://www.ncbi.nlm.nih.gov/refseq.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
The first two authors should be regarded as Joint First Authors.
ISSN:0305-1048
1362-4962
1362-4962
DOI:10.1093/nar/gkae1038