Current status and new features of the Consensus Coding Sequence database

The Consensus Coding Sequence (CCDS) project (http://www.ncbi.nlm.nih.gov/CCDS/) is a collaborative effort to maintain a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assemblies by the National Center for Biotechnology Information (NCBI) and...

Full description

Saved in:
Bibliographic Details
Published in:Nucleic acids research Vol. 42; no. D1; pp. D865 - D872
Main Authors: Farrell, Catherine M., O’Leary, Nuala A., Harte, Rachel A., Loveland, Jane E., Wilming, Laurens G., Wallin, Craig, Diekhans, Mark, Barrell, Daniel, Searle, Stephen M. J., Aken, Bronwen, Hiatt, Susan M., Frankish, Adam, Suner, Marie-Marthe, Rajput, Bhanu, Steward, Charles A., Brown, Garth R., Bennett, Ruth, Murphy, Michael, Wu, Wendy, Kay, Mike P., Hart, Jennifer, Rajan, Jeena, Weber, Janet, Snow, Catherine, Riddick, Lillian D., Hunt, Toby, Webb, David, Thomas, Mark, Tamez, Pamela, Rangwala, Sanjida H., McGarvey, Kelly M., Pujar, Shashikant, Shkeda, Andrei, Mudge, Jonathan M., Gonzalez, Jose M., Gilbert, James G. R., Trevanion, Stephen J., Baertsch, Robert, Harrow, Jennifer L., Hubbard, Tim, Ostell, James M., Haussler, David, Pruitt, Kim D.
Format: Journal Article
Language:English
Published: England Oxford University Press 01.01.2014
Subjects:
ISSN:0305-1048, 1362-4962, 1362-4962
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The Consensus Coding Sequence (CCDS) project (http://www.ncbi.nlm.nih.gov/CCDS/) is a collaborative effort to maintain a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assemblies by the National Center for Biotechnology Information (NCBI) and Ensembl genome annotation pipelines. Identical annotations that pass quality assurance tests are tracked with a stable identifier (CCDS ID). Members of the collaboration, who are from NCBI, the Wellcome Trust Sanger Institute and the University of California Santa Cruz, provide coordinated and continuous review of the dataset to ensure high-quality CCDS representations. We describe here the current status and recent growth in the CCDS dataset, as well as recent changes to the CCDS web and FTP sites. These changes include more explicit reporting about the NCBI and Ensembl annotation releases being compared, new search and display options, the addition of biologically descriptive information and our approach to representing genes for which support evidence is incomplete. We also present a summary of recent and future curation targets.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0305-1048
1362-4962
1362-4962
DOI:10.1093/nar/gkt1059