TCRklass: a new K-string-based algorithm for human and mouse TCR repertoire characterization

The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR repertoire, we developed an integrated pipeline, TCRklass, using K-string-based algorithm that has significantly improved the accu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of immunology (1950) Jg. 194; H. 1; S. 446
Hauptverfasser: Yang, Xi, Liu, Di, Lv, Na, Zhao, Fangqing, Liu, Fei, Zou, Jing, Chen, Yan, Xiao, Xue, Wu, Jun, Liu, Peipei, Gao, Jing, Hu, Yongfei, Shi, Yi, Liu, Jun, Zhang, Ruifen, Chen, Chen, Ma, Juncai, Gao, George F, Zhu, Baoli
Format: Journal Article
Sprache:Englisch
Veröffentlicht: United States 01.01.2015
Schlagworte:
ISSN:1550-6606, 1550-6606
Online-Zugang:Weitere Angaben
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR repertoire, we developed an integrated pipeline, TCRklass, using K-string-based algorithm that has significantly improved the accuracy and performance over existing tools. We tested TCRklass using manually curated short read datasets in comparison with in silico datasets; it showed higher precision and recall rates on CDR3 identification. We applied TCRklass on large datasets of two human and three mouse TCR repertoires; it demonstrated higher reliability on CDR3 identification and much less biased V/J profiling, which are the two components contributing the diversity of the repertoire. Because of the sequencing cost, short paired-end reads generated by next-generation sequencing technology are and will remain the main source of data, and we believe that the TCRklass is a useful and reliable toolkit for TCR repertoire analysis.
AbstractList The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR repertoire, we developed an integrated pipeline, TCRklass, using K-string-based algorithm that has significantly improved the accuracy and performance over existing tools. We tested TCRklass using manually curated short read datasets in comparison with in silico datasets; it showed higher precision and recall rates on CDR3 identification. We applied TCRklass on large datasets of two human and three mouse TCR repertoires; it demonstrated higher reliability on CDR3 identification and much less biased V/J profiling, which are the two components contributing the diversity of the repertoire. Because of the sequencing cost, short paired-end reads generated by next-generation sequencing technology are and will remain the main source of data, and we believe that the TCRklass is a useful and reliable toolkit for TCR repertoire analysis.The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR repertoire, we developed an integrated pipeline, TCRklass, using K-string-based algorithm that has significantly improved the accuracy and performance over existing tools. We tested TCRklass using manually curated short read datasets in comparison with in silico datasets; it showed higher precision and recall rates on CDR3 identification. We applied TCRklass on large datasets of two human and three mouse TCR repertoires; it demonstrated higher reliability on CDR3 identification and much less biased V/J profiling, which are the two components contributing the diversity of the repertoire. Because of the sequencing cost, short paired-end reads generated by next-generation sequencing technology are and will remain the main source of data, and we believe that the TCRklass is a useful and reliable toolkit for TCR repertoire analysis.
The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR repertoire, we developed an integrated pipeline, TCRklass, using K-string-based algorithm that has significantly improved the accuracy and performance over existing tools. We tested TCRklass using manually curated short read datasets in comparison with in silico datasets; it showed higher precision and recall rates on CDR3 identification. We applied TCRklass on large datasets of two human and three mouse TCR repertoires; it demonstrated higher reliability on CDR3 identification and much less biased V/J profiling, which are the two components contributing the diversity of the repertoire. Because of the sequencing cost, short paired-end reads generated by next-generation sequencing technology are and will remain the main source of data, and we believe that the TCRklass is a useful and reliable toolkit for TCR repertoire analysis.
Author Shi, Yi
Hu, Yongfei
Liu, Peipei
Gao, George F
Zou, Jing
Zhao, Fangqing
Chen, Yan
Zhang, Ruifen
Wu, Jun
Lv, Na
Chen, Chen
Liu, Fei
Xiao, Xue
Zhu, Baoli
Liu, Di
Gao, Jing
Ma, Juncai
Yang, Xi
Liu, Jun
Author_xml – sequence: 1
  givenname: Xi
  surname: Yang
  fullname: Yang, Xi
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
– sequence: 2
  givenname: Di
  surname: Liu
  fullname: Liu, Di
  organization: Network Information Center, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 3
  givenname: Na
  surname: Lv
  fullname: Lv, Na
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 4
  givenname: Fangqing
  surname: Zhao
  fullname: Zhao, Fangqing
  organization: Beijing Institutes of Life Sciences, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 5
  givenname: Fei
  surname: Liu
  fullname: Liu, Fei
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 6
  givenname: Jing
  surname: Zou
  fullname: Zou, Jing
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 7
  givenname: Yan
  surname: Chen
  fullname: Chen, Yan
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 8
  givenname: Xue
  surname: Xiao
  fullname: Xiao, Xue
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 9
  givenname: Jun
  surname: Wu
  fullname: Wu, Jun
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 10
  givenname: Peipei
  surname: Liu
  fullname: Liu, Peipei
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 11
  givenname: Jing
  surname: Gao
  fullname: Gao, Jing
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 12
  givenname: Yongfei
  surname: Hu
  fullname: Hu, Yongfei
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang University, Hangzhou 310003, China; and
– sequence: 13
  givenname: Yi
  surname: Shi
  fullname: Shi, Yi
  organization: Beijing Institutes of Life Sciences, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 14
  givenname: Jun
  surname: Liu
  fullname: Liu, Jun
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; Chinese Center for Disease Control and Prevention, Beijing 102206, China
– sequence: 15
  givenname: Ruifen
  surname: Zhang
  fullname: Zhang, Ruifen
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China
– sequence: 16
  givenname: Chen
  surname: Chen
  fullname: Chen, Chen
  organization: Chinese Center for Disease Control and Prevention, Beijing 102206, China
– sequence: 17
  givenname: Juncai
  surname: Ma
  fullname: Ma, Juncai
  email: zhubaoli@im.ac.cn, gaof@im.ac.cn, ma@im.ac.cn
  organization: Network Information Center, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; zhubaoli@im.ac.cn gaof@im.ac.cn ma@im.ac.cn
– sequence: 18
  givenname: George F
  surname: Gao
  fullname: Gao, George F
  email: zhubaoli@im.ac.cn, gaof@im.ac.cn, ma@im.ac.cn
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; Beijing Institutes of Life Sciences, Chinese Academy of Sciences, Beijing 100101, China; Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang University, Hangzhou 310003, China; and Chinese Center for Disease Control and Prevention, Beijing 102206, China zhubaoli@im.ac.cn gaof@im.ac.cn ma@im.ac.cn
– sequence: 19
  givenname: Baoli
  surname: Zhu
  fullname: Zhu, Baoli
  email: zhubaoli@im.ac.cn, gaof@im.ac.cn, ma@im.ac.cn
  organization: Chinese Academy of Sciences Key Laboratory of Pathogenic Microbiology, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang University, Hangzhou 310003, China; and zhubaoli@im.ac.cn gaof@im.ac.cn ma@im.ac.cn
BackLink https://www.ncbi.nlm.nih.gov/pubmed/25404364$$D View this record in MEDLINE/PubMed
BookMark eNpNkE1LAzEYhINU7IfePUmOXrYm2SS7602KX1gQpN6E5d0k26ZukprsIvrrLVjB08xhnmGYKRr54A1C55TMOeHV1dY6N_jQzSknpKD0CE2oECSTksjRPz9G05S2hBBJGD9BYyb2eC75BL2tFi_vHaR0jQF784mfstRH69dZA8loDN06RNtvHG5DxJvBgcfgNXZhSAbvYRzNzsQ-2Giw2kAE1Ztov6G3wZ-i4xa6ZM4OOkOvd7erxUO2fL5_XNwsM5UXvM8UCNAcWA6GtrIpc6m5rpRo86otSwZSlUUllIK2AMoKkhutC62r0jRCU9KwGbr87d3F8DGY1NfOJmW6DrzZ76ypzCteFawQ--jFITo0zuh6F62D-FX_PcJ-AJ6aZyM
CitedBy_id crossref_primary_10_1016_j_leukres_2016_11_009
crossref_primary_10_1186_s12859_020_03669_1
crossref_primary_10_1093_bib_bbw138
crossref_primary_10_24072_pcjournal_547
crossref_primary_10_1038_ni_3522
crossref_primary_10_1128_jvi_00789_23
crossref_primary_10_1371_journal_pcbi_1004503
crossref_primary_10_1038_s41590_023_01441_0
crossref_primary_10_1007_s11427_020_1718_2
crossref_primary_10_1007_s40745_016_0086_8
crossref_primary_10_3389_fgene_2020_00771
crossref_primary_10_1038_s41590_023_01547_5
crossref_primary_10_1093_bib_bbz092
crossref_primary_10_1126_sciimmunol_aat7061
crossref_primary_10_1371_journal_pone_0166126
crossref_primary_10_1093_bib_bbx111
crossref_primary_10_1186_s12859_017_1853_z
crossref_primary_10_1038_nrg_2016_67
crossref_primary_10_1172_JCI123859
crossref_primary_10_4049_jimmunol_1601710
crossref_primary_10_1038_ni_3491
crossref_primary_10_1038_s41590_019_0414_1
ContentType Journal Article
Copyright Copyright © 2014 by The American Association of Immunologists, Inc.
Copyright_xml – notice: Copyright © 2014 by The American Association of Immunologists, Inc.
DBID CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.4049/jimmunol.1400711
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Medicine
Biology
EISSN 1550-6606
ExternalDocumentID 25404364
Genre Evaluation Studies
Research Support, Non-U.S. Gov't
Journal Article
GroupedDBID ---
-~X
.55
0R~
18M
2WC
34G
39C
53G
5GY
5RE
5VS
5WD
79B
85S
AARDX
ABCQX
ABDFA
ABEJV
ABGNP
ABJNI
ABOCM
ABPPZ
ABXVV
ACGFO
ACGFS
ACIWK
ACNCT
ACPRK
ADBBV
ADIPN
ADNWM
AENEX
AETEA
AFHIN
AFOSN
AFRAH
AGORE
AHMMS
AHWXS
AIZAD
ALMA_UNASSIGNED_HOLDINGS
ARBBW
BAWUL
BCRHZ
BTFSW
CGR
CUY
CVF
D0L
DIK
DU5
E3Z
EBS
ECM
EIF
EJD
F5P
GX1
IH2
K-O
KOP
KQ8
L7B
NPM
OCZFY
OK1
OWPYF
P0W
P2P
PQQKQ
R.V
RHI
ROX
RZQ
SJN
TR2
TWZ
W8F
WH7
WOQ
X7M
XJT
XSW
XTH
YHG
7X8
ID FETCH-LOGICAL-c374t-ca5ad4a23ae1f6b836d4d9c5f39f882a6c8795ccaf7a12703edd7dd98eb5d10b2
IEDL.DBID 7X8
ISICitedReferencesCount 22
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000346700500047&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1550-6606
IngestDate Thu Oct 02 09:50:12 EDT 2025
Mon Jul 21 05:37:36 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License Copyright © 2014 by The American Association of Immunologists, Inc.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c374t-ca5ad4a23ae1f6b836d4d9c5f39f882a6c8795ccaf7a12703edd7dd98eb5d10b2
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Undefined-1
ObjectType-Feature-3
content type line 23
OpenAccessLink https://www.jimmunol.org/content/jimmunol/194/1/446.full.pdf
PMID 25404364
PQID 1639497275
PQPubID 23479
ParticipantIDs proquest_miscellaneous_1639497275
pubmed_primary_25404364
PublicationCentury 2000
PublicationDate 2015-01-01
PublicationDateYYYYMMDD 2015-01-01
PublicationDate_xml – month: 01
  year: 2015
  text: 2015-01-01
  day: 01
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle The Journal of immunology (1950)
PublicationTitleAlternate J Immunol
PublicationYear 2015
SSID ssj0006024
Score 2.3093421
Snippet The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 446
SubjectTerms Algorithms
Amino Acid Sequence
Animals
Automatic Data Processing - methods
Base Sequence
High-Throughput Nucleotide Sequencing - methods
Humans
Mice
Molecular Sequence Data
Receptors, Antigen, T-Cell - analysis
Receptors, Antigen, T-Cell - genetics
Receptors, Antigen, T-Cell - immunology
Reproducibility of Results
Sequence Analysis, DNA
V(D)J Recombination - genetics
Title TCRklass: a new K-string-based algorithm for human and mouse TCR repertoire characterization
URI https://www.ncbi.nlm.nih.gov/pubmed/25404364
https://www.proquest.com/docview/1639497275
Volume 194
WOSCitedRecordID wos000346700500047&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpZ1LSwMxEMeDWhUvPuqrvojgNdg22Ue8iBSLoC1FKvQgLLNJtlbrbm0XwW_vZLO1J0HwsuwlS8hj5rczyfwJufBiKXA_19H6JZLhKzCkcM5AaeXbe2kaCtWSh6DbDQcD2SsDbrPyWOXcJhaGWmfKxsgvkRukkOhtvevJB7OqUTa7WkpoLJMKR5SxR7qCwaJauF93orZI4cxHUndpSoFQfPk6stcvsjFaCutlG78DZuFo2lv_7eI22SwRk964NbFDlkxaJWtOdPKrStY7ZTp9lzz3W49vFp-vKFDka3rPrIxHOmTWuWkK4yF-P395p4i2tJDzo5BqasMFhmJjOjUTM80ztJtU_ZR-djc798hT-7bfumOl3AJTPBA5U-CBFtDkYBqJH4fc10JL5SVcJsjh4CsrTI4zngRg89XcaB1oLUMTe7pRj5v7ZCXNUnNIKASxxgaCxxAKgUyCflDzegIqjrnhukbO5yMY4XK2OQpIDfY9WoxhjRy4aYgmru5GhP-ytmC-OPpD62OygWjjuWDJCakkuJnNKVlVn_loNj0r1gk-u73ON--eyfY
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=TCRklass%3A+a+new+K-string-based+algorithm+for+human+and+mouse+TCR+repertoire+characterization&rft.jtitle=The+Journal+of+immunology+%281950%29&rft.au=Yang%2C+Xi&rft.au=Liu%2C+Di&rft.au=Lv%2C+Na&rft.au=Zhao%2C+Fangqing&rft.date=2015-01-01&rft.issn=1550-6606&rft.eissn=1550-6606&rft.volume=194&rft.issue=1&rft.spage=446&rft_id=info:doi/10.4049%2Fjimmunol.1400711&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1550-6606&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1550-6606&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1550-6606&client=summon