Three assays for in-solution enrichment of ancient human DNA at more than a million SNPs

The strategy of in-solution enrichment for hundreds of thousands of single-nucleotide polymorphisms (SNPs) has been used to analyze >70% of individuals with genome-scale ancient DNA published to date. This approach makes it economical to study ancient samples with low proportions of human DNA and...

Full description

Saved in:
Bibliographic Details
Published in:Genome research Vol. 32; no. 11-12; p. 2068
Main Authors: Rohland, Nadin, Mallick, Swapan, Mah, Matthew, Maier, Robert, Patterson, Nick, Reich, David
Format: Journal Article
Language:English
Published: United States 01.11.2022
Subjects:
ISSN:1549-5469, 1549-5469
Online Access:Get more information
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract The strategy of in-solution enrichment for hundreds of thousands of single-nucleotide polymorphisms (SNPs) has been used to analyze >70% of individuals with genome-scale ancient DNA published to date. This approach makes it economical to study ancient samples with low proportions of human DNA and increases the rate of conversion of sampled remains into interpretable data. So far, nearly all such data have been generated using a set of bait sequences targeting about 1.24 million SNPs (the "1240k reagent"), but synthesis of the reagent has been cost-effective for only a few laboratories. In 2021, two companies, Daicel Arbor Biosciences and Twist Bioscience, made available assays that target the same core set of SNPs along with supplementary content. We test all three assays on a common set of 27 ancient DNA libraries and show that all three are effective at enriching many hundreds of thousands of SNPs. For all assays, one round of enrichment produces data that are as useful as two. In our testing, the "Twist Ancient DNA" assay produces the highest coverages, greatest uniformity on targeted positions, and almost no bias toward enriching one allele more than another relative to shotgun sequencing. We also identify hundreds of thousands of targeted SNPs for which there is minimal allelic bias when comparing 1240k data to either shotgun or Twist data. This facilitates coanalysis of the large data sets that have been generated using 1240k and Twist capture, as well as shotgun sequencing approaches.
AbstractList The strategy of in-solution enrichment for hundreds of thousands of single-nucleotide polymorphisms (SNPs) has been used to analyze >70% of individuals with genome-scale ancient DNA published to date. This approach makes it economical to study ancient samples with low proportions of human DNA and increases the rate of conversion of sampled remains into interpretable data. So far, nearly all such data have been generated using a set of bait sequences targeting about 1.24 million SNPs (the "1240k reagent"), but synthesis of the reagent has been cost-effective for only a few laboratories. In 2021, two companies, Daicel Arbor Biosciences and Twist Bioscience, made available assays that target the same core set of SNPs along with supplementary content. We test all three assays on a common set of 27 ancient DNA libraries and show that all three are effective at enriching many hundreds of thousands of SNPs. For all assays, one round of enrichment produces data that are as useful as two. In our testing, the "Twist Ancient DNA" assay produces the highest coverages, greatest uniformity on targeted positions, and almost no bias toward enriching one allele more than another relative to shotgun sequencing. We also identify hundreds of thousands of targeted SNPs for which there is minimal allelic bias when comparing 1240k data to either shotgun or Twist data. This facilitates coanalysis of the large data sets that have been generated using 1240k and Twist capture, as well as shotgun sequencing approaches.The strategy of in-solution enrichment for hundreds of thousands of single-nucleotide polymorphisms (SNPs) has been used to analyze >70% of individuals with genome-scale ancient DNA published to date. This approach makes it economical to study ancient samples with low proportions of human DNA and increases the rate of conversion of sampled remains into interpretable data. So far, nearly all such data have been generated using a set of bait sequences targeting about 1.24 million SNPs (the "1240k reagent"), but synthesis of the reagent has been cost-effective for only a few laboratories. In 2021, two companies, Daicel Arbor Biosciences and Twist Bioscience, made available assays that target the same core set of SNPs along with supplementary content. We test all three assays on a common set of 27 ancient DNA libraries and show that all three are effective at enriching many hundreds of thousands of SNPs. For all assays, one round of enrichment produces data that are as useful as two. In our testing, the "Twist Ancient DNA" assay produces the highest coverages, greatest uniformity on targeted positions, and almost no bias toward enriching one allele more than another relative to shotgun sequencing. We also identify hundreds of thousands of targeted SNPs for which there is minimal allelic bias when comparing 1240k data to either shotgun or Twist data. This facilitates coanalysis of the large data sets that have been generated using 1240k and Twist capture, as well as shotgun sequencing approaches.
The strategy of in-solution enrichment for hundreds of thousands of single-nucleotide polymorphisms (SNPs) has been used to analyze >70% of individuals with genome-scale ancient DNA published to date. This approach makes it economical to study ancient samples with low proportions of human DNA and increases the rate of conversion of sampled remains into interpretable data. So far, nearly all such data have been generated using a set of bait sequences targeting about 1.24 million SNPs (the "1240k reagent"), but synthesis of the reagent has been cost-effective for only a few laboratories. In 2021, two companies, Daicel Arbor Biosciences and Twist Bioscience, made available assays that target the same core set of SNPs along with supplementary content. We test all three assays on a common set of 27 ancient DNA libraries and show that all three are effective at enriching many hundreds of thousands of SNPs. For all assays, one round of enrichment produces data that are as useful as two. In our testing, the "Twist Ancient DNA" assay produces the highest coverages, greatest uniformity on targeted positions, and almost no bias toward enriching one allele more than another relative to shotgun sequencing. We also identify hundreds of thousands of targeted SNPs for which there is minimal allelic bias when comparing 1240k data to either shotgun or Twist data. This facilitates coanalysis of the large data sets that have been generated using 1240k and Twist capture, as well as shotgun sequencing approaches.
Author Patterson, Nick
Maier, Robert
Rohland, Nadin
Mah, Matthew
Mallick, Swapan
Reich, David
Author_xml – sequence: 1
  givenname: Nadin
  orcidid: 0000-0002-8112-9601
  surname: Rohland
  fullname: Rohland, Nadin
  organization: Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
– sequence: 2
  givenname: Swapan
  orcidid: 0000-0002-8442-9757
  surname: Mallick
  fullname: Mallick, Swapan
  organization: Howard Hughes Medical Institute, Boston, Massachusetts 02115, USA
– sequence: 3
  givenname: Matthew
  surname: Mah
  fullname: Mah, Matthew
  organization: Howard Hughes Medical Institute, Boston, Massachusetts 02115, USA
– sequence: 4
  givenname: Robert
  surname: Maier
  fullname: Maier, Robert
  organization: Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA
– sequence: 5
  givenname: Nick
  orcidid: 0000-0002-2220-3648
  surname: Patterson
  fullname: Patterson, Nick
  organization: Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA
– sequence: 6
  givenname: David
  orcidid: 0000-0002-7037-5292
  surname: Reich
  fullname: Reich, David
  organization: Department of Human Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/36517229$$D View this record in MEDLINE/PubMed
BookMark eNpNkEtPwzAQhC1URB9w5Ip85JJiO37lWJXykKqCRJG4RY6zIUGJXezk0H9PK4rEaT6NZle7M0Uj5x0gdE3JnFJC7z7DnCmpmJ5Txs7QhAqeJYLLbPSPx2ga4xchJOVaX6BxKgVVjGUT9LGtAwA2MZp9xJUPuHFJ9O3QN95hcKGxdQeux77CxtnmiPXQGYfvNwtsetz5ALivD4bBXdO2x7G3zWu8ROeVaSNcnXSG3h9W2-VTsn55fF4u1olNJekTRa0QICyvjGUyFeJwo8g4kFJpK8BSrooipVCV0krCISWl1ipTJVGGqSJjM3T7u3cX_PcAsc-7JlpoW-PADzFnSnAtD68fozen6FB0UOa70HQm7PO_NtgPeoVhsw
CitedBy_id crossref_primary_10_1186_s12915_025_02299_4
crossref_primary_10_1016_j_isci_2025_112871
crossref_primary_10_1007_s12520_025_02186_7
crossref_primary_10_1038_s41559_023_02211_9
crossref_primary_10_1002_elps_202300285
crossref_primary_10_1016_j_fsigen_2024_103217
crossref_primary_10_1016_j_cub_2024_07_063
crossref_primary_10_1111_jse_13029
crossref_primary_10_1186_s13059_025_03707_2
crossref_primary_10_1093_g3journal_jkaf172
crossref_primary_10_1038_s41586_025_08913_3
crossref_primary_10_1093_molbev_msae076
crossref_primary_10_1186_s13059_024_03405_5
crossref_primary_10_1016_j_xpro_2024_102985
crossref_primary_10_1186_s12915_024_02068_9
crossref_primary_10_1016_j_jas_2025_106178
crossref_primary_10_1038_s41598_022_26799_3
crossref_primary_10_1016_j_jgg_2024_07_008
crossref_primary_10_1038_s41586_024_08531_5
crossref_primary_10_1016_j_ab_2024_115636
crossref_primary_10_1371_journal_pone_0302646
crossref_primary_10_1038_s41467_025_63172_0
crossref_primary_10_1016_j_cell_2023_10_018
crossref_primary_10_1038_s41586_024_08496_5
crossref_primary_10_1038_s41588_023_01582_w
crossref_primary_10_1186_s12915_025_02286_9
crossref_primary_10_1016_j_jasrep_2023_104099
crossref_primary_10_1038_s41467_024_55740_7
crossref_primary_10_1093_molbev_msaf139
crossref_primary_10_1371_journal_pone_0293434
crossref_primary_10_1038_s41598_023_45612_3
crossref_primary_10_3390_genes16010023
crossref_primary_10_1007_s12520_024_02033_1
crossref_primary_10_3390_genes15091218
crossref_primary_10_1111_1755_0998_13869
crossref_primary_10_1186_s13059_024_03462_w
crossref_primary_10_1016_j_cub_2024_02_059
crossref_primary_10_1038_s41598_024_69741_5
crossref_primary_10_1093_genetics_iyae212
crossref_primary_10_1016_j_xgen_2025_100976
crossref_primary_10_1038_s41598_025_99743_w
crossref_primary_10_1016_j_isci_2024_111405
crossref_primary_10_1038_s41467_025_60368_2
crossref_primary_10_1038_s41586_025_09103_x
crossref_primary_10_1093_biomtc_ujae107
crossref_primary_10_1126_science_adr3326
crossref_primary_10_1016_j_cub_2023_09_055
crossref_primary_10_1016_j_celrep_2025_115262
crossref_primary_10_1371_journal_pgen_1010931
crossref_primary_10_1186_s12915_025_02343_3
crossref_primary_10_1093_bib_bbae646
crossref_primary_10_1186_s13059_025_03622_6
crossref_primary_10_1007_s12520_024_02130_1
crossref_primary_10_1038_s41586_025_08793_7
crossref_primary_10_1038_s42003_023_05642_z
ContentType Journal Article
Copyright 2022 Rohland et al.; Published by Cold Spring Harbor Laboratory Press.
Copyright_xml – notice: 2022 Rohland et al.; Published by Cold Spring Harbor Laboratory Press.
DBID CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.1101/gr.276728.122
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Anatomy & Physiology
Chemistry
Biology
EISSN 1549-5469
ExternalDocumentID 36517229
Genre Research Support, Non-U.S. Gov't
Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: Howard Hughes Medical Institute
GroupedDBID ---
.GJ
18M
29H
2WC
39C
4.4
53G
5GY
5RE
5VS
AAYOK
AAZTW
ABDIX
ABDNZ
ACGFO
ACYGS
ADBBV
ADNWM
AEILP
AENEX
AHPUY
AI.
ALMA_UNASSIGNED_HOLDINGS
BAWUL
BTFSW
C1A
CGR
CS3
CUY
CVF
DIK
DU5
E3Z
EBS
ECM
EIF
EJD
F5P
FRP
GX1
H13
HYE
IH2
K-O
KQ8
MV1
NPM
R.V
RCX
RHF
RHI
RNS
RPM
RXW
SJN
TAE
TR2
VH1
W8F
WOQ
YKV
ZCG
ZGI
ZXP
7X8
ACLKE
ID FETCH-LOGICAL-c360t-71c55e5c4fac26355003594e0d78c5ec147bb31efd6c604e30d88797d07a27b92
IEDL.DBID 7X8
ISICitedReferencesCount 65
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000933587400009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1549-5469
IngestDate Thu Sep 04 19:19:08 EDT 2025
Wed Feb 19 02:25:56 EST 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 11-12
Language English
License 2022 Rohland et al.; Published by Cold Spring Harbor Laboratory Press.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c360t-71c55e5c4fac26355003594e0d78c5ec147bb31efd6c604e30d88797d07a27b92
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0002-2220-3648
0000-0002-8442-9757
0000-0002-8112-9601
0000-0002-7037-5292
OpenAccessLink https://genome.cshlp.org/content/32/11-12/2068.full.pdf
PMID 36517229
PQID 2754860009
PQPubID 23479
ParticipantIDs proquest_miscellaneous_2754860009
pubmed_primary_36517229
PublicationCentury 2000
PublicationDate 2022 Nov-Dec
20221101
PublicationDateYYYYMMDD 2022-11-01
PublicationDate_xml – month: 11
  year: 2022
  text: 2022 Nov-Dec
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Genome research
PublicationTitleAlternate Genome Res
PublicationYear 2022
SSID ssj0003488
Score 2.6290815
Snippet The strategy of in-solution enrichment for hundreds of thousands of single-nucleotide polymorphisms (SNPs) has been used to analyze >70% of individuals with...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 2068
SubjectTerms DNA - genetics
DNA, Ancient - analysis
Gene Library
Humans
Polymorphism, Single Nucleotide
Sequence Analysis, DNA
Title Three assays for in-solution enrichment of ancient human DNA at more than a million SNPs
URI https://www.ncbi.nlm.nih.gov/pubmed/36517229
https://www.proquest.com/docview/2754860009
Volume 32
WOSCitedRecordID wos000933587400009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07T8MwELaAgmDh0fIoLxkJsbl1nDhuJlQKFQNElSioW-XaDnQgKU1B6r_n7CZ0QkJisbycFNvny-fz-fsQuvSNGCVGC-JpKkngJZRIagSRgivFAJL7TkXh5UHEcWswiHpFwi0vyirLmOgCtc6UzZE3meBWLwkgwfXkg1jVKHu7WkhorKKKD1DGerUYLNnC_cDpTloWMsLhHPjDsek1X6cNJkLBWg3P6ub-hi7dX6a789_v20XbBb7E7YVD7KEVk1ZRrZ3C2fp9jq-wq_h0qfQq2rgpe5udUvethgZ9WF6DAVTLeY4B0-JxSkoPxeBuY_VmU4o4S7CT9oWuU_rDt3Ebyxm2pbvYZuSxxFbUyJo9xb18Hz137_qde1LoLxDlh3RGhKc4N1wFiVSWs4Y7vr_AUC1aihvlBWI08j2T6FCFNDA-1RCyIqGpkEyMInaA1tIsNUcIS6m0li2IDoC_wEQyA6HARDRKQq4SWkcX5awOYbj20kKmJvvMh8t5raPDxdIMJwsijqEfcsBfLDr-g_UJ2mL25YJ7RniKKgnsbnOG1tXXbJxPz53jQBv3Hr8B8sjMNQ
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Three+assays+for+in-solution+enrichment+of+ancient+human+DNA+at+more+than+a+million+SNPs&rft.jtitle=Genome+research&rft.au=Rohland%2C+Nadin&rft.au=Mallick%2C+Swapan&rft.au=Mah%2C+Matthew&rft.au=Maier%2C+Robert&rft.date=2022-11-01&rft.issn=1549-5469&rft.eissn=1549-5469&rft.volume=32&rft.issue=11-12&rft.spage=2068&rft_id=info:doi/10.1101%2Fgr.276728.122&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1549-5469&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1549-5469&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1549-5469&client=summon