CARE 2.0: reducing false-positive sequencing error corrections using machine learning

Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BMC bioinformatics Jg. 23; H. 1; S. 227 - 17
Hauptverfasser: Kallenborn, Felix, Cascitti, Julian, Schmidt, Bertil
Format: Journal Article
Sprache:Englisch
Veröffentlicht: London BioMed Central 13.06.2022
BioMed Central Ltd
Springer Nature B.V
BMC
Schlagworte:
ISSN:1471-2105, 1471-2105
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k -mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. Results We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0’s hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k -mer analysis show the applicability of CARE 2.0 to real-world data. Conclusion False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k -mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE .
AbstractList Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0's hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data. False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE.
Abstract Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. Results We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0’s hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data. Conclusion False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE .
Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. Results We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0's hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data. Conclusion False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at Keywords: Next-generation sequencing, Error correction, Machine learning
Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k -mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. Results We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0’s hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k -mer analysis show the applicability of CARE 2.0 to real-world data. Conclusion False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k -mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE .
Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0's hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data. False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE .
Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools.BACKGROUNDNext-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools.We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0's hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data.RESULTSWe present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0's hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data.False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE .CONCLUSIONFalse-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE .
Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. Results We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0’s hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data. Conclusion False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE.
ArticleNumber 227
Audience Academic
Author Kallenborn, Felix
Schmidt, Bertil
Cascitti, Julian
Author_xml – sequence: 1
  givenname: Felix
  orcidid: 0000-0003-4516-6357
  surname: Kallenborn
  fullname: Kallenborn, Felix
  email: kallenborn@uni-mainz.de
  organization: Department of Computer Science, Johannes Gutenberg University Mainz
– sequence: 2
  givenname: Julian
  surname: Cascitti
  fullname: Cascitti, Julian
  organization: Department of Computer Science, Johannes Gutenberg University Mainz
– sequence: 3
  givenname: Bertil
  orcidid: 0000-0003-2597-8331
  surname: Schmidt
  fullname: Schmidt, Bertil
  organization: Department of Computer Science, Johannes Gutenberg University Mainz
BackLink https://www.ncbi.nlm.nih.gov/pubmed/35698033$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAUhSNURB_wB1igSGxgkcGPOI5ZVBqNCoxUCanQteW5uUk9ytiDnVTw73FmWtqpUOVFoutzPjsn5zQ7ct5hlr2lZEZpXX2KlNVCFYSxgpRSlAV_kZ3QUtKCUSKOHr0fZ6cxrgmhsibiVXbMRaVqwvlJdr2YX13kbEY-5wGbEazr8tb0EYutj3awt5hH_DWi2-1gCD7k4ENAGKx3MR_jNN8YuLEO8x5NcGnwOnu5g7y5e55l118ufi6-FZffvy4X88sChKRDQaEiwAlgiYyhgBZXTUtXRJZ1w5WpEWWjoGyRGKOgaYBUilFFVrIspTSUn2XLPbfxZq23wW5M-KO9sXo38KHTJgwWetQloBQmYWtkZSVUncJpKSAoRduGYmKd71nbcbXBBtANwfQH0MMdZ29052-1okpwNl3mwx0g-JRYHPTGRsC-Nw79GDWrZCUEJ0Ql6fsn0rUfg0tRTap0tTr9qQdVZ9IHWNf6dC5MUD2XJC0u5cSa_UeVVoMbC6kxrU3zA8PHA0PSDPh76MwYo17-uDrUvnscyr807guUBPVeAMHHGLDVYAczdSPdwvaaEj11Ve-7qlNX9a6rerKyJ9Z7-rMmvjfFJHYdhofknnH9BUwC-YU
CitedBy_id crossref_primary_10_1038_s41598_024_52386_9
crossref_primary_10_3390_ijms252413250
crossref_primary_10_1002_qub2_99
crossref_primary_10_1016_j_ymeth_2023_06_007
crossref_primary_10_1186_s12859_024_05681_1
crossref_primary_10_1016_j_csbj_2024_05_025
crossref_primary_10_1016_j_jaci_2025_06_015
crossref_primary_10_1016_j_drudis_2024_103990
crossref_primary_10_1093_bfgp_elad026
crossref_primary_10_1186_s12859_024_05802_w
Cites_doi 10.1093/bioinformatics/btw146
10.1093/bioinformatics/btv290
10.1038/s41598-019-51418-z
10.1093/bioinformatics/btv415
10.1093/bioinformatics/btu440
10.1093/bioinformatics/btr170
10.1093/bioinformatics/btaa738
10.1186/s12859-017-1784-8
10.1093/bioinformatics/btr708
10.1186/s12859-021-04547-0
10.1093/bioinformatics/btu368
10.1093/bioinformatics/btt086
10.1145/270563.571472
10.1038/s41598-019-52196-4
10.1101/gr.111351.110
10.1093/bioinformatics/btt407
10.1093/bioinformatics/btu856
10.1093/bioinformatics/btw746
10.1101/gr.126953.111
10.1186/s12859-019-2906-2
10.1093/bioinformatics/btz102
10.1093/bioinformatics/btr011
10.1089/cmb.2012.0021
10.1093/bioinformatics/bts690
10.1186/s13059-014-0509-9
10.1023/A:1010933404324
ContentType Journal Article
Copyright The Author(s) 2022
2022. The Author(s).
COPYRIGHT 2022 BioMed Central Ltd.
2022. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: The Author(s) 2022
– notice: 2022. The Author(s).
– notice: COPYRIGHT 2022 BioMed Central Ltd.
– notice: 2022. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID C6C
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
3V.
7QO
7SC
7X7
7XB
88E
8AL
8AO
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABUWG
AEUYN
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
JQ2
K7-
K9.
L7M
LK8
L~C
L~D
M0N
M0S
M1P
M7P
P5Z
P62
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
Q9U
7X8
5PM
DOA
DOI 10.1186/s12859-022-04754-3
DatabaseName SpringerLink Open Access Journals
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Computing Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest One Sustainability (subscription)
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
Biological Science Collection (subscription)
ProQuest Central
Technology collection
Natural Science Collection
ProQuest One Community College
ProQuest Central
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
ProQuest SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
ProQuest Biological Science Collection
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
Health & Medical Collection (Alumni Edition)
PML(ProQuest Medical Library)
Biological Science Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
SciTech Premium Collection
ProQuest Central China
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
Health Research Premium Collection
Natural Science Collection
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
ProQuest One Academic (New)
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Advanced Technologies Database with Aerospace
ProQuest Computing
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest Medical Library
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList



MEDLINE

MEDLINE - Academic
Publicly Available Content Database
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: PIMPY
  name: Publicly Available Content Database
  url: http://search.proquest.com/publiccontent
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2105
EndPage 17
ExternalDocumentID oai_doaj_org_article_4ce75ad398e246598147f1cec991fd1e
PMC9195321
A707073779
35698033
10_1186_s12859_022_04754_3
Genre Journal Article
GeographicLocations Germany
GeographicLocations_xml – name: Germany
GrantInformation_xml – fundername: Johannes Gutenberg-Universität Mainz (1030)
– fundername: DeCoDeML Project by Rhein-Main-University Network
– fundername: ;
GroupedDBID ---
0R~
23N
2WC
53G
5VS
6J9
7X7
88E
8AO
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADMLS
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CS3
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
ICD
IHR
INH
INR
ISR
ITC
K6V
K7-
KQ8
LK8
M1P
M48
M7P
MK~
ML0
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XH6
XSB
AAYXX
AFFHD
CITATION
ALIPV
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7QO
7SC
7XB
8AL
8FD
8FK
FR3
JQ2
K9.
L7M
L~C
L~D
M0N
P64
PKEHL
PQEST
PQUKI
PRINS
Q9U
7X8
5PM
ID FETCH-LOGICAL-c571t-1c60c30ce4e22e5cfebdf1b0748d39a8ee7d9c4fe0aa9cddc0692190b74477a13
IEDL.DBID RSV
ISICitedReferencesCount 10
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000810679500002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1471-2105
IngestDate Fri Oct 03 12:53:49 EDT 2025
Tue Nov 04 01:56:32 EST 2025
Wed Oct 01 13:30:51 EDT 2025
Tue Oct 07 05:12:28 EDT 2025
Tue Nov 11 10:27:05 EST 2025
Tue Nov 04 17:09:44 EST 2025
Thu Nov 13 15:09:00 EST 2025
Thu Apr 03 07:04:28 EDT 2025
Tue Nov 18 21:55:10 EST 2025
Sat Nov 29 05:40:12 EST 2025
Sat Sep 06 07:27:21 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Next-generation sequencing
Machine learning
Error correction
Language English
License 2022. The Author(s).
Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c571t-1c60c30ce4e22e5cfebdf1b0748d39a8ee7d9c4fe0aa9cddc0692190b74477a13
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0003-4516-6357
0000-0003-2597-8331
OpenAccessLink https://link.springer.com/10.1186/s12859-022-04754-3
PMID 35698033
PQID 2678148178
PQPubID 44065
PageCount 17
ParticipantIDs doaj_primary_oai_doaj_org_article_4ce75ad398e246598147f1cec991fd1e
pubmedcentral_primary_oai_pubmedcentral_nih_gov_9195321
proquest_miscellaneous_2676553009
proquest_journals_2678148178
gale_infotracmisc_A707073779
gale_infotracacademiconefile_A707073779
gale_incontextgauss_ISR_A707073779
pubmed_primary_35698033
crossref_citationtrail_10_1186_s12859_022_04754_3
crossref_primary_10_1186_s12859_022_04754_3
springer_journals_10_1186_s12859_022_04754_3
PublicationCentury 2000
PublicationDate 2022-06-13
PublicationDateYYYYMMDD 2022-06-13
PublicationDate_xml – month: 06
  year: 2022
  text: 2022-06-13
  day: 13
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC bioinformatics
PublicationTitleAbbrev BMC Bioinformatics
PublicationTitleAlternate BMC Bioinformatics
PublicationYear 2022
Publisher BioMed Central
BioMed Central Ltd
Springer Nature B.V
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: Springer Nature B.V
– name: BMC
References L Song (4754_CR6) 2014
D Gusfield (4754_CR21) 1997; 28
Y Heo (4754_CR9) 2016; 32
W-C Kao (4754_CR12) 2011; 21
G Marcais (4754_CR26) 2011; 27
A Gurevich (4754_CR25) 2013; 29
L Ilie (4754_CR5) 2013
L Breiman (4754_CR22) 2001; 45
M Długosz (4754_CR10) 2017; 33
A Allam (4754_CR14) 2015; 31
A Limasset (4754_CR15) 2019; 36
I Fischer-Hwang (4754_CR2) 2019; 9
L Salmela (4754_CR11) 2011; 27
A Sharma (4754_CR19) 2022; 23
M Heydari (4754_CR1) 2017; 18
M Abdallah (4754_CR18) 2019
F Kallenborn (4754_CR17) 2020; 37
JT Simpson (4754_CR3) 2012; 22
F Pedregosa (4754_CR27) 2011; 12
A Bankevich (4754_CR24) 2012; 19
Y Liu (4754_CR4) 2013
P Greenfield (4754_CR7) 2014; 30
M Heydari (4754_CR16) 2019; 20
H Li (4754_CR8) 2015
MH Schulz (4754_CR13) 2014; 30
H Xin (4754_CR20) 2015; 31
W Huang (4754_CR23) 2012; 28
References_xml – volume: 32
  start-page: 2369
  issue: 15
  year: 2016
  ident: 4754_CR9
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btw146
– year: 2015
  ident: 4754_CR8
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btv290
– volume: 9
  start-page: 1
  issue: 1
  year: 2019
  ident: 4754_CR2
  publication-title: Sci Rep
  doi: 10.1038/s41598-019-51418-z
– volume: 12
  start-page: 2825
  year: 2011
  ident: 4754_CR27
  publication-title: J Mach Learn Res
– volume: 31
  start-page: 3421
  issue: 21
  year: 2015
  ident: 4754_CR14
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btv415
– volume: 30
  start-page: i356
  issue: 17
  year: 2014
  ident: 4754_CR13
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btu440
– volume: 27
  start-page: 1455
  issue: 11
  year: 2011
  ident: 4754_CR11
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btr170
– volume: 37
  start-page: 889
  issue: 7
  year: 2020
  ident: 4754_CR17
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btaa738
– volume: 18
  start-page: 1
  issue: 1
  year: 2017
  ident: 4754_CR1
  publication-title: BMC Bioinform
  doi: 10.1186/s12859-017-1784-8
– volume: 28
  start-page: 593
  issue: 4
  year: 2012
  ident: 4754_CR23
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btr708
– volume: 23
  start-page: 25
  issue: 1
  year: 2022
  ident: 4754_CR19
  publication-title: BMC Bioinform
  doi: 10.1186/s12859-021-04547-0
– volume: 30
  start-page: 2723
  issue: 19
  year: 2014
  ident: 4754_CR7
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btu368
– volume: 29
  start-page: 1072
  issue: 8
  year: 2013
  ident: 4754_CR25
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btt086
– volume: 28
  start-page: 41
  issue: 4
  year: 1997
  ident: 4754_CR21
  publication-title: Acm Sigact News
  doi: 10.1145/270563.571472
– year: 2019
  ident: 4754_CR18
  publication-title: Sci Rep
  doi: 10.1038/s41598-019-52196-4
– volume: 21
  start-page: 1181
  issue: 7
  year: 2011
  ident: 4754_CR12
  publication-title: Genome Res
  doi: 10.1101/gr.111351.110
– year: 2013
  ident: 4754_CR5
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btt407
– volume: 31
  start-page: 1553
  issue: 10
  year: 2015
  ident: 4754_CR20
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btu856
– volume: 33
  start-page: 1086
  issue: 7
  year: 2017
  ident: 4754_CR10
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btw746
– volume: 22
  start-page: 549
  issue: 3
  year: 2012
  ident: 4754_CR3
  publication-title: Genome Res
  doi: 10.1101/gr.126953.111
– volume: 20
  start-page: 1
  issue: 1
  year: 2019
  ident: 4754_CR16
  publication-title: BMC Bioinform
  doi: 10.1186/s12859-019-2906-2
– volume: 36
  start-page: 1374
  year: 2019
  ident: 4754_CR15
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btz102
– volume: 27
  start-page: 764
  issue: 6
  year: 2011
  ident: 4754_CR26
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btr011
– volume: 19
  start-page: 455
  issue: 5
  year: 2012
  ident: 4754_CR24
  publication-title: J Comput Biol
  doi: 10.1089/cmb.2012.0021
– year: 2013
  ident: 4754_CR4
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts690
– year: 2014
  ident: 4754_CR6
  publication-title: Genome Biol
  doi: 10.1186/s13059-014-0509-9
– volume: 45
  start-page: 63
  year: 2001
  ident: 4754_CR22
  publication-title: Mach Learn
  doi: 10.1023/A:1010933404324
SSID ssj0017805
Score 2.4400976
Snippet Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error...
Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction...
Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error...
Abstract Background Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 227
SubjectTerms Algorithms
Analysis
Assembly
Bioinformatics
Biomedical and Life Sciences
Candidates
Computational Biology/Bioinformatics
Computer Appl. in Life Sciences
Construction
Context
Datasets
Decision trees
DNA sequencing
Error correction
Error correction & detection
False positive reactions
High-Throughput Nucleotide Sequencing - methods
Humans
Impact analysis
Learning algorithms
Life Sciences
Machine Learning
Methods
Microarrays
Next-generation sequencing
Nucleotide sequence
Nucleotide sequencing
Pipelining (computers)
Prevention
Sequence Alignment
Sequence Analysis, DNA - methods
Software
Statistical analysis
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Nb9QwELVQBRIXxDeBggxC4gChduzYMbelagWXChUq9WYlznhbCbIo2a3Ev8fjOEtTBFy4xmPLeTP-kp_fEPLSNwVooyEcU1uZS48akA0v89KHo1ijCwgGMdmEPjqqTk_Np0upvpATNsoDj8DtSQe6rFthKiikKk3FpfbcgQsbG99ywNmXaTMdptL9ASr1T09kKrU3cNRpy5G5zqQuZS5my1BU6_99Tr60KF0lTF65NY2L0eFtcivtIuli7P0dcg26u-TGmFfyxz1ysr84PqDFW_aO9ijNGtqgPsQZ5CNH6wJoolBjCfT9qqcO03TERw4DRTL8kn6LPEugKbHE8j45OTz4sv8hT_kTcldqvs65U8wJ5kBCUUDpPDSt503YNFQBz7oC0K1x0gOra-Pa1jFlwgQWXCSl1jUXD8hOt-rgEaGF0qwxrVFeFlKAaGquGieF80a3GlhG-ASndUlcHHNcfLXxkFEpO7rABhfY6AIrMvJ6W-f7KK3xV-v36KWtJcpixw8hWGwKFvuvYMnIC_SxReGLDpk1y3ozDPbj52O70Ch8hPKLGXmVjPwq_IOr00OFgARqZc0sd2eWYWS6efEUSjbNDIMNSIZeVSFCM_J8W4w1ke3WwWoTbRSmc2KhiYdj5G3_W5TKVEwEPPQsJmfAzEu687OoG27wyrTgGXkzRe-vbv0Z-Mf_A_gn5GYRRx-KXu6SnXW_gafkurtYnw_9szh2fwKo6kWC
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: Biological Science Database
  dbid: M7P
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Nb9QwELWggNQL35RAQQEhcYDQ-CNxzAUtVSu4VFWhUm9W4oyXSpCUZLcS_x6P46SkiF64rierjGc8tuPn9wh5ZSsGUklw29RaJMIiB2RFsySzbitWSQbOwItNyIOD4uREHYYPbn2AVY410RfqujX4jXyH5UjOVFBZfDj7maBqFJ6uBgmN6-QGsiQwD907nE4RkK9_vChT5Ds9Rba2BPHrqZCZSPhsMvKc_X9X5j-mpsuwyUtnp35K2r_zv87cJbfDYjReDNlzj1yD5j65NchT_npAjncXR3sxe5e-jztkeHUvEVuXrpAMUK9ziAMSG1ug69ouNqj24e9K9DFi6pfxDw_XhDjoUywfkuP9va-7n5Igw5CYTNJVQk2eGp4aEMAYZMZCVVtaubVHUXNVFgCyVkZYSMtSmbo2aa5cHXSRFkLKkvJHZKNpG3hMYud3Wqla5VYwwYFXJc0rI7ixStYS0ojQMR7aBI5ylMr4rv1epcj1EEPtYqh9DDWPyJvpmbOBoeNK648Y5skS2bX9D2231GGwamFAZqVzrgAm8ky5WElLDRi3mLY1hYi8xCTRyJ_RIEBnWa77Xn_-cqQXEvmTkMUxIq-DkW2dD6YM9x1cTyDl1sxye2bpBriZN49JpEOB6fVFBkXkxdSMTyJoroF27W1yVIVK3V9sDak7-c2zXBUpd_0hZ0k965h5S3P6zdOPKzx5ZTQib8f0v3itf3f8k6u9eEo2mR-YyIq5TTZW3RqekZvmfHXad8_9sP4NgoNTag
  priority: 102
  providerName: ProQuest
Title CARE 2.0: reducing false-positive sequencing error corrections using machine learning
URI https://link.springer.com/article/10.1186/s12859-022-04754-3
https://www.ncbi.nlm.nih.gov/pubmed/35698033
https://www.proquest.com/docview/2678148178
https://www.proquest.com/docview/2676553009
https://pubmed.ncbi.nlm.nih.gov/PMC9195321
https://doaj.org/article/4ce75ad398e246598147f1cec991fd1e
Volume 23
WOSCitedRecordID wos000810679500002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVADU
  databaseName: Open Access: BioMedCentral Open Access Titles
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RBZ
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.biomedcentral.com/search/
  providerName: BioMedCentral
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: DOA
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M~E
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVPQU
  databaseName: Advanced Technologies & Aerospace Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: P5Z
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/hightechjournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Biological Science Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M7P
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/biologicalscijournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Computer Science Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: K7-
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/compscijour
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: 7X7
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: BENPR
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Publicly Available Content Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: PIMPY
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/publiccontent
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLINK Contemporary 1997-Present
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RSV
  dateStart: 20001201
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELZoC1IvvB-BsgoIiQME4sSJE27baisqxCraUrT0YiXOeFsJsijZrcS_Z8ZJFlIeElx8iMdRPJ4Z2_Hnbxh7ZooAZCoBt6ml8IQhDsiCR15kcCtWyABQwCabkNNpMp-nWXcprOnR7v2RpI3U1q2T-HXDiWvNI_S5L2QkvHCL7eB0l1DChtnxx83ZAbH099djfttuMAVZpv5f4_FPE9JlsOSlE1M7ER3e-L8u3GTXu4WnO24t5Ra7AtVtdq1NRfntDjs5GM8mbvDKf-PWxOaKr3UNmiZ4LazrAtwOdU01UNfL2tWU2cPei2hcws8v3C8Wmglul4ticZedHE4-HLz1upQLno4kX3lcx74OfQ0CggAibaAoDS9wnZGUYZonALJMtTDg53mqy1L7cYoxD0dVCClzHt5j29WyggfMDWLpF2mZxkYEIoSwyHlcaBFqk8pSgu8w3o-C0h0fOaXF-KzsviSJVasuhepSVl0qdNiLTZuvLRvHX6X3aXA3ksSkbR8s64XqHFMJDTLKsXMJBCKO0oQLabgGjQtnU3Jw2FMyDUVcGRWBcRb5umnU0fFMjSVxJRFjo8Oed0JmiX3QeXe3ATVB9FoDyb2BJDqzHlb3Fqi6YNIo1CR-VYKG7bAnm2pqSQC5CpZrKxNTBigfX3G_NdhNv8MoRi8JUR9yYMoDxQxrqvMzSzWe0ilrwB32sjfoH5_1Z8U__DfxR2w3sD5BjJh7bHtVr-Exu6ovVudNPWJbci5tmYzYzv5kms1G9s8Jlu-kNyK0boZlFp1ifXb0Pvs0sgHhOwpMUkw
linkProvider Springer Nature
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1Lb9QwELZKAcGF9yNQICAQBwiNHSeOkRBaSquutqxQH1JvJnHGSyXYlGS3qH-K34jHSbakiN564LqeRJnJPNbxzPcR8tzkDIQUYLepBQ-4QQzInMZBbOxWLBcMrIAjmxDjcbq_Lz8vkV_dLAy2VXY50SXqotT4jXyVJQjOlFKRvj_8ESBrFJ6udhQajVuM4Pin3bLV74Yf7ft9wdjG-u7aZtCyCgQ6FnQWUJ2EOgo1cGAMYm0gLwzNbSlNi0hmKYAopOYGwiyTuih0mEgb1vbBORcio5G97wVykUepwLgaiWBxaoH8AN1gTpqs1hTR4QLslw-5iHkQ9Yqf4wj4uxL8UQpPt2meOqt1JXDj-v9mvBvkWvtn2x800XGTLMH0Frnc0G8e3yZ7a4PtdZ-9Cd_6FSLYWqV9Y8MRgqaV7Qj8ttMcV6CqysrXyGbiZkFqH2cGJv53144Kfsu_MblD9s5Fp7tkeVpO4T7xrZ3DXBYyMZzxCKI8o0mueaSNFIWA0CO0e_9KtxjsSAXyTbm9WJqoxmeU9RnlfEZFHnm1uOawQSA5U_oDutVCEtHD3Q9lNVFtMlJcg4gzq1wKjCextL4hDNWg7WbBFBQ88gydUiE-yBQbkCbZvK7VcGdbDQTiQyFKpUdetkKmtDrorJ3nsJZASLGe5EpP0iYw3V_unFa1CbRWJx7rkaeLZbwSmwKnUM6dTIKsV6G9xb0mVBZ6R3Ei0zCy9hC9IOoZpr8yPfjq4NUlniwz6pHXXbidPNa_Df_gbC2ekCubu5-21NZwPHpIrjKXFBABdIUsz6o5PCKX9NHsoK4eu5Tiky_nHYa_AZYHsoI
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lb9QwELagQMWFdyFQICAkDhAax44dc1tKV1SgVdXSqjcrccZLJUiqZLcS_x6PkyxNeUiIazxO4vGMH5pvviHkhS0SkEqCu6aWPOIWOSALmkapdVexQibgBHyxCTmbZcfHau9cFr9Huw8hyS6nAVmaqsXWaWk7F8_EVkuRdy1CJHrMZcojdplc4Qikx_v6wdEqjoCM_UOqzG_7jbYjz9r_69p8bnO6CJy8ED31m9L05v8P5xa50R9Iw0lnQbfJJajukGtdicrvd8nh9mR_J0zexG_DBlle3SdC60wWog7udQZhj8bGFmiaugkNVvzw-RJtiLj6efjNQzYh7GtUzO-Rw-nO5-0PUV-KITKppIuIGhEbFhvgkCSQGgtFaWnhzh9ZyVSeAchSGW4hznNlytLEQrm10M0251LmlG2Qtaqu4AEJEyHjQpVKWJ5wBqzIqSgMZ8YqWUqIA0KHGdGm5ynHchlftb-vZEJ36tJOXdqrS7OAvFr1Oe1YOv4q_Q4neiWJDNv-Qd3Mde-wmhuQae4Gl0HCRaoyyqWlBow7UNuSQkCeo5lo5NCoEKQzz5dtq3cP9vVEIocSMjkG5GUvZGs3BpP3OQ9OE0i7NZLcHEk6Jzfj5sEadb_ItNpp0v1V5ow8IM9WzdgTgXMV1EsvI7AyVOxecb8z3tW4WSpUFjOnDzky65Fixi3VyRdPQa4w-prQgLwejPvnb_1Z8Q__TfwpWd97P9WfdmcfH5HriXcPJM3cJGuLZgmPyVVztjhpmyfe538AVaNVPw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=CARE+2.0%3A+reducing+false-positive+sequencing+error+corrections+using+machine+learning&rft.jtitle=BMC+bioinformatics&rft.au=Kallenborn%2C+Felix&rft.au=Cascitti%2C+Julian&rft.au=Schmidt%2C+Bertil&rft.date=2022-06-13&rft.pub=BioMed+Central+Ltd&rft.issn=1471-2105&rft.eissn=1471-2105&rft.volume=23&rft.issue=1&rft_id=info:doi/10.1186%2Fs12859-022-04754-3&rft.externalDBID=ISR&rft.externalDocID=A707073779
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon