CARE 2.0: reducing false-positive sequencing error corrections using machine learning
| Published in: | BMC Bioinformatics, Vol. 23, No. 1, p. 227 - 17 |
|---|---|
| Main authors: | Kallenborn, Felix; Cascitti, Julian; Schmidt, Bertil |
| Format: | Journal Article |
| Language: | English |
| Published: | London: BioMed Central, 13.06.2022 (BioMed Central Ltd; Springer Nature B.V; BMC) |
| ISSN: | 1471-2105 |
| Abstract | Background: Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error correction programs are able to reliably detect and correct the majority of sequencing errors. However, they also introduce new errors by making false-positive corrections. These correction mistakes can have negative impact on downstream analysis, such as k-mer statistics, de-novo assembly, and variant calling. This motivates the need for more precise error correction tools. Results: We present CARE 2.0, a context-aware read error correction tool based on multiple sequence alignment targeting Illumina datasets. In addition to a number of newly introduced optimizations its most significant change is the replacement of CARE 1.0’s hand-crafted correction conditions with a novel classifier based on random decision forests trained on Illumina data. This results in up to two orders-of-magnitude fewer false-positive corrections compared to other state-of-the-art error correction software. At the same time, CARE 2.0 is able to achieve high numbers of true-positive corrections comparable to its competitors. On a simulated full human dataset with 914M reads CARE 2.0 generates only 1.2M false positives (FPs) (and 801.4M true positives (TPs)) at a highly competitive runtime while the best corrections achieved by other state-of-the-art tools contain at least 3.9M FPs and at most 814.5M TPs. Better de-novo assembly and improved k-mer analysis show the applicability of CARE 2.0 to real-world data. Conclusion: False-positive corrections can negatively influence down-stream analysis. The precision of CARE 2.0 greatly reduces the number of those corrections compared to other state-of-the-art programs including BFC, Karect, Musket, Bcool, SGA, and Lighter. Thus, higher-quality datasets are produced which improve k-mer analysis and de-novo assembly in real-world datasets which demonstrates the applicability of machine learning techniques in the context of sequencing read error correction. CARE 2.0 is written in C++/CUDA for Linux systems and can be run on the CPU as well as on CUDA-enabled GPUs. It is available at https://github.com/fkallen/CARE. |
|---|---|
| ArticleNumber | 227 |
| Audience | Academic |
| Author | Kallenborn, Felix; Schmidt, Bertil; Cascitti, Julian |
| Author_xml | – sequence: 1 givenname: Felix orcidid: 0000-0003-4516-6357 surname: Kallenborn fullname: Kallenborn, Felix email: kallenborn@uni-mainz.de organization: Department of Computer Science, Johannes Gutenberg University Mainz – sequence: 2 givenname: Julian surname: Cascitti fullname: Cascitti, Julian organization: Department of Computer Science, Johannes Gutenberg University Mainz – sequence: 3 givenname: Bertil orcidid: 0000-0003-2597-8331 surname: Schmidt fullname: Schmidt, Bertil organization: Department of Computer Science, Johannes Gutenberg University Mainz |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/35698033$$D View this record in MEDLINE/PubMed |
| CitedBy_id | crossref_primary_10_1038_s41598_024_52386_9 crossref_primary_10_3390_ijms252413250 crossref_primary_10_1002_qub2_99 crossref_primary_10_1016_j_ymeth_2023_06_007 crossref_primary_10_1186_s12859_024_05681_1 crossref_primary_10_1016_j_csbj_2024_05_025 crossref_primary_10_1016_j_jaci_2025_06_015 crossref_primary_10_1016_j_drudis_2024_103990 crossref_primary_10_1093_bfgp_elad026 crossref_primary_10_1186_s12859_024_05802_w |
| Cites_doi | 10.1093/bioinformatics/btw146 10.1093/bioinformatics/btv290 10.1038/s41598-019-51418-z 10.1093/bioinformatics/btv415 10.1093/bioinformatics/btu440 10.1093/bioinformatics/btr170 10.1093/bioinformatics/btaa738 10.1186/s12859-017-1784-8 10.1093/bioinformatics/btr708 10.1186/s12859-021-04547-0 10.1093/bioinformatics/btu368 10.1093/bioinformatics/btt086 10.1145/270563.571472 10.1038/s41598-019-52196-4 10.1101/gr.111351.110 10.1093/bioinformatics/btt407 10.1093/bioinformatics/btu856 10.1093/bioinformatics/btw746 10.1101/gr.126953.111 10.1186/s12859-019-2906-2 10.1093/bioinformatics/btz102 10.1093/bioinformatics/btr011 10.1089/cmb.2012.0021 10.1093/bioinformatics/bts690 10.1186/s13059-014-0509-9 10.1023/A:1010933404324 |
| ContentType | Journal Article |
| Copyright | The Author(s) 2022 2022. The Author(s). COPYRIGHT 2022 BioMed Central Ltd. 2022. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
| DOI | 10.1186/s12859-022-04754-3 |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Biology |
| EISSN | 1471-2105 |
| EndPage | 17 |
| ExternalDocumentID | oai_doaj_org_article_4ce75ad398e246598147f1cec991fd1e PMC9195321 A707073779 35698033 10_1186_s12859_022_04754_3 |
| Genre | Journal Article |
| GeographicLocations | Germany |
| GeographicLocations_xml | – name: Germany |
| GrantInformation_xml | – fundername: Johannes Gutenberg-Universität Mainz (1030) – fundername: DeCoDeML Project by Rhein-Main-University Network – fundername: ; |
| ID | FETCH-LOGICAL-c571t-1c60c30ce4e22e5cfebdf1b0748d39a8ee7d9c4fe0aa9cddc0692190b74477a13 |
| IEDL.DBID | DOA |
| ISICitedReferencesCount | 10 |
| ISSN | 1471-2105 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 1 |
| Keywords | Next-generation sequencing; Machine learning; Error correction |
| Language | English |
| License | 2022. The Author(s). Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
| ORCID | 0000-0003-4516-6357 0000-0003-2597-8331 |
| OpenAccessLink | https://doaj.org/article/4ce75ad398e246598147f1cec991fd1e |
| PMID | 35698033 |
| PQID | 2678148178 |
| PQPubID | 44065 |
| PageCount | 17 |
| PublicationCentury | 2000 |
| PublicationDate | 2022-06-13 |
| PublicationDateYYYYMMDD | 2022-06-13 |
| PublicationDate_xml | – month: 06 year: 2022 text: 2022-06-13 day: 13 |
| PublicationDecade | 2020 |
| PublicationPlace | London |
| PublicationPlace_xml | – name: London – name: England |
| PublicationTitle | BMC bioinformatics |
| PublicationTitleAbbrev | BMC Bioinformatics |
| PublicationTitleAlternate | BMC Bioinformatics |
| PublicationYear | 2022 |
| Publisher | BioMed Central; BioMed Central Ltd; Springer Nature B.V.; BMC |
| Publisher_xml | – name: BioMed Central – name: BioMed Central Ltd – name: Springer Nature B.V – name: BMC |
| SSID | ssj0017805 |
| Score | 2.4401786 |
| Snippet | Background: Next-generation sequencing pipelines often perform error correction as a preprocessing step to obtain cleaned input data. State-of-the-art error... |
| SourceID | doaj pubmedcentral proquest gale pubmed crossref springer |
| SourceType | Open Website Open Access Repository Aggregation Database Index Database Enrichment Source Publisher |
| StartPage | 227 |
| SubjectTerms | Algorithms; Analysis; Assembly; Bioinformatics; Biomedical and Life Sciences; Candidates; Computational Biology/Bioinformatics; Computer Appl. in Life Sciences; Construction; Context; Datasets; Decision trees; DNA sequencing; Error correction; Error correction & detection; False positive reactions; High-Throughput Nucleotide Sequencing - methods; Humans; Impact analysis; Learning algorithms; Life Sciences; Machine Learning; Methods; Microarrays; Next-generation sequencing; Nucleotide sequence; Nucleotide sequencing; Pipelining (computers); Prevention; Sequence Alignment; Sequence Analysis, DNA - methods; Software; Statistical analysis |
| Title | CARE 2.0: reducing false-positive sequencing error corrections using machine learning |
| URI | https://link.springer.com/article/10.1186/s12859-022-04754-3 https://www.ncbi.nlm.nih.gov/pubmed/35698033 https://www.proquest.com/docview/2678148178 https://www.proquest.com/docview/2676553009 https://pubmed.ncbi.nlm.nih.gov/PMC9195321 https://doaj.org/article/4ce75ad398e246598147f1cec991fd1e |
| Volume | 23 |
| hasFullText | 1 |
| inHoldings | 1 |