Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies

Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to thos...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Genome Biology Ročník 21; číslo 1; s. 245
Hlavní autori: Rhie, Arang, Walenz, Brian P., Koren, Sergey, Phillippy, Adam M.
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: London BioMed Central 14.09.2020
Springer Nature B.V
BMC
Predmet:
ISSN:1474-760X, 1474-7596, 1474-760X
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to those found in unassembled high-accuracy reads, Merqury estimates base-level accuracy and completeness. For trios, Merqury can also evaluate haplotype-specific accuracy, completeness, phase block continuity, and switch errors. Multiple visualizations, such as k-mer spectrum plots, can be generated for evaluation. We demonstrate on both human and plant genomes that Merqury is a fast and robust method for assembly validation.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ObjectType-Undefined-3
ISSN:1474-760X
1474-7596
1474-760X
DOI:10.1186/s13059-020-02134-9