Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies

Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to thos...

Full description

Saved in:
Bibliographic Details
Published in:Genome Biology Vol. 21; no. 1; p. 245
Main Authors: Rhie, Arang, Walenz, Brian P., Koren, Sergey, Phillippy, Adam M.
Format: Journal Article
Language:English
Published: London BioMed Central 14.09.2020
Springer Nature B.V
BMC
Subjects:
ISSN:1474-760X, 1474-7596, 1474-760X
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Recent long-read assemblies often exceed the quality and completeness of available reference genomes, making validation challenging. Here we present Merqury, a novel tool for reference-free assembly evaluation based on efficient k-mer set operations. By comparing k-mers in a de novo assembly to those found in unassembled high-accuracy reads, Merqury estimates base-level accuracy and completeness. For trios, Merqury can also evaluate haplotype-specific accuracy, completeness, phase block continuity, and switch errors. Multiple visualizations, such as k-mer spectrum plots, can be generated for evaluation. We demonstrate on both human and plant genomes that Merqury is a fast and robust method for assembly validation.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ObjectType-Undefined-3
ISSN:1474-760X
1474-7596
1474-760X
DOI:10.1186/s13059-020-02134-9