RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets

Abstract For the past 20 years, the recombination detection program (RDP) project has focused on the development of a fast, flexible, and easy to use Windows-based recombination analysis tool. Whereas previous versions of this tool have relied on considerable user-mediated verification of detected r...

Full description

Saved in:
Bibliographic Details
Published in:Virus evolution Vol. 7; no. 1; p. veaa087
Main Authors: Martin, Darren P, Varsani, Arvind, Roumagnac, Philippe, Botha, Gerrit, Maslamoney, Suresh, Schwab, Tiana, Kelz, Zena, Kumar, Venkatesh, Murrell, Ben
Format: Journal Article
Language:English
Published: England Oxford University Press 01.01.2021
Subjects:
ISSN:2057-1577, 2057-1577
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract For the past 20 years, the recombination detection program (RDP) project has focused on the development of a fast, flexible, and easy to use Windows-based recombination analysis tool. Whereas previous versions of this tool have relied on considerable user-mediated verification of detected recombination events, the latest iteration, RDP5, is automated enough that it can be integrated within analysis pipelines and run without any user input. The main innovation enabling this degree of automation is the implementation of statistical tests to identify recombination signals that could be attributable to evolutionary processes other than recombination. The additional analysis time required for these tests has been offset by algorithmic improvements throughout the program such that, relative to RDP4, RDP5 will still run up to five times faster and be capable of analyzing alignments containing twice as many sequences (up to 5000) that are five times longer (up to 50 million sites). For users wanting to remove signals of recombination from their datasets before using them for downstream phylogenetics-based molecular evolution analyses, RDP5 can disassemble detected recombinant sequences into their constituent parts and output a variety of different recombination-free datasets in an array of different alignment formats. For users that are interested in exploring the recombination history of their datasets, all the manual verification, data management and data visualization components of RDP5 have been extensively updated to minimize the amount of time needed by users to individually verify and refine the program’s interpretation of each of the individual recombination events that it detects.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
https://orcid.org/0000-0003-4111-2415
http://orcid.org/0000-0002-8785-0870
ISSN:2057-1577
2057-1577
DOI:10.1093/ve/veaa087