High-Performance Sorting-Based k-mer Counting in Distributed Memory with Flexible Hybrid Parallelism

In generating large quantities of DNA data, high-throughput sequencing technologies require advanced bioinformatics infrastructures for efficient data analysis. k-mer counting, the process of quantifying the frequency of fixed-length k DNA subsequences, is a fundamental step in various bioinformatic...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org
Main Authors: Li, Yifan, Guidi, Giulia
Format: Paper
Language:English
Published: Ithaca Cornell University Library, arXiv.org 10.07.2024
Subjects:
ISSN:2331-8422
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first