SKiM: Accurately classifying metagenomic ONT reads in limited memory

Research output: Contribution to journal › Article › peer-review

Abstract

Motivation: Oxford Nanopore Technologies' devices, such as MinION, permit affordable, real-time DNA sequencing, and come with targeted sequencing capabilities. Such capabilities create new challenges for metagenomic classifiers that must be computationally efficient yet robust enough to handle potentially erroneous DNA reads, while ideally inspecting only a few hundred bases of a read. Currently available DNA classifiers leave room for improvement with respect to classification accuracy, memory usage, and the ability to operate in targeted sequencing scenarios.

Results: We present SKiM: Short K-mers in Metagenomics, a new lightweight metagenomic classifier designed for ONT reads. Compared to state-of-the-art classifiers, SKiM requires only a fraction of memory to run, and can classify DNA reads with higher accuracy after inspecting only their first few hundred bases. To achieve this, SKiM introduces new data compression techniques to maintain a reference database built from short k-mers, and treats classification as a statistical testing problem.

Availability and implementation: SKiM source code, documentation, and test data are available from: https://gitlab.com/SCoRe-Group/skim.

Original language: English
Article number: btaf537
Journal: Bioinformatics
Volume: 41
Issue number: 10
State: Published - Oct 1 2025