From molecules to genomic variations: Accelerating genome analysis via intelligent algorithms and architectures

M Alser, J Lindegger, C Firtina, N Almadhoun… - Computational and …, 2022 - Elsevier
We now need more than ever to make genome analysis more intelligent. We need to read,
analyze, and interpret our genomes not only quickly, but also accurately and efficiently …

Personalized pangenome references

J Sirén, P Eskandar, MT Ungaro, G Hickey… - Nature …, 2024 - nature.com
Pangenomes reduce reference bias by representing genetic diversity better than a single
reference sequence. Yet when comparing a sample to a pangenome, variants in the …

Matchtigs: minimum plain text representation of k-mer sets

S Schmidt, S Khan, JN Alanko, GE Pibiri, AI Tomescu - Genome Biology, 2023 - Springer
We propose a polynomial algorithm computing a minimum plain-text representation of k-mer
sets, as well as an efficient near-minimum greedy heuristic. When compressing read sets of …

Compression algorithm for colored de Bruijn graphs

A Rahman, Y Dufresne, P Medvedev - Algorithms for Molecular Biology, 2024 - Springer
A colored de Bruijn graph (also called a set of k-mer sets) is a set of k-mers with every k-mer
assigned a set of colors. Colored de Bruijn graphs are used in a variety of applications …

Machine Learning Techniques for Antimicrobial Resistance Prediction of Pseudomonas Aeruginosa from Whole Genome Sequence Data

SM Noman, M Zeeshan, J Arshad… - Computational …, 2023 - Wiley Online Library
Aim. Due to the growing availability of genomic datasets, machine learning models have
shown impressive diagnostic potential in identifying emerging and reemerging pathogens …

Construction and representation of human pangenome graphs

F Andreace, P Lechat, Y Dufresne, R Chikhi - bioRxiv, 2023 - biorxiv.org
As a single reference genome cannot possibly represent all the variation present across
human individuals, pangenome graphs have been introduced to incorporate population …

Hyper-k-mers: efficient streaming k-mers representation

I Martayan, L Robidou, Y Shibuya, A Limasset - bioRxiv, 2024 - biorxiv.org
K-mers have become ubiquitous in modern bioinformatics pipelines. A key factor in their
success is the ability to filter out erroneous k-mers by removing those with low abundance …

Advances in practical k-mer sets: essentials for the curious

C Marchet - arXiv preprint arXiv:2409.05210, 2024 - arxiv.org
This paper provides a comprehensive survey of data structures for representing k-mer sets,
which are fundamental in high-throughput sequencing analysis. It categorizes the methods …

Compression Algorithms for De Bruijn Graph and Hidden Assembly Artifacts

A Rahman - 2023 - search.proquest.com
In this dissertation, I present four projects covering two main research objectives. The first
objective of my dissertation is to optimize storage usage of sequence analysis tools and …

Memory-bound k-mer selection for large and evolutionary diverse reference libraries

AO Berk Şapcı, S Mirarab - bioRxiv, 2024 - biorxiv.org
Using long k-mers to find sequence matches is increasingly used in many bioinformatic
applications, including metagenomic sequence classification. The accuracy of these …