Bioinformatics applications on apache spark

R Guo, Y Zhao, Q Zou, X Fang, S Peng - GigaScience, 2018 - academic.oup.com
With the rapid development of next-generation sequencing technology, ever-increasing
quantities of genomic data pose a tremendous challenge to data processing. Therefore …

Cloud computing enabled big multi-omics data analytics

S Koppad, GV Gkoutos… - … and biology insights, 2021 - journals.sagepub.com
High-throughput experiments enable researchers to explore complex multifactorial diseases
through large-scale analysis of omics data. Challenges for such high-dimensional data sets …

Compacting de Bruijn graphs from sequencing data quickly and in low memory

R Chikhi, A Limasset, P Medvedev - Bioinformatics, 2016 - academic.oup.com
Motivation: As the quantity of data per sequencing experiment increases, the challenges of
fragment assembly are becoming increasingly computational. The de Bruijn graph is a …

Memory-efficient assembly using Flye

B Freire, S Ladra, JR Paramá - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
In the past decade, next-generation sequencing (NGS) enabled the generation of genomic
data in a cost-effective, high-throughput manner. The most recent third-generation …

HipMer: an extreme-scale de novo genome assembler

E Georganas, A Buluç, J Chapman, S Hofmeyr… - Proceedings of the …, 2015 - dl.acm.org
De novo whole genome assembly reconstructs genomic sequences from short, overlapping,
and potentially erroneous DNA segments and is one of the most important computations in …

An effective and fast soccer ball detection and tracking method

XF Tong, HQ Lu, QS Liu - Proceedings of the 17th International …, 2004 - ieeexplore.ieee.org
A ball detection and tracking approach in real soccer game is proposed in this paper. In view
of difficulties of direct detection, an indirect strategy based on non-ball elimination is applied …

Extreme scale de novo metagenome assembly

E Georganas, R Egan, S Hofmeyr… - … Conference for High …, 2018 - ieeexplore.ieee.org
Metagenome assembly is the process of transforming a set of short, overlapping, and
potentially erroneous DNA segments from environmental samples into the accurate …

MPI+ threads: Runtime contention and remedies

A Amer, H Lu, Y Wei, P Balaji, S Matsuoka - ACM SIGPLAN Notices, 2015 - dl.acm.org
Hybrid MPI+ Threads programming has emerged as an alternative model to the “MPI
everywhere” model to better handle the increasing core density in cluster nodes. While the …

[图书][B] Exascale scientific applications: Scalability and performance portability

TP Straatsma, KB Antypas, TJ Williams - 2017 - books.google.com
From the Foreword:" The authors of the chapters in this book are the pioneers who will
explore the exascale frontier. The path forward will not be easy... These authors, along with …

MPI+ ULT: Overlapping communication and computation with user-level threads

H Lu, S Seo, P Balaji - … on Cyberspace Safety and Security, and …, 2015 - ieeexplore.ieee.org
As the core density of future processors keeps increasing, MPI+ Threads is becoming a
promising programming model for large scale SMP clusters. Generally speaking, hybrid …