MARS: improving multiple circular sequence alignment using refined sequences

LAK Ayad, SP Pissis - BMC genomics, 2017 - Springer
Background A fundamental assumption of all widely-used multiple sequence alignment
techniques is that the left-and right-most positions of the input sequences are relevant to the …

Sublinear algorithms for gap edit distance

E Goldenberg, R Krauthgamer… - 2019 IEEE 60th Annual …, 2019 - ieeexplore.ieee.org
The edit distance is a way of quantifying how similar two strings are to one another by
counting the minimum number of character insertions, deletions, and substitutions required …

[HTML][HTML] Alignment-free sequence comparison using absent words

P Charalampopoulos, M Crochemore, G Fici… - Information and …, 2018 - Elsevier
Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is
often realised by sequence alignment techniques, which are computationally expensive …

[HTML][HTML] Absent words in a sliding window with applications

M Crochemore, A Héliou, G Kucherov… - Information and …, 2020 - Elsevier
An absent word of a word y is a word that does not occur in y. It is then called minimal if all its
proper factors occur in y. In fact, minimal absent words (MAWs) provide useful information …

Computing DAWGs and minimal absent words in linear time for integer alphabets

Y Fujishige, Y Tsujimaru, S Inenaga… - … of Computer Science …, 2016 - drops.dagstuhl.de
The directed acyclic word graph (DAWG) of a string y is the smallest (partial) DFA which
recognizes all suffixes of y and has only O (n) nodes and edges. We present the first O (n) …

On avoided words, absent words, and their application to biological sequence analysis

Y Almirantis, P Charalampopoulos, J Gao… - Algorithms for Molecular …, 2017 - Springer
Background The deviation of the observed frequency of a word w from its expected
frequency in a given sequence x is used to determine whether or not the word is avoided …

Searching page-images of early music scanned with OMR: a scalable solution using minimal absent words

T Crawford, G Badkobeh, D Lewis - 2018 - research.gold.ac.uk
We define three retrieval tasks requiring efficient search of the musical content of a collection
of~ 32k page images of 16th-century music to find: duplicates; pages with the same musical …

[PDF][PDF] Analyzing Kinship in Severe Acute Respiratory Syndrome Coronavirus 2 DNA Sequences Based on Hierarchical and K-Means Clustering Methods Using …

E Banjarnahor, A Bustamam… - … Journal on Advanced …, 2022 - researchgate.net
Based on the World Health Organization data obtained in mid-April 2021, Coronavirus or
Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has already infected …

Minimal absent words in a sliding window and applications to on-line pattern matching

M Crochemore, A Héliou, G Kucherov… - … on fundamentals of …, 2017 - Springer
An absent (or forbidden) word of a word y is a word that does not occur in y. It is then called
minimal if all its proper factors occur in y. There exist linear-time and linear-space algorithms …

R-enum: Enumeration of characteristic substrings in BWT-runs bounded space

T Nishimoto, Y Tabei - arXiv preprint arXiv:2004.01493, 2020 - arxiv.org
Enumerating characteristic substrings (eg, maximal repeats, minimal unique substrings, and
minimal absent words) in a given string has been an important research topic because there …