To transformers and beyond: large language models for the genome

ME Consens, C Dufault, M Wainberg, D Forster… - arXiv preprint arXiv …, 2023 - arxiv.org
In the rapidly evolving landscape of genomics, deep learning has emerged as a useful tool
for tackling complex computational challenges. This review focuses on the transformative …

[HTML][HTML] Large language models in bioinformatics: applications and perspectives

J Liu, M Yang, Y Yu, H Xu, K Li, X Zhou - ArXiv, 2024 - ncbi.nlm.nih.gov
Large language models (LLMs) are a class of artificial intelligence models based on deep
learning, which have great performance in various tasks, especially in natural language …

[HTML][HTML] Deep learning for genomics: From early neural nets to modern large language models

T Yue, Y Wang, L Zhang, C Gu, H Xue, W Wang… - International Journal of …, 2023 - mdpi.com
The data explosion driven by advancements in genomic research, such as high-throughput
sequencing techniques, is constantly challenging conventional methods used in genomics …

Are genomic language models all you need? exploring genomic language models on protein downstream tasks

S Boshar, E Trop, BP de Almeida, L Copoiu, T Pierrot - bioRxiv, 2024 - biorxiv.org
Motivation: Large language models, trained on enormous corpora of biological sequences,
are state-of-the-art for downstream genomic and proteomic tasks. Since the genome …

Transformers and large language models for chemistry and drug discovery

AM Bran, P Schwaller - arXiv preprint arXiv:2310.06083, 2023 - arxiv.org
Language modeling has seen impressive progress over the last years, mainly prompted by
the invention of the Transformer architecture, sparking a revolution in many fields of machine …

Epigenomic language models powered by Cerebras

MV Trotter, CQ Nguyen, S Young, RT Woodruff… - arXiv preprint arXiv …, 2021 - arxiv.org
Large scale self-supervised pre-training of Transformer language models has advanced the
field of Natural Language Processing and shown promise in cross-application to the …

Bend: Benchmarking dna language models on biologically meaningful tasks

FI Marin, F Teufel, M Horlacher, D Madsen… - The Twelfth …, 2023 - openreview.net
The genome sequence contains the blueprint for governing cellular processes. While the
availability of genomes has vastly increased over the last decades, experimental annotation …

Modeling protein using large-scale pretrain language model

Y Xiao, J Qiu, Z Li, CY Hsieh, J Tang - arXiv preprint arXiv:2108.07435, 2021 - arxiv.org
Protein is linked to almost every life process. Therefore, analyzing the biological structure
and property of protein sequences is critical to the exploration of life, as well as disease …

An interdisciplinary outlook on large language models for scientific research

J Boyko, J Cohen, N Fox, MH Veiga, JI Li, J Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we describe the capabilities and constraints of Large Language Models
(LLMs) within disparate academic disciplines, aiming to delineate their strengths and …

An analysis of large language models: their impact and potential applications

G Bharathi Mohan, R Prasanna Kumar… - … and Information Systems, 2024 - Springer
Large language models (LLMs) have transformed the interpretation and creation of human
language in the rapidly developing field of computerized language processing. These …