Scientific discovery in the age of artificial intelligence
Artificial intelligence (AI) is being increasingly integrated into scientific discovery to augment
and accelerate research, helping scientists to generate hypotheses, design experiments …
and accelerate research, helping scientists to generate hypotheses, design experiments …
Challenges and applications of large language models
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
Evolutionary-scale prediction of atomic-level protein structure with a language model
Recent advances in machine learning have leveraged evolutionary information in multiple
sequence alignments to predict protein structure. We demonstrate direct inference of full …
sequence alignments to predict protein structure. We demonstrate direct inference of full …
Large language models generate functional protein sequences across diverse families
Deep-learning language models have shown promise in various biotechnological
applications, including protein design and engineering. Here we describe ProGen, a …
applications, including protein design and engineering. Here we describe ProGen, a …
ProtGPT2 is a deep unsupervised language model for protein design
Protein design aims to build novel proteins customized for specific purposes, thereby
holding the potential to tackle many environmental and biomedical problems. Recent …
holding the potential to tackle many environmental and biomedical problems. Recent …
Hyenadna: Long-range genomic sequence modeling at single nucleotide resolution
Genomic (DNA) sequences encode an enormous amount of information for gene regulation
and protein synthesis. Similar to natural language models, researchers have proposed …
and protein synthesis. Similar to natural language models, researchers have proposed …
Learning inverse folding from millions of predicted structures
We consider the problem of predicting a protein sequence from its backbone atom
coordinates. Machine learning approaches to this problem to date have been limited by the …
coordinates. Machine learning approaches to this problem to date have been limited by the …
[PDF][PDF] Language models of protein sequences at the scale of evolution enable accurate structure prediction
Large language models have recently been shown to develop emergent capabilities with
scale, going beyond simple pattern matching to perform higher level reasoning and …
scale, going beyond simple pattern matching to perform higher level reasoning and …
DeepLoc 2.0: multi-label subcellular localization prediction using protein language models
V Thumuluri, JJ Almagro Armenteros… - Nucleic acids …, 2022 - academic.oup.com
The prediction of protein subcellular localization is of great relevance for proteomics
research. Here, we propose an update to the popular tool DeepLoc with multi-localization …
research. Here, we propose an update to the popular tool DeepLoc with multi-localization …
SignalP 6.0 predicts all five types of signal peptides using protein language models
Signal peptides (SPs) are short amino acid sequences that control protein secretion and
translocation in all living organisms. SPs can be predicted from sequence data, but existing …
translocation in all living organisms. SPs can be predicted from sequence data, but existing …