AI in health and medicine
Artificial intelligence (AI) is poised to broadly reshape medicine, potentially improving the
experiences of both clinicians and patients. We discuss key findings from a 2-year weekly …
experiences of both clinicians and patients. We discuss key findings from a 2-year weekly …
A guide to machine learning for biologists
The expanding scale and inherent complexity of biological data have encouraged a growing
use of machine learning in biology to build informative and predictive models of the …
use of machine learning in biology to build informative and predictive models of the …
Large language models generate functional protein sequences across diverse families
Deep-learning language models have shown promise in various biotechnological
applications, including protein design and engineering. Here we describe ProGen, a …
applications, including protein design and engineering. Here we describe ProGen, a …
Evolutionary-scale prediction of atomic-level protein structure with a language model
Recent advances in machine learning have leveraged evolutionary information in multiple
sequence alignments to predict protein structure. We demonstrate direct inference of full …
sequence alignments to predict protein structure. We demonstrate direct inference of full …
ProtGPT2 is a deep unsupervised language model for protein design
Protein design aims to build novel proteins customized for specific purposes, thereby
holding the potential to tackle many environmental and biomedical problems. Recent …
holding the potential to tackle many environmental and biomedical problems. Recent …
OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization
AlphaFold2 revolutionized structural biology with the ability to predict protein structures with
exceptionally high accuracy. Its implementation, however, lacks the code and data required …
exceptionally high accuracy. Its implementation, however, lacks the code and data required …
Learning inverse folding from millions of predicted structures
We consider the problem of predicting a protein sequence from its backbone atom
coordinates. Machine learning approaches to this problem to date have been limited by the …
coordinates. Machine learning approaches to this problem to date have been limited by the …
[PDF][PDF] Language models of protein sequences at the scale of evolution enable accurate structure prediction
Large language models have recently been shown to develop emergent capabilities with
scale, going beyond simple pattern matching to perform higher level reasoning and …
scale, going beyond simple pattern matching to perform higher level reasoning and …
Genome-wide prediction of disease variant effects with a deep protein language model
Predicting the effects of coding variants is a major challenge. While recent deep-learning
models have improved variant effect prediction accuracy, they cannot analyze all coding …
models have improved variant effect prediction accuracy, they cannot analyze all coding …
SignalP 6.0 predicts all five types of signal peptides using protein language models
Signal peptides (SPs) are short amino acid sequences that control protein secretion and
translocation in all living organisms. SPs can be predicted from sequence data, but existing …
translocation in all living organisms. SPs can be predicted from sequence data, but existing …