AI in health and medicine

P Rajpurkar, E Chen, O Banerjee, EJ Topol - Nature medicine, 2022 - nature.com
Artificial intelligence (AI) is poised to broadly reshape medicine, potentially improving the
experiences of both clinicians and patients. We discuss key findings from a 2-year weekly …

A guide to machine learning for biologists

JG Greener, SM Kandathil, L Moffat… - Nature reviews Molecular …, 2022 - nature.com
The expanding scale and inherent complexity of biological data have encouraged a growing
use of machine learning in biology to build informative and predictive models of the …

Large language models generate functional protein sequences across diverse families

A Madani, B Krause, ER Greene, S Subramanian… - Nature …, 2023 - nature.com
Deep-learning language models have shown promise in various biotechnological
applications, including protein design and engineering. Here we describe ProGen, a …

Evolutionary-scale prediction of atomic-level protein structure with a language model

Z Lin, H Akin, R Rao, B Hie, Z Zhu, W Lu, N Smetanin… - Science, 2023 - science.org
Recent advances in machine learning have leveraged evolutionary information in multiple
sequence alignments to predict protein structure. We demonstrate direct inference of full …

ProtGPT2 is a deep unsupervised language model for protein design

N Ferruz, S Schmidt, B Höcker - Nature communications, 2022 - nature.com
Protein design aims to build novel proteins customized for specific purposes, thereby
holding the potential to tackle many environmental and biomedical problems. Recent …

OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

G Ahdritz, N Bouatta, C Floristean, S Kadyan, Q Xia… - Nature …, 2024 - nature.com
AlphaFold2 revolutionized structural biology with the ability to predict protein structures with
exceptionally high accuracy. Its implementation, however, lacks the code and data required …

Learning inverse folding from millions of predicted structures

C Hsu, R Verkuil, J Liu, Z Lin, B Hie… - International …, 2022 - proceedings.mlr.press
We consider the problem of predicting a protein sequence from its backbone atom
coordinates. Machine learning approaches to this problem to date have been limited by the …

[PDF][PDF] Language models of protein sequences at the scale of evolution enable accurate structure prediction

Z Lin, H Akin, R Rao, B Hie, Z Zhu, W Lu… - BioRxiv, 2022 - biorxiv.org
Large language models have recently been shown to develop emergent capabilities with
scale, going beyond simple pattern matching to perform higher level reasoning and …

Genome-wide prediction of disease variant effects with a deep protein language model

N Brandes, G Goldman, CH Wang, CJ Ye, V Ntranos - Nature Genetics, 2023 - nature.com
Predicting the effects of coding variants is a major challenge. While recent deep-learning
models have improved variant effect prediction accuracy, they cannot analyze all coding …

SignalP 6.0 predicts all five types of signal peptides using protein language models

F Teufel, JJ Almagro Armenteros, AR Johansen… - Nature …, 2022 - nature.com
Signal peptides (SPs) are short amino acid sequences that control protein secretion and
translocation in all living organisms. SPs can be predicted from sequence data, but existing …