Controllable protein design with language models

N Ferruz, B Höcker - Nature Machine Intelligence, 2022 - nature.com
The twenty-first century is presenting humankind with unprecedented environmental and
medical challenges. The ability to design novel proteins tailored for specific purposes would …

Opportunities and challenges in design and optimization of protein function

D Listov, CA Goverde, BE Correia… - … Reviews Molecular Cell …, 2024 - nature.com
The field of protein design has made remarkable progress over the past decade. Historically,
the low reliability of purely structure-based design methods limited their application, but …

Large language models generate functional protein sequences across diverse families

A Madani, B Krause, ER Greene, S Subramanian… - Nature …, 2023 - nature.com
Deep-learning language models have shown promise in various biotechnological
applications, including protein design and engineering. Here we describe ProGen, a …

OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

G Ahdritz, N Bouatta, C Floristean, S Kadyan, Q Xia… - Nature …, 2024 - nature.com
AlphaFold2 revolutionized structural biology with the ability to predict protein structures with
exceptionally high accuracy. Its implementation, however, lacks the code and data required …

Diffusion probabilistic modeling of protein backbones in 3d for the motif-scaffolding problem

BL Trippe, J Yim, D Tischer, D Baker… - arXiv preprint arXiv …, 2022 - arxiv.org
Construction of a scaffold structure that supports a desired motif, conferring protein function,
shows promise for the design of vaccines and enzymes. But a general solution to this motif …

Deep transfer learning for inter-chain contact predictions of transmembrane protein complexes

P Lin, Y Yan, H Tao, SY Huang - Nature Communications, 2023 - nature.com
Membrane proteins are encoded by approximately a quarter of human genes. Inter-chain
residue-residue contact information is important for structure prediction of membrane protein …

Big data mining, rational modification, and ancestral sequence reconstruction inferred multiple xylose isomerases for biorefinery

S Chen, Z Xu, B Ding, Y Zhang, S Liu, C Cai, M Li… - Science …, 2023 - science.org
The isomerization of xylose to xylulose is considered the most promising approach to initiate
xylose bioconversion. Here, phylogeny-guided big data mining, rational modification, and …

Deep learning driven biosynthetic pathways navigation for natural products with BioNavi-NP

S Zheng, T Zeng, C Li, B Chen, CW Coley… - Nature …, 2022 - nature.com
The complete biosynthetic pathways are unknown for most natural products (NPs), it is thus
valuable to make computer-aided bio-retrosynthesis predictions. Here, a navigable and user …

Sparks of function by de novo protein design

AE Chu, T Lu, PS Huang - Nature biotechnology, 2024 - nature.com
Abstract Information in proteins flows from sequence to structure to function, with each step
causally driven by the preceding one. Protein design is founded on inverting this process …

An all-atom protein generative model

AE Chu, J Kim, L Cheng, G El Nesr, M Xu… - Proceedings of the …, 2024 - pnas.org
Proteins mediate their functions through chemical interactions; modeling these interactions,
which are typically through sidechains, is an important need in protein design. However …