Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arXiv preprint arXiv …, 2023 - arxiv.org
As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …

De novo design of protein structure and function with RFdiffusion

JL Watson, D Juergens, NR Bennett, BL Trippe, J Yim… - Nature, 2023 - nature.com
There has been considerable recent progress in designing new proteins using deep-
learning methods,,,,,,,–. Despite this progress, a general deep-learning framework for protein …

De novo design of luciferases using deep learning

AHW Yeh, C Norn, Y Kipnis, D Tischer, SJ Pellock… - Nature, 2023 - nature.com
De novo enzyme design has sought to introduce active sites and substrate-binding pockets
that are predicted to catalyse a reaction of interest into geometrically compatible native …

Evolutionary-scale prediction of atomic-level protein structure with a language model

Z Lin, H Akin, R Rao, B Hie, Z Zhu, W Lu, N Smetanin… - Science, 2023 - science.org
Recent advances in machine learning have leveraged evolutionary information in multiple
sequence alignments to predict protein structure. We demonstrate direct inference of full …

Illuminating protein space with a programmable generative model

JB Ingraham, M Baranov, Z Costello, KW Barber… - Nature, 2023 - nature.com
Three billion years of evolution has produced a tremendous diversity of protein molecules,
but the full potential of proteins is likely to be much greater. Accessing this potential has …

Generalized biomolecular modeling and design with RoseTTAFold All-Atom

R Krishna, J Wang, W Ahern, P Sturmfels, P Venkatesh… - Science, 2024 - science.org
Deep-learning methods have revolutionized protein structure prediction and design but are
presently limited to protein-only systems. We describe RoseTTAFold All-Atom (RFAA), which …

Learning inverse folding from millions of predicted structures

C Hsu, R Verkuil, J Liu, Z Lin, B Hie… - International …, 2022 - proceedings.mlr.press
We consider the problem of predicting a protein sequence from its backbone atom
coordinates. Machine learning approaches to this problem to date have been limited by the …

[HTML][HTML] Progen2: exploring the boundaries of protein language models

E Nijkamp, JA Ruffolo, EN Weinstein, N Naik, A Madani - Cell systems, 2023 - cell.com
Attention-based models trained on protein sequences have demonstrated incredible
success at classification and generation tasks relevant for artificial-intelligence-driven …

Diffusion probabilistic modeling of protein backbones in 3d for the motif-scaffolding problem

BL Trippe, J Yim, D Tischer, D Baker… - arXiv preprint arXiv …, 2022 - arxiv.org
Construction of a scaffold structure that supports a desired motif, conferring protein function,
shows promise for the design of vaccines and enzymes. But a general solution to this motif …