Scaling laws for generative mixed-modal language models
Generative language models define distributions over sequences of tokens that can
represent essentially any combination of data modalities (e.g., any permutation of image …
Retrosynthesis prediction with an interpretable deep-learning framework based on molecular assembly tasks
Automating retrosynthesis with artificial intelligence expedites organic chemistry research in
digital laboratories. However, most existing deep-learning approaches are hard to explain …
Enhancing activity prediction models in drug discovery with the ability to understand human language
Activity and property prediction models are the central workhorses in drug discovery and
materials sciences, but currently, they have to be trained or fine-tuned for new tasks. Without …
Scientific large language models: A survey on biological & chemical domains
Large Language Models (LLMs) have emerged as a transformative power in enhancing
natural language comprehension, representing a significant stride toward artificial general …
Bayesian optimization of catalysts with in-context learning
Large language models (LLMs) are able to do accurate classification with zero or only a few
examples (in-context learning). We show a prompting system that enables regression with …
COATI: Multimodal contrastive pretraining for representing and traversing chemical space
B Kaufman, EC Williams, C Underkoffler… - Journal of Chemical …, 2024 - ACS Publications
Creating a successful small molecule drug is a challenging multiparameter optimization
problem in an effectively infinite space of possible molecules. Generative models have …
Regression with large language models for materials and molecular property prediction
We demonstrate the ability of large language models (LLMs) to perform material and
molecular property regression tasks, a significant deviation from the conventional LLM use …
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
In many scientific fields, large language models (LLMs) have revolutionized the way with
which text and other modalities of data (e.g., molecules and proteins) are dealt, achieving …
Lost in Translation: Chemical Language Models and the Misunderstanding of Molecule Structures
V Ganeeva, A Sakhovskiy, K Khrabrov… - Findings of the …, 2024 - aclanthology.org
The recent integration of chemistry with natural language processing (NLP) has advanced
drug discovery. Molecule representation in language models (LMs) is crucial in enhancing …
MolLM: A unified language model to integrate biomedical text with 2D and 3D molecular representations
Motivation: The present paradigm of deep learning models for molecular representation
relies mostly on 1D or 2D formats, neglecting significant 3D structural information that offers …