SELFormer: molecular representation learning via SELFIES language models

A Yüksel, E Ulusoy, A Ünlü… - Machine Learning: Science …, 2023 - iopscience.iop.org
Automated computational analysis of the vast chemical space is critical for numerous fields
of research such as drug discovery and material science. Representation learning …

Invalid SMILES are beneficial rather than detrimental to chemical language models

MA Skinnider - Nature Machine Intelligence, 2024 - nature.com
Generative machine learning models have attracted intense interest for their ability to
sample novel molecules with desired chemical or biological properties. Among these …

Gpt-molberta: Gpt molecular features language model for molecular property prediction

S Balaji, R Magar, Y Jadhav, AB Farimani - arXiv preprint arXiv …, 2023 - arxiv.org
With the emergence of Transformer architectures and their powerful understanding of textual
data, a new horizon has opened up to predict the molecular properties based on text …

Chemical species ontology for data integration and knowledge discovery

L Pascazio, S Rihm, A Naseri, S Mosbach… - Journal of Chemical …, 2023 - ACS Publications
Web ontologies are important tools in modern scientific research because they provide a
standardized way to represent and manage web-scale amounts of complex data. In …

Language models in molecular discovery

N Janakarajan, T Erdmann, S Swaminathan… - … Supported by Informatics, 2024 - Springer
The success of language models, especially transformer-based architectures, has trickled
into other scientific domains, giving rise to the concept of “scientific language models” that …

Using test-time augmentation to investigate explainable AI: inconsistencies between method, model and human intuition

PBR Hartog, F Krüger, S Genheden, IV Tetko - Journal of Cheminformatics, 2024 - Springer
Stakeholders of machine learning models desire explainable artificial intelligence (XAI) to
produce human-understandable and consistent interpretations. In computational toxicity …

Interpreting Neural Network Models for Toxicity Prediction by Extracting Learned Chemical Features

M Walter, SJ Webb, VJ Gillet - Journal of Chemical Information …, 2024 - ACS Publications
Neural network models have become a popular machine-learning technique for the toxicity
prediction of chemicals. However, due to their complex structure, it is difficult to understand …

De novo generated combinatorial library design

SV Johansson, MH Chehreghani, O Engkvist… - Digital …, 2024 - pubs.rsc.org
Artificial intelligence (AI) contributes new methods for designing compounds in drug
discovery, ranging from de novo design models suggesting new molecular structures or …

[HTML][HTML] Corrigendum to “Modeling PROTAC degradation activity with machine learning”[Artif. Intell. Life Sci. 6 (2024) 100104]

S Ribes, E Nittinger, C Tyrchan, R Mercado - Artificial Intelligence in the Life …, 2024 - Elsevier
PROTACs are a promising therapeutic modality that harnesses the cell's built-in degradation
machinery to degrade specific proteins. Despite their potential, developing new PROTACs is …

Prediction of molecular-specific mutagenic alerts and related mechanisms of chemicals by a convolutional neural network (CNN) model based on SMILES split

C Chen, Z Huang, X Zou, S Li, D Zhang… - Science of The Total …, 2024 - Elsevier
Structural alerts (SAs) are essential to identify chemicals for toxicity evaluation and health
risk assessment. We constructed a novel SMILES split-based deep learning model (SSDL) …