Combining machine learning and computational chemistry for predictive insights into chemical systems

JA Keith, V Vassilev-Galindo, B Cheng… - Chemical …, 2021 - ACS Publications
Machine learning models are poised to make a transformative impact on chemical sciences
by dramatically accelerating computational algorithms and amplifying insights available from …

Machine learning force fields

OT Unke, S Chmiela, HE Sauceda… - Chemical …, 2021 - ACS Publications
In recent years, the use of machine learning (ML) in computational chemistry has enabled
numerous advances previously out of reach due to the computational complexity of …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arXiv preprint arXiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Data selection for language models via importance resampling

SM Xie, S Santurkar, T Ma… - Advances in Neural …, 2023 - proceedings.neurips.cc
Selecting a suitable pretraining dataset is crucial for both general-domain (eg, GPT-3) and
domain-specific (eg, Codex) language models (LMs). We formalize this problem as selecting …

A unifying review of deep and shallow anomaly detection

L Ruff, JR Kauffmann, RA Vandermeulen… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Deep learning approaches to anomaly detection (AD) have recently improved the state of
the art in detection performance on complex data sets, such as large collections of images or …

Fairness in machine learning: A survey

S Caton, C Haas - ACM Computing Surveys, 2024 - dl.acm.org
When Machine Learning technologies are used in contexts that affect citizens, companies as
well as researchers need to be confident that there will not be any unexpected social …

A brief review of domain adaptation

A Farahani, S Voghoei, K Rasheed… - Advances in data science …, 2021 - Springer
Classical machine learning assumes that the training and test sets come from the same
distributions. Therefore, a model learned from the labeled training data is expected to …

Learning protein fitness models from evolutionary and assay-labeled data

C Hsu, H Nisonoff, C Fannjiang, J Listgarten - Nature biotechnology, 2022 - nature.com
Abstract Machine learning-based models of protein fitness typically learn from either
unlabeled, evolutionarily related sequences or variant sequences with experimentally …

Unleashing the power of graph data augmentation on covariate distribution shift

Y Sui, Q Wu, J Wu, Q Cui, L Li, J Zhou… - Advances in Neural …, 2023 - proceedings.neurips.cc
The issue of distribution shifts is emerging as a critical concern in graph representation
learning. From the perspective of invariant learning and stable learning, a recently well …

[HTML][HTML] Deep neural network battery charging curve prediction using 30 points collected in 10 min

J Tian, R Xiong, W Shen, J Lu, XG Yang - Joule, 2021 - cell.com
Accurate degradation monitoring over battery life is indispensable for the safe and durable
operation of battery-powered applications. In this work, we extend conventional capacity …