Artificial intelligence for natural product drug discovery

MW Mullowney, KR Duncan, SS Elsayed… - Nature Reviews Drug …, 2023 - nature.com
Developments in computational omics technologies have provided new means to access
the hidden diversity of natural products, unearthing new potential for drug discovery. In …

Decision trees: from efficient prediction to responsible AI

H Blockeel, L Devos, B Frénay, G Nanfack… - Frontiers in Artificial …, 2023 - frontiersin.org
This article provides a bird's-eye view on the role of decision trees in machine learning and
data science over roughly four decades. It sketches the evolution of decision tree research …

ADBench: Anomaly detection benchmark

S Han, X Hu, H Huang, M Jiang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Given a long list of anomaly detection algorithms developed in the last few decades, how do
they perform with regard to (i) varying levels of supervision, (ii) different types of anomalies …

TabLLM: Few-shot classification of tabular data with large language models

S Hegselmann, A Buendia, H Lang… - International …, 2023 - proceedings.mlr.press
We study the application of large language models to zero-shot and few-shot classification
of tabular data. We prompt the large language model with a serialization of the tabular data …

Explainable Artificial Intelligence (XAI) 2.0: A manifesto of open challenges and interdisciplinary research directions

L Longo, M Brcic, F Cabitza, J Choi, R Confalonieri… - Information …, 2024 - Elsevier
Understanding black box models has become paramount as systems based on opaque
Artificial Intelligence (AI) continue to flourish in diverse real-world applications. In response …

TabPFN: A transformer that solves small tabular classification problems in a second

N Hollmann, S Müller, K Eggensperger… - arXiv preprint arXiv …, 2022 - arxiv.org
We present TabPFN, a trained Transformer that can do supervised classification for small
tabular datasets in less than a second, needs no hyperparameter tuning and is competitive …

When do neural nets outperform boosted trees on tabular data?

D McElfresh, S Khandagale… - Advances in …, 2024 - proceedings.neurips.cc
Tabular data is one of the most commonly used types of data in machine learning. Despite
recent advances in neural nets (NNs) for tabular data, there is still an active discussion on …

Machine learning models to accelerate the design of polymeric long-acting injectables

P Bannigan, Z Bao, RJ Hickman, M Aldeghi… - Nature …, 2023 - nature.com
Long-acting injectables are considered one of the most promising therapeutic strategies for
the treatment of chronic diseases as they can afford improved therapeutic efficacy, safety …

XTab: Cross-table pretraining for tabular transformers

B Zhu, X Shi, N Erickson, M Li, G Karypis… - arXiv preprint arXiv …, 2023 - arxiv.org
The success of self-supervised learning in computer vision and natural language processing
has motivated pretraining methods on tabular data. However, most existing tabular self …

A performance-driven benchmark for feature selection in tabular deep learning

V Cherepanova, R Levin, G Somepalli… - Advances in …, 2024 - proceedings.neurips.cc
Academic tabular benchmarks often contain small sets of curated features. In contrast, data
scientists typically collect as many features as possible into their datasets, and even …