Wild-time: A benchmark of in-the-wild distribution shift over time

H Yao, C Choi, B Cao, Y Lee… - Advances in Neural …, 2022 - proceedings.neurips.cc
Distribution shifts occur when the test distribution differs from the training distribution, and
can considerably degrade performance of machine learning models deployed in the real …

ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

I Arawjo, C Swoopes, P Vaithilingam… - Proceedings of the CHI …, 2024 - dl.acm.org
Evaluating outputs of large language models (LLMs) is challenging, requiring making—and
making sense of—many responses. Yet tools that go beyond basic prompting tend to require …

Understanding and mitigating bias in imaging artificial intelligence

AS Tejani, YS Ng, Y Xi, JC Rayan - RadioGraphics, 2024 - pubs.rsna.org
Artificial intelligence (AI) algorithms are prone to bias at multiple stages of model
development, with potential for exacerbating health disparities. However, bias in imaging AI …

Developing machine learning systems worthy of trust for infection science: a requirement for future implementation into clinical practice

BR McFadden, M Reynolds, TJJ Inglis - Frontiers in Digital Health, 2023 - frontiersin.org
Infection science is a discipline of healthcare which includes clinical microbiology, public
health microbiology, mechanisms of microbial disease, and antimicrobial countermeasures …

On the Robustness of ML-Based Network Intrusion Detection Systems: An Adversarial and Distribution Shift Perspective

M Wang, N Yang, DH Gunasinghe, N Weng - Computers, 2023 - mdpi.com
Utilizing machine learning (ML)-based approaches for network intrusion detection systems
(NIDSs) raises valid concerns due to the inherent susceptibility of current ML models to …

Diagnosing model performance under distribution shift

TT Cai, H Namkoong, S Yadlowsky - arXiv preprint arXiv:2303.02011, 2023 - arxiv.org
Prediction models can perform poorly when deployed to target distributions different from the
training distribution. To understand these operational failure modes, we develop a method …

Enhancing false negative and positive rates for efficient insider threat detection

M AlSlaiman, MI Salman, MM Saleh, B Wang - Computers & Security, 2023 - Elsevier
Insider threats on information security can become a burden for organizations. However,
outsider attacks have received more attention compared to insider attacks. Many …

Systematic evaluation of local and global machine learning models for the prediction of ADME properties

E Di Lascio, G Gerebtzoff… - Molecular …, 2023 - ACS Publications
Machine learning (ML) has become an indispensable tool to predict absorption, distribution,
metabolism, and excretion (ADME) properties in pharmaceutical research. ML algorithms …

Explainability through uncertainty: Trustworthy decision-making with neural networks

A Thuy, DF Benoit - European Journal of Operational Research, 2024 - Elsevier
Uncertainty is a key feature of any machine learning model and is particularly important in
neural networks, which tend to be overconfident. This overconfidence is worrying under …

Angler: Helping machine translation practitioners prioritize model improvements

S Robertson, ZJ Wang, D Moritz, MB Kery… - Proceedings of the 2023 …, 2023 - dl.acm.org
Machine learning (ML) models can fail in unexpected ways in the real world, but not all
model failures are equal. With finite time and resources, ML practitioners are forced to …