Wild-time: A benchmark of in-the-wild distribution shift over time
Distribution shifts occur when the test distribution differs from the training distribution, and
can considerably degrade performance of machine learning models deployed in the real …
can considerably degrade performance of machine learning models deployed in the real …
ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing
Evaluating outputs of large language models (LLMs) is challenging, requiring making—and
making sense of—many responses. Yet tools that go beyond basic prompting tend to require …
making sense of—many responses. Yet tools that go beyond basic prompting tend to require …
Understanding and mitigating bias in imaging artificial intelligence
Artificial intelligence (AI) algorithms are prone to bias at multiple stages of model
development, with potential for exacerbating health disparities. However, bias in imaging AI …
development, with potential for exacerbating health disparities. However, bias in imaging AI …
Developing machine learning systems worthy of trust for infection science: a requirement for future implementation into clinical practice
Infection science is a discipline of healthcare which includes clinical microbiology, public
health microbiology, mechanisms of microbial disease, and antimicrobial countermeasures …
health microbiology, mechanisms of microbial disease, and antimicrobial countermeasures …
On the Robustness of ML-Based Network Intrusion Detection Systems: An Adversarial and Distribution Shift Perspective
Utilizing machine learning (ML)-based approaches for network intrusion detection systems
(NIDSs) raises valid concerns due to the inherent susceptibility of current ML models to …
(NIDSs) raises valid concerns due to the inherent susceptibility of current ML models to …
Diagnosing model performance under distribution shift
Prediction models can perform poorly when deployed to target distributions different from the
training distribution. To understand these operational failure modes, we develop a method …
training distribution. To understand these operational failure modes, we develop a method …
Enhancing false negative and positive rates for efficient insider threat detection
M AlSlaiman, MI Salman, MM Saleh, B Wang - Computers & Security, 2023 - Elsevier
Insider threats on information security can become a burden for organizations. However,
outsider attacks have received more attention compared to insider attacks. Many …
outsider attacks have received more attention compared to insider attacks. Many …
Systematic evaluation of local and global machine learning models for the prediction of ADME properties
E Di Lascio, G Gerebtzoff… - Molecular …, 2023 - ACS Publications
Machine learning (ML) has become an indispensable tool to predict absorption, distribution,
metabolism, and excretion (ADME) properties in pharmaceutical research. ML algorithms …
metabolism, and excretion (ADME) properties in pharmaceutical research. ML algorithms …
Explainability through uncertainty: Trustworthy decision-making with neural networks
Uncertainty is a key feature of any machine learning model and is particularly important in
neural networks, which tend to be overconfident. This overconfidence is worrying under …
neural networks, which tend to be overconfident. This overconfidence is worrying under …
Angler: Helping machine translation practitioners prioritize model improvements
Machine learning (ML) models can fail in unexpected ways in the real world, but not all
model failures are equal. With finite time and resources, ML practitioners are forced to …
model failures are equal. With finite time and resources, ML practitioners are forced to …