Designing machine learning systems

H Yao, C Choi, B Cao, Y Lee… - Advances in Neural …, 2022 - proceedings.neurips.cc

Distribution shifts occur when the test distribution differs from the training distribution, and
can considerably degrade performance of machine learning models deployed in the real …

被引用次数：76 相关文章所有 9 个版本

[PDF] arxiv.org

ChainForge: A Visual Toolkit for Prompt Engineering and LLM Hypothesis Testing

I Arawjo, C Swoopes, P Vaithilingam… - Proceedings of the CHI …, 2024 - dl.acm.org

Evaluating outputs of large language models (LLMs) is challenging, requiring making—and
making sense of—many responses. Yet tools that go beyond basic prompting tend to require …

被引用次数：49 相关文章所有 4 个版本

[PDF] rsna.org

Understanding and mitigating bias in imaging artificial intelligence

AS Tejani, YS Ng, Y Xi, JC Rayan - RadioGraphics, 2024 - pubs.rsna.org

Artificial intelligence (AI) algorithms are prone to bias at multiple stages of model
development, with potential for exacerbating health disparities. However, bias in imaging AI …

被引用次数：16 相关文章所有 5 个版本

[PDF] frontiersin.org

Developing machine learning systems worthy of trust for infection science: a requirement for future implementation into clinical practice

BR McFadden, M Reynolds, TJJ Inglis - Frontiers in Digital Health, 2023 - frontiersin.org

Infection science is a discipline of healthcare which includes clinical microbiology, public
health microbiology, mechanisms of microbial disease, and antimicrobial countermeasures …

被引用次数：5 相关文章所有 8 个版本

[PDF] mdpi.com

On the Robustness of ML-Based Network Intrusion Detection Systems: An Adversarial and Distribution Shift Perspective

M Wang, N Yang, DH Gunasinghe, N Weng - Computers, 2023 - mdpi.com

Utilizing machine learning (ML)-based approaches for network intrusion detection systems
(NIDSs) raises valid concerns due to the inherent susceptibility of current ML models to …

被引用次数：5 相关文章所有 4 个版本

[PDF] arxiv.org

Diagnosing model performance under distribution shift

TT Cai, H Namkoong, S Yadlowsky - arXiv preprint arXiv:2303.02011, 2023 - arxiv.org

Prediction models can perform poorly when deployed to target distributions different from the
training distribution. To understand these operational failure modes, we develop a method …

被引用次数：22 相关文章所有 2 个版本

Enhancing false negative and positive rates for efficient insider threat detection

M AlSlaiman, MI Salman, MM Saleh, B Wang - Computers & Security, 2023 - Elsevier

Insider threats on information security can become a burden for organizations. However,
outsider attacks have received more attention compared to insider attacks. Many …

被引用次数：29 相关文章所有 2 个版本

Systematic evaluation of local and global machine learning models for the prediction of ADME properties

E Di Lascio, G Gerebtzoff… - Molecular …, 2023 - ACS Publications

Machine learning (ML) has become an indispensable tool to predict absorption, distribution,
metabolism, and excretion (ADME) properties in pharmaceutical research. ML algorithms …

被引用次数：20 相关文章所有 5 个版本

[PDF] arxiv.org

Explainability through uncertainty: Trustworthy decision-making with neural networks

A Thuy, DF Benoit - European Journal of Operational Research, 2024 - Elsevier

Uncertainty is a key feature of any machine learning model and is particularly important in
neural networks, which tend to be overconfident. This overconfidence is worrying under …

被引用次数：13 相关文章所有 7 个版本

[PDF] acm.org

Angler: Helping machine translation practitioners prioritize model improvements

S Robertson, ZJ Wang, D Moritz, MB Kery… - Proceedings of the 2023 …, 2023 - dl.acm.org

Machine learning (ML) models can fail in unexpected ways in the real world, but not all
model failures are equal. With finite time and resources, ML practitioners are forced to …

被引用次数：14 相关文章所有 3 个版本