Defining levels of automated chemical design

B Goldman, S Kearnes, T Kramer, P Riley… - Journal of medicinal …, 2022 - ACS Publications
One application area of computational methods in drug discovery is the automated design of
small molecules. Despite the large number of publications describing methods and their …

A survey on uncertainty reasoning and quantification in belief theory and its application to deep learning

Z Guo, Z Wan, Q Zhang, X Zhao, Q Zhang, LM Kaplan… - Information …, 2024 - Elsevier
An in-depth understanding of uncertainty is the first step to making effective decisions under
uncertainty. Machine/deep learning (ML/DL) has been hugely leveraged to solve complex …

A survey of reinforcement learning from human feedback

T Kaufmann, P Weng, V Bengs… - arXiv preprint arXiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning
(RL) that learns from human feedback instead of relying on an engineered reward function …

Semantic anomaly detection with large language models

A Elhafsi, R Sinha, C Agia, E Schmerling… - Autonomous …, 2023 - Springer
As robots acquire increasingly sophisticated skills and see increasingly complex and varied
environments, the threat of an edge case or anomalous failure is ever present. For example …

Do Bayesian neural networks need to be fully stochastic?

M Sharma, S Farquhar, E Nalisnick… - International …, 2023 - proceedings.mlr.press
We investigate the benefit of treating all the parameters in a Bayesian neural network
stochastically and find compelling theoretical and empirical evidence that this standard …

Selectively answering ambiguous questions

JR Cole, MJQ Zhang, D Gillick, JM Eisenschlos… - arXiv preprint arXiv …, 2023 - arxiv.org
Trustworthy language models should abstain from answering questions when they do not
know the answer. However, the answer to a question can be unknown for a variety of …

Truncation sampling as language model desmoothing

J Hewitt, CD Manning, P Liang - arXiv preprint arXiv:2210.15191, 2022 - arxiv.org
Long samples of text from neural language models can be of poor quality. Truncation
sampling algorithms, like top-$p$ or top-$k$, address this by setting some words' …

Deep ensembles work, but are they necessary?

T Abe, EK Buchanan, G Pleiss… - Advances in …, 2022 - proceedings.neurips.cc
Ensembling neural networks is an effective way to increase accuracy, and can often match
the performance of individual larger models. This observation poses a natural question …

On the practicality of deterministic epistemic uncertainty

J Postels, M Segu, T Sun, L Sieber, L Van Gool… - arXiv preprint arXiv …, 2021 - arxiv.org
A set of novel approaches for estimating epistemic uncertainty in deep neural networks with
a single forward pass has recently emerged as a valid alternative to Bayesian Neural …

Self-exploring language models: Active preference elicitation for online alignment

S Zhang, D Yu, H Sharma, H Zhong, Z Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Preference optimization, particularly through Reinforcement Learning from Human
Feedback (RLHF), has achieved significant success in aligning Large Language Models …