Qualitative multi-armed bandits: A quantile-based approach

V Bengs, R Busa-Fekete, A El Mesaoudi-Paul… - Journal of Machine …, 2021 - jmlr.org

In machine learning, the notion of multi-armed bandits refers to a class of online learning
problems, in which an agent is supposed to simultaneously explore and exploit a given set …

被引用次数：102 相关文章所有 7 个版本

[PDF] mlr.press

Optimal thompson sampling strategies for support-aware cvar bandits

D Baudry, R Gautron, E Kaufmann… - … on Machine Learning, 2021 - proceedings.mlr.press

In this paper we study a multi-arm bandit problem in which the quality of each arm is
measured by the Conditional Value at Risk (CVaR) at some level alpha of the reward …

被引用次数：40 相关文章所有 12 个版本

[PDF] arxiv.org

Sequential estimation of quantiles with applications to A/B testing and best-arm identification

SR Howard, A Ramdas - Bernoulli, 2022 - projecteuclid.org

Sequential estimation of quantiles with applications to A/B testing and best-arm identification
Page 1 Bernoulli 28(3), 2022, 1704–1728 https://doi.org/10.3150/21-BEJ1388 Sequential …

被引用次数：67 相关文章所有 7 个版本

[PDF] sciencedirect.com

Risk verification of stochastic systems with neural network controllers

M Cleaveland, L Lindemann, R Ivanov, GJ Pappas - Artificial Intelligence, 2022 - Elsevier

Motivated by the fragility of neural network (NN) controllers in safety-critical applications, we
present a data-driven framework for verifying the risk of stochastic dynamical systems with …

被引用次数：9 相关文章所有 8 个版本

[PDF] arxiv.org

STL robustness risk over discrete-time stochastic processes

L Lindemann, N Matni… - 2021 60th IEEE …, 2021 - ieeexplore.ieee.org

We present a framework to interpret signal temporal logic (STL) formulas over discrete-time
stochastic processes in terms of the induced risk. Each realization of a stochastic process …

被引用次数：20 相关文章所有 7 个版本

[PDF] google.com

Quantile context-aware social IoT service big data recommendation with D2D communication

Y Yang, J Xu, Z Xu, P Zhou, T Qiu - IEEE Internet of Things …, 2020 - ieeexplore.ieee.org

With the rapid development of the Internet-of-Things (IoT) networks, millions of IoT services
provided through wireless networks are waiting for people's exploration. Such a large …

被引用次数：26 相关文章所有 3 个版本

[PDF] neurips.cc

Distribution-free model-agnostic regression calibration via nonparametric methods

S Liu, Z Cai, X Li - Advances in Neural Information …, 2024 - proceedings.neurips.cc

In this paper, we consider the uncertainty quantification problem for regression models.
Specifically, we consider an individual calibration objective for characterizing the quantiles …

被引用次数：3 相关文章所有 5 个版本

[PDF] acm.org

Risk of stochastic systems for temporal logic specifications

L Lindemann, L Jiang, N Matni, GJ Pappas - ACM Transactions on …, 2023 - dl.acm.org

The wide availability of data coupled with the computational advances in artificial
intelligence and machine learning promise to enable many future technologies such as …

被引用次数：8 相关文章所有 7 个版本

[PDF] acm.org

Rapid regression detection in software deployments through sequential testing

M Lindon, C Sanden, V Shirikian - … of the 28th ACM SIGKDD Conference …, 2022 - dl.acm.org

The practice of continuous deployment has enabled companies to reduce time-to-market by
increasing the rate at which software can be deployed. However, deploying more frequently …

被引用次数：11 相关文章所有 3 个版本

[PDF] mlr.press

Quantile bandits for best arms identification

M Zhang, CS Ong - International conference on machine …, 2021 - proceedings.mlr.press

We consider a variant of the best arm identification task in stochastic multi-armed bandits.
Motivated by risk-averse decision-making problems, our goal is to identify a set of $ m …

被引用次数：16 相关文章所有 5 个版本