Accelerating approximate thompson sampling with underdamped langevin monte carlo

H Zheng, W Deng, C Moya… - … Conference on Artificial …, 2024 - proceedings.mlr.press
Abstract Approximate Thompson sampling with Langevin Monte Carlo broadens its reach
from Gaussian posterior sampling to encompass more general smooth posteriors. However …

Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs

T Jin, HL Hsu, W Chang, P Xu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
We study the multi-agent multi-armed bandit (MAMAB) problem, where agents are factored
into overlapping groups. Each group represents a hyperedge, forming a hypergraph over …

Epsilon-Greedy Thompson Sampling to Bayesian Optimization

B Do, T Adebiyi, R Zhang - … of Computing and …, 2024 - asmedigitalcollection.asme.org
Bayesian optimization (BO) has become a powerful tool for solving simulation-based
engineering optimization problems thanks to its ability to integrate physical and …

Efficient and robust sequential decision making algorithms

P Xu - AI Magazine, 2024 - Wiley Online Library
Sequential decision‐making involves making informed decisions based on continuous
interactions with a complex environment. This process is ubiquitous in various applications …

{\epsilon}-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment

HL Hsu, Q Gao, M Pajic - arXiv preprint arXiv:2403.06814, 2024 - arxiv.org
Deep Brain Stimulation (DBS) stands as an effective intervention for alleviating the motor
symptoms of Parkinson's disease (PD). Traditional commercial DBS devices are only able to …

Only pay for what is uncertain: Variance-adaptive thompson sampling

A Saha, B Kveton - arXiv preprint arXiv:2303.09033, 2023 - arxiv.org
Most bandit algorithms assume that the reward variances or their upper bounds are known,
and that they are the same for all arms. This naturally leads to suboptimal performance and …

Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning

HL Hsu, W Wang, M Pajic, P Xu - arXiv preprint arXiv:2404.10728, 2024 - arxiv.org
We present the first study on provably efficient randomized exploration in cooperative multi-
agent reinforcement learning (MARL). We propose a unified algorithm framework for …

ϵ-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment

HL Hsu, Q Gao, M Pajic - 2024 ACM/IEEE 15th International …, 2024 - ieeexplore.ieee.org
Deep Brain Stimulation (DBS) stands as an effective intervention for alleviating the motor
symptoms of Parkinson's disease (PD). Traditional commercial DBS devices are only able to …

Joint User Association and Pairing in Multi-UAV-Assisted NOMA Networks: A Decaying-Epsilon Thompson Sampling Framework

B Uwizeyimana, M Abo-Zahhad, O Muta… - IEEE …, 2024 - ieeexplore.ieee.org
Unmanned aerial vehicles (UAVs) are expected to be integrated into future wireless
networks to offer services, especially in unreachable or congested areas. To improve the …

The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models

J Lee, CK Chiang, M Sugiyama - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Thompson sampling (TS) has been known for its outstanding empirical performance
supported by theoretical guarantees across various reward models in the classical …