Low-rank tensor bandits

Y Kang, CJ Hsieh, TCM Lee - Advances in Neural …, 2022 - proceedings.neurips.cc

In the stochastic contextual low-rank matrix bandit problem, the expected reward of an action
is given by the inner product between the action's feature matrix and some fixed, but initially …

被引用次数：20 相关文章所有 4 个版本

[PDF] neurips.cc

An analysis of ensemble sampling

C Qin, Z Wen, X Lu, B Van Roy - Advances in Neural …, 2022 - proceedings.neurips.cc

Ensemble sampling serves as a practical approximation to Thompson sampling when
maintaining an exact posterior distribution over model parameters is computationally …

被引用次数：27 相关文章所有 8 个版本

[PDF] mlr.press

Optimal algorithms for latent bandits with cluster structure

S Pal, AS Suggala, K Shanmugam… - … Conference on Artificial …, 2023 - proceedings.mlr.press

We consider the problem of latent bandits with cluster structure where there are multiple
users, each with an associated multi-armed bandit problem. These users are grouped into …

被引用次数：8 相关文章所有 4 个版本

[PDF] neurips.cc

Optimal gradient-based algorithms for non-concave bandit optimization

B Huang, K Huang, S Kakade, JD Lee… - Advances in …, 2021 - proceedings.neurips.cc

Bandit problems with linear or concave reward have been extensively studied, but relatively
few works have studied bandits with non-concave reward. This work considers a large family …

被引用次数：17 相关文章所有 11 个版本

[PDF] arxiv.org

Online low rank matrix completion

P Jain, S Pal - arXiv preprint arXiv:2209.03997, 2022 - arxiv.org

We study the problem of {\em online} low-rank matrix completion with $\mathsf {M} $ users,
$\mathsf {N} $ items and $\mathsf {T} $ rounds. In each round, the algorithm recommends …

被引用次数：14 相关文章所有 2 个版本

[PDF] arxiv.org

Speed up the cold-start learning in two-sided bandits with many arms

M Bayati, J Cao, W Chen - arXiv preprint arXiv:2210.00340, 2022 - arxiv.org

Multi-armed bandit (MAB) algorithms are efficient approaches to reduce the opportunity cost
of online experimentation and are used by companies to find the best product from …

被引用次数：8 相关文章所有 2 个版本

[PDF] arxiv.org

Targeted advertising on social networks using online variational tensor regression

T Idé, K Murugesan, D Bouneffouf, N Abe - arXiv preprint arXiv …, 2022 - arxiv.org

This paper is concerned with online targeted advertising on social networks. The main
technical task we address is to estimate the activation probability for user pairs, which …

被引用次数：7 相关文章所有 2 个版本

[PDF] arxiv.org

Online matrix completion: A collaborative approach with hott items

D Baby, S Pal - arXiv preprint arXiv:2408.05843, 2024 - arxiv.org

We investigate the low rank matrix completion problem in an online setting with ${M} $
users, ${N} $ items, ${T} $ rounds, and an unknown rank-$ r $ reward matrix ${R}\in\mathbb …

被引用次数：1 相关文章

[PDF] openreview.net

Online Low Rank Matrix Completion

S Pal, P Jain - The Eleventh International Conference on Learning …, 2022 - openreview.net

We study the problem of online low-rank matrix completion with $\mathsf {M} $ users,
$\mathsf {N} $ items and $\mathsf {T} $ rounds. In each round, the algorithm recommends …

被引用次数：3 相关文章

[PDF] arxiv.org

Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems

Y Kang, CJ Hsieh, T Lee - arXiv preprint arXiv:2401.07298, 2024 - arxiv.org

In the stochastic contextual low-rank matrix bandit problem, the expected reward of an action
is given by the inner product between the action's feature matrix and some fixed, but initially …

Efficient frameworks for generalized low-rank matrix bandit problems

An analysis of ensemble sampling

Optimal algorithms for latent bandits with cluster structure

Optimal gradient-based algorithms for non-concave bandit optimization

Online low rank matrix completion

Speed up the cold-start learning in two-sided bandits with many arms

Targeted advertising on social networks using online variational tensor regression

Online matrix completion: A collaborative approach with hott items

Online Low Rank Matrix Completion

Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems

高级搜索

引用