Dynamic pricing with limited supply

AV Den Boer - Surveys in operations research and management …, 2015 - Elsevier

The topic of dynamic pricing and learning has received a considerable amount of attention
in recent years, from different scientific communities. We survey these literature streams: we …

被引用次数：622 相关文章所有 13 个版本

[PDF] nowpublishers.com

Introduction to multi-armed bandits

A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com

Multi-armed bandits a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …

被引用次数：1217 相关文章所有 7 个版本

[PDF] acm.org

Bandits with knapsacks

A Badanidiyuru, R Kleinberg, A Slivkins - Journal of the ACM (JACM), 2018 - dl.acm.org

Multi-armed bandit problems are the predominant theoretical model of exploration-
exploitation tradeoffs in learning, and they have countless applications ranging from medical …

被引用次数：527 相关文章所有 11 个版本

[PDF] washington.edu

A perspective on incentive design: Challenges and opportunities

LJ Ratliff, R Dong, S Sekar, T Fiez - Annual Review of Control …, 2019 - annualreviews.org

The increasingly tight coupling between humans and system operations in domains ranging
from intelligent infrastructure to e-commerce has led to a challenging new class of problems …

被引用次数：57 相关文章所有 6 个版本

[PDF] ssrn.com

Feature-based dynamic pricing

MC Cohen, I Lobel, R Paes Leme - Management Science, 2020 - pubsonline.informs.org

We consider the problem faced by a firm that receives highly differentiated products in an
online fashion. The firm needs to price these products to sell them to its customer base …

被引用次数：242 相关文章所有 17 个版本

[PDF] psu.edu

Truthful incentives in crowdsourcing tasks using regret minimization mechanisms

A Singla, A Krause - Proceedings of the 22nd international conference …, 2013 - dl.acm.org

What price should be offered to a worker for a task in an online labor market? How can one
enable workers to express the amount they desire to receive for the task completion …

被引用次数：348 相关文章所有 8 个版本

[PDF] arxiv.org

MNL-bandit: A dynamic learning approach to assortment selection

S Agrawal, V Avadhanula, V Goyal… - Operations …, 2019 - pubsonline.informs.org

We consider a dynamic assortment selection problem where in every round the retailer
offers a subset (assortment) of N substitutable products to a consumer, who selects one of …

被引用次数：192 相关文章所有 11 个版本

[PDF] arxiv.org

Bandits with concave rewards and convex knapsacks

S Agrawal, NR Devanur - Proceedings of the fifteenth ACM conference …, 2014 - dl.acm.org

In this paper, we consider a very general model for exploration-exploitation tradeoff which
allows arbitrary concave rewards and convex constraints on the decisions across time, in …

被引用次数：238 相关文章所有 6 个版本

[PDF] arxiv.org

Adversarial bandits with knapsacks

N Immorlica, K Sankararaman, R Schapire… - Journal of the ACM, 2022 - dl.acm.org

We consider Bandits with Knapsacks (henceforth, BwK), a general model for multi-armed
bandits under supply/budget constraints. In particular, a bandit algorithm needs to solve a …

被引用次数：133 相关文章所有 12 个版本

[PDF] siam.org

Fast algorithms for online stochastic convex programming

S Agrawal, NR Devanur - Proceedings of the twenty-sixth annual ACM-SIAM …, 2014 - SIAM

We introduce the online stochastic Convex Programming (CP) problem, a very general
version of stochastic online problems which allows arbitrary concave objectives and convex …

被引用次数：203 相关文章所有 8 个版本