- 学术资源搜索

Efficient decentralized multi-agent learning in asymmetric queuing systems

D Freund, T Lykouris, W Weng - Conference on Learning …, 2022 - proceedings.mlr.press

We study decentralized multi-agent learning in bipartite queuing systems, a standard model
for service systems. In particular, N agents request service from K servers in a fully …

被引用次数：18 相关文章

[PDF] neurips.cc

Quantifying the cost of learning in queueing systems

D Freund, T Lykouris, W Weng - Advances in Neural …, 2024 - proceedings.neurips.cc

Queueing systems are widely applicable stochastic models with use cases in
communication networks, healthcare, service systems, etc. Although their optimal control …

被引用次数：3 相关文章所有 5 个版本

[PDF] openreview.net

Learning to schedule tasks with deadline and throughput constraints

Q Liu, Z Fang - IEEE INFOCOM 2023-IEEE Conference on …, 2023 - ieeexplore.ieee.org

We consider the task scheduling scenario where the controller activates one from K task
types at each time. Each task induces a random completion time, and a reward is obtained …

被引用次数：17 相关文章所有 2 个版本

[PDF] mlr.press

Learning while scheduling in multi-server systems with unknown statistics: Maxweight with discounted ucb

Z Yang, R Srikant, L Ying - International Conference on …, 2023 - proceedings.mlr.press

Multi-server queueing systems are widely used models for job scheduling in machine
learning, wireless networks, and crowdsourcing. This paper considers a multi-server system …

被引用次数：18 相关文章所有 6 个版本

[PDF] arxiv.org

Experimenting under stochastic congestion

S Li, R Johari, X Kuang, S Wager - arXiv preprint arXiv:2302.12093, 2023 - arxiv.org

We study randomized experiments in a service system when stochastic congestion can arise
from temporarily limited supply and/or demand. Such congestion gives rise to cross-unit …

被引用次数：18 相关文章所有 3 个版本

[PDF] aaai.org

Decentralized scheduling with qos constraints: Achieving o (1) qos regret of multi-player bandits

Q Liu, Z Fang - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

We consider a decentralized multi-player multi-armed bandit (MP-MAB) problem where
players cannot observe the actions and rewards of other players and no explicit …

被引用次数：3 相关文章

[PDF] arxiv.org

Learning to defer in content moderation: The human-ai interplay

T Lykouris, W Weng - arXiv preprint arXiv:2402.12237, 2024 - arxiv.org

Successful content moderation in online platforms relies on a human-AI collaboration
approach. A typical heuristic estimates the expected harmfulness of a post and uses fixed …

被引用次数：6 相关文章所有 2 个版本

[PDF] neurips.cc

Bayesian learning of optimal policies in markov decision processes with countably infinite state-space

S Adler, V Subramanian - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Abstract Models of many real-life applications, such as queueing models of communication
networks or computing systems, have a countably infinite state-space. Algorithmic and …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Queue scheduling with adversarial bandit learning

J Huang, L Golubchik, L Huang - arXiv preprint arXiv:2303.01745, 2023 - arxiv.org

In this paper, we study scheduling of a queueing system with zero knowledge of
instantaneous network conditions. We consider a one-hop single-server queueing system …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

Mean-field analysis for load balancing on spatial graphs

D Rutten, D Mukherjee - Abstract Proceedings of the 2023 ACM …, 2023 - dl.acm.org

A pivotal methodological tool behind the analysis of large-scale load balancing systems is
mean-field analysis. The high-level idea is to represent the system state by aggregate …

被引用次数：9 相关文章所有 6 个版本