Dual-Directed Algorithm Design for Efficient Pure Exploration

C Qin, W You - arXiv preprint arXiv:2310.19319, 2023 - arxiv.org
We consider pure-exploration problems in the context of stochastic sequential adaptive
experiments with a finite set of alternative options. The goal of the decision-maker is to …

Simulation Budget Allocation for Improving Scheduling and Routing of Automated Guided Vehicles in Warehouse Management

GB Zhang, HB Li, XT Liu, YJ Peng - … of the Operations Research Society of …, 2024 - Springer
Simulation budget allocation is a widely used technique for evaluating and optimizing
dynamic discrete event stochastic systems via efficient sampling. In warehouse management …

Top-Two Thompson Sampling for Contextual Top-m_c Selection Problems

X Shi, Y Peng, G Zhang - arXiv preprint arXiv:2306.17704, 2023 - arxiv.org
We aim to efficiently allocate a fixed simulation budget to identify the top-m_c designs for
each context among a finite number of contexts. The performance of each design under a …

An Efficient Node Selection Policy for Monte Carlo Tree Search with Neural Networks

X Liu, Y Peng, G Zhang, R Zhou - INFORMS Journal on …, 2024 - pubsonline.informs.org
Monte Carlo tree search (MCTS) has been gaining increasing popularity, and the success of
AlphaGo has prompted a new trend of incorporating a value network and a policy network …

A Simulation Optimization Method for Scheduling Automated Guided Vehicles in a Stochastic Warehouse Management System

G Zhang, X Liu, Y Peng - 2023 Winter Simulation Conference …, 2023 - ieeexplore.ieee.org
We consider the problem of scheduling automated guided vehicles (AGVs) in a stochastic
warehouse management system. This problem was studied in the Case Study Competition …