Improving online marketing experiments with drifting multi-armed bandits- 学术资源搜索

Improving online marketing experiments with drifting multi-armed bandits

G Burtini, J Loeppky, R Lawrence - International Conference on …, 2015 - scitepress.org

International Conference on Enterprise Information Systems, 2015•scitepress.org

Restless bandits model the exploration vs. exploitation trade-off in a changing (non-stationary) world. Restless bandits have been studied in both the context of continuously-changing (drifting) and change-point (sudden) restlessness. In this work, we study specific classes of drifting restless bandits selected for their relevance to modelling an online website optimization process. The contribution in this work is a simple, feasible weighted least squares technique capable of utilizing contextual arm parameters while considering the parameter space drifting non-stationary within reasonable bounds. We produce a reference implementation, then evaluate and compare its performance in several different true world states, finding experimentally that performance is robust to time drifting factors similar to those seen in many real world cases.

scitepress.org

展开收起

被引用次数：31 相关文章所有 7 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果