Inferring agent objectives at different scales of a complex adaptive system

文章

学术资源搜索

获得 5 条结果（用时0.07秒）

我的图书馆

Inferring agent objectives at different scales of a complex adaptive system

在引用文章中搜索

[PDF] arxiv.org

Adversarial recovery of agent rewards from latent spaces of the limit order book

J Roa-Vicens, Y Wang, V Mison, Y Gal… - arXiv preprint arXiv …, 2019 - arxiv.org

Inverse reinforcement learning has proved its ability to explain state-action trajectories of
expert agents by recovering their underlying reward functions in increasingly challenging …

被引用次数：3 相关文章所有 9 个版本

[PDF] arxiv.org

Towards Inverse Reinforcement Learning for Limit Order Book Dynamics

J Roa-Vicens, C Chtourou, A Filos, F Rullan… - arXiv preprint arXiv …, 2019 - arxiv.org

Multi-agent learning is a promising method to simulate aggregate competitive behaviour in
finance. Learning expert agents' reward functions through their external demonstrations is …

被引用次数：3 相关文章所有 6 个版本

[PDF] sagepub.com Full View

Portfolio optimization for cointelated pairs: SDEs vs Machine learning

B Mahdavi-Damghani, K Mustafayeva… - Algorithmic …, 2021 - journals.sagepub.com

With the recent rise of Machine Learning (ML) as a candidate to partially replace classic
Financial Mathematics (FM) methodologies, we investigate the performances of both in …

被引用次数：2 相关文章所有 12 个版本

[PDF] researchgate.net

[PDF][PDF] Portfolio Optimization for Cointelated Pairs: Financial Mathematics or Machine Learning?

B Mahdavi-Damghani, K Mustafayeva, C Buescu… - researchgate.net

We investigate the problem of dynamic portfolio optimization in continuous-time, finite-
horizon setting for a portfolio of two stocks. The stocks follow the Cointelation model recently …

[引用][C] Financial Mathematics or Machine Learning in the context of Portfolio Optimization for Cointelated Pairs?

B Mahdavi-Damghani, K Mustafayeva, S Roberts…