Query complexity of derivative-free optimization

B Recht - Annual Review of Control, Robotics, and Autonomous …, 2019 - annualreviews.org

This article surveys reinforcement learning from the perspective of optimization and control,
with a focus on continuous control applications. It reviews the general formulation …

被引用次数：718 相关文章所有 5 个版本

[PDF] neurips.cc

Fine-tuning language models with just forward passes

S Malladi, T Gao, E Nichani… - Advances in …, 2023 - proceedings.neurips.cc

Fine-tuning language models (LMs) has yielded success on diverse downstream tasks, but
as LMs grow in size, backpropagation requires a prohibitively large amount of memory …

被引用次数：96 相关文章所有 6 个版本

[图书][B] Control systems and reinforcement learning

S Meyn - 2022 - books.google.com

A high school student can create deep Q-learning code to control her robot, without any
understanding of the meaning of'deep'or'Q', or why the code sometimes fails. This book is …

被引用次数：128 相关文章所有 3 个版本

[PDF] arxiv.org

Derivative-free optimization methods

J Larson, M Menickelly, SM Wild - Acta Numerica, 2019 - cambridge.org

In many optimization problems arising from scientific, engineering and artificial intelligence
applications, objective and constraint functions are available only as the output of a black …

被引用次数：457 相关文章所有 9 个版本

[PDF] arxiv.org

Derivative-free reinforcement learning: A review

H Qian, Y Yu - Frontiers of Computer Science, 2021 - Springer

Reinforcement learning is about learning agent models that make the best sequential
decisions in unknown environments. In an unknown environment, the agent needs to …

被引用次数：43 相关文章所有 8 个版本

[PDF] neurips.cc

Simple random search of static linear policies is competitive for reinforcement learning

H Mania, A Guy, B Recht - Advances in neural information …, 2018 - proceedings.neurips.cc

Abstract Model-free reinforcement learning aims to offer off-the-shelf solutions for controlling
dynamical systems without requiring models of the system dynamics. We introduce a model …

被引用次数：292 相关文章所有 5 个版本

[PDF] arxiv.org

Simple random search provides a competitive approach to reinforcement learning

H Mania, A Guy, B Recht - arXiv preprint arXiv:1803.07055, 2018 - arxiv.org

A common belief in model-free reinforcement learning is that methods based on random
search in the parameter space of policies exhibit significantly worse sample complexity than …

被引用次数：374 相关文章所有 2 个版本

[PDF] jmlr.org

Derivative-free methods for policy optimization: Guarantees for linear quadratic systems

D Malik, A Pananjady, K Bhatia, K Khamaru… - Journal of Machine …, 2020 - jmlr.org

We study derivative-free methods for policy optimization over the class of linear policies. We
focus on characterizing the convergence rate of these methods when applied to linear …

被引用次数：208 相关文章所有 10 个版本

[PDF] arxiv.org

Optimal rates for zero-order convex optimization: The power of two function evaluations

JC Duchi, MI Jordan, MJ Wainwright… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org

We consider derivative-free algorithms for stochastic and nonstochastic convex optimization
problems that use only function values rather than gradients. Focusing on nonasymptotic …

被引用次数：503 相关文章所有 9 个版本

[PDF] google.com

Radiative backpropagation: An adjoint method for lightning-fast differentiable rendering

M Nimier-David, S Speierer, B Ruiz… - ACM Transactions on …, 2020 - dl.acm.org

Physically based differentiable rendering has recently evolved into a powerful tool for
solving inverse problems involving light. Methods in this area perform a differentiable …

被引用次数：104 相关文章所有 4 个版本