Rankitect: Ranking architecture search battling world-class engineers at meta scale

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

Rankitect: Ranking architecture search battling world-class engineers at meta scale

在引用文章中搜索

[PDF] arxiv.org

Finite-time convergence and sample complexity of actor-critic multi-objective reinforcement learning

T Zhou, FNU Hairi, H Yang, J Liu, T Tong… - arXiv preprint arXiv …, 2024 - arxiv.org

Reinforcement learning with multiple, potentially conflicting objectives is pervasive in real-
world applications, while this problem remains theoretically under-explored. This paper …

被引用次数：1 相关文章所有 5 个版本

[PDF] openreview.net

PILOT: An -Convergent Approach for Policy Evaluation with Nonlinear Function Approximation

Z Liu, X Zhang, J Liu, Z Zhu, S Lu - The Twelfth International Conference on … - openreview.net

Learning an accurate value function for a given policy is a critical step in solving
reinforcement learning (RL) problems. So far, however, the convergence speed and sample …

[PDF] ohiolink.edu

Complex-Structured Optimization Problems in Distributed Learning

Z Liu - 2024 - search.proquest.com

In recent years, machine learning (ML) has achieved astonishing success in many areas,
including robotics, image recognition, natural language processing, and recommender …