关注
Yuexiang Zhai
Yuexiang Zhai
其他姓名Simon Zhai
在 berkeley.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Complete dictionary learning via l4-norm maximization over the orthogonal group
Y Zhai, Z Yang, Z Liao, J Wright, Y Ma
Journal of Machine Learning Research 21 (165), 1-68, 2020
672020
Learning to Reconstruct 3D Manhattan Wireframes from a Single Image
Y Zhou, H Qi, Y Zhai, Q Sun, Z Chen, LY Wei, Y Ma
International Conference on Computer Vision (ICCV), 2019, 2019
662019
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning
M Nakamoto, S Zhai, A Singh, M Sobol Mark, Y Ma, C Finn, A Kumar, ...
Advances in Neural Information Processing Systems 36, 2024
572024
Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning
Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee, Y Ma
Conference on Parsimony and Learning, 202-227, 2024
57*2024
Eyes wide shut? exploring the visual shortcomings of multimodal llms
S Tong, Z Liu, Y Zhai, Y Ma, Y LeCun, S Xie
arXiv preprint arXiv:2401.06209, 2024
502024
Unpacking reward shaping: Understanding the benefits of reward engineering on sample complexity
A Gupta, A Pacchiano, Y Zhai, S Kakade, S Levine
Advances in Neural Information Processing Systems 35, 15281-15295, 2022
342022
Geometric analysis of nonconvex optimization landscapes for overcomplete learning
Q Qu, Y Zhai, X Li, Y Zhang, Z Zhu
International Conference on Learning Representations, 2020
312020
Convolutional normalization: Improving deep convolutional network robustness and training
S Liu, X Li, Y Zhai, C You, Z Zhu, C Fernandez-Granda, Q Qu
Advances in neural information processing systems 34, 28919-28928, 2021
242021
Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness
Y Zhai, H Mehta, Z Zhou, Y Ma
International Conference on Learning Representations (ICLR), 2020, 2020
212020
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Y Zhai, C Baek, Z Zhou, J Jiao, Y Ma
Journal of Artificial Intelligence Research 73, 847-896, 2022
202022
Analysis of the optimization landscapes for overcomplete representation learning
Q Qu, Y Zhai, X Li, Y Zhang, Z Zhu
arXiv preprint arXiv:1912.02427, 2019
172019
Lmrl gym: Benchmarks for multi-turn reinforcement learning with language models
M Abdulhai, I White, CV Snell, C Sun, J Hong, Y Zhai, K Xu, S Levine
132023
Understanding the complexity gains of single-task rl with a curriculum
Q Li, Y Zhai, Y Ma, S Levine
International Conference on Machine Learning, 20412-20451, 2023
122023
RLIF: Interactive Imitation Learning as Reinforcement Learning
J Luo, P Dong, Y Zhai, Y Ma, S Levine
arXiv preprint arXiv:2311.12996, 2023
42023
Closed-Loop Transcription via Convolutional Sparse Coding
X Dai, K Chen, S Tong, J Zhang, X Gao, M Li, D Pai, Y Zhai, XI Yuan, ...
arXiv preprint arXiv:2302.09347, 2023
32023
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
Y Yu, S Buchanan, D Pai, T Chu, Z Wu, S Tong, H Bai, Y Zhai, ...
arXiv preprint arXiv:2311.13110, 2023
22023
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Y Zhai, H Bai, Z Lin, J Pan, S Tong, Y Zhou, A Suhr, S Xie, Y LeCun, Y Ma, ...
arXiv preprint arXiv:2405.10292, 2024
12024
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement
R Zhang, Y Zhai, A Zanette
arXiv preprint arXiv:2402.15703, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–18