Tianjun Zhang
Title
Cited by
Year
Gorilla: Large language model connected with massive apis
SG Patil, T Zhang, X Wang, JE Gonzalez
arXiv preprint arXiv:2305.15334, 2023
Cited by: 217; Year: 2023
AgentBench: Evaluating LLMs as Agents
X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ...
arXiv preprint arXiv:2308.03688, 2023
Cited by: 194*; Year: 2023
Synetgy: Algorithm-hardware co-design for convnet accelerators on embedded fpgas
Y Yang, Q Huang, B Wu, T Zhang, L Ma, G Gambardella, M Blott, ...
Proceedings of the 2019 ACM/SIGDA international symposium on field …, 2019
Cited by: 137; Year: 2019
Contrastive code representation learning
P Jain, A Jain, T Zhang, P Abbeel, JE Gonzalez, I Stoica
arXiv preprint arXiv:2007.04973, 2020
Cited by: 128; Year: 2020
GenAx: A genome sequencing accelerator
D Fujiki, A Subramaniyan, T Zhang, Y Zeng, R Das, D Blaauw, ...
2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018
Cited by: 101; Year: 2018
ANODEV2: A coupled neural ODE framework
T Zhang, Z Yao, A Gholami, JE Gonzalez, K Keutzer, MW Mahoney, ...
Advances in Neural Information Processing Systems 32, 2019
Cited by: 97; Year: 2019
Contrastive learning as goal-conditioned reinforcement learning
B Eysenbach, T Zhang, S Levine, RR Salakhutdinov
Advances in Neural Information Processing Systems 35, 35603-35620, 2022
Cited by: 92; Year: 2022
Tempera: Test-time prompting via reinforcement learning
T Zhang, X Wang, D Zhou, D Schuurmans, JE Gonzalez
arXiv preprint arXiv:2211.11890, 2022
Cited by: 83; Year: 2022
Noveld: A simple yet effective exploration criterion
T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian
Advances in Neural Information Processing Systems 34, 25217-25230, 2021
Cited by: 59; Year: 2021
Bebold: Exploration beyond the boundary of explored regions
T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian
arXiv preprint arXiv:2012.08621, 2020
Cited by: 46; Year: 2020
Made: Exploration via maximizing deviation from explored regions
T Zhang, P Rashidinejad, J Jiao, Y Tian, JE Gonzalez, S Russell
Advances in Neural Information Processing Systems 34, 9663-9680, 2021
Cited by: 42; Year: 2021
Making linear mdps practical via contrastive representation learning
T Zhang, T Ren, M Yang, J Gonzalez, D Schuurmans, B Dai
International Conference on Machine Learning, 26447-26466, 2022
Cited by: 40; Year: 2022
The wisdom of hindsight makes language models better instruction followers
T Zhang, F Liu, J Wong, P Abbeel, JE Gonzalez
International Conference on Machine Learning, 41414-41428, 2023
Cited by: 37; Year: 2023
Multitask vision-language prompt tuning
S Shen, S Yang, T Zhang, B Zhai, JE Gonzalez, K Keutzer, T Darrell
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Cited by: 35; Year: 2024
Multi-agent collaboration via reward attribution decomposition
T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian
arXiv preprint arXiv:2010.08531, 2020
Cited by: 35; Year: 2020
Controllable text-to-image generation with gpt-4
T Zhang, Y Zhang, V Vineet, N Joshi, X Wang
arXiv preprint arXiv:2305.18583, 2023
Cited by: 30; Year: 2023
Efficient planning in a compact latent action space
Z Jiang, T Zhang, M Janner, Y Li, T Rocktäschel, E Grefenstette, Y Tian
arXiv preprint arXiv:2208.10291, 2022
Cited by: 29; Year: 2022
Raft: Adapting language model to domain specific rag
T Zhang, SG Patil, N Jain, S Shen, M Zaharia, I Stoica, JE Gonzalez
arXiv preprint arXiv:2403.10131, 2024
Cited by: 28; Year: 2024
Spectral decomposition representation for reinforcement learning
T Ren, T Zhang, L Lee, JE Gonzalez, D Schuurmans, B Dai
arXiv preprint arXiv:2208.09515, 2022
Cited by: 23; Year: 2022
A free lunch from the noise: Provable and practical exploration for representation learning
T Ren, T Zhang, C Szepesvári, B Dai
Uncertainty in Artificial Intelligence, 1686-1696, 2022
Cited by: 19; Year: 2022