Jesse Mu
Anthropic
Verified email at anthropic.com - Homepage
Title
Cited by
Year
STaR: Self-taught reasoner bootstrapping reasoning with reasoning
E Zelikman, YT Wu, J Mu, ND Goodman
Cited by: 364*; Year: 2022
Compositional explanations of neurons
J Mu, J Andreas
Advances in Neural Information Processing Systems 33, 17153-17163, 2020
Cited by: 177; Year: 2020
Parkinson's disease subtypes identified from cluster analysis of motor and non-motor symptoms
J Mu, KR Chaudhuri, C Bielza, J de Pedro-Cuesta, P Larrañaga, ...
Frontiers in aging neuroscience 9, 301, 2017
Cited by: 148; Year: 2017
Learning to compress prompts with gist tokens
J Mu, X Li, N Goodman
Advances in Neural Information Processing Systems 36, 2023
Cited by: 100; Year: 2023
Improving intrinsic exploration with language abstractions
J Mu, V Zhong, R Raileanu, M Jiang, N Goodman, T Rocktäschel, ...
Advances in Neural Information Processing Systems 35, 33947-33960, 2022
Cited by: 56; Year: 2022
Shaping visual representations with language for few-shot classification
J Mu, P Liang, N Goodman
arXiv preprint arXiv:1911.02683, 2019
Cited by: 46; Year: 2019
Active learning helps pretrained models learn the intended task
A Tamkin, D Nguyen, S Deshpande, J Mu, N Goodman
Advances in Neural Information Processing Systems 35, 28140-28153, 2022
Cited by: 36; Year: 2022
Emergent communication of generalizations
J Mu, N Goodman
Advances in neural information processing systems 34, 17994-18007, 2021
Cited by: 35; Year: 2021
Sleeper agents: Training deceptive llms that persist through safety training
E Hubinger, C Denison, J Mu, M Lambert, M Tong, M MacDiarmid, ...
arXiv preprint arXiv:2401.05566, 2024
Cited by: 33; Year: 2024
Many-shot jailbreaking
C Anil, E Durmus, M Sharma, J Benton, S Kundu, J Batson, N Rimsky, ...
Anthropic, April, 2024
Cited by: 21; Year: 2024
Improving policy learning via language dynamics distillation
V Zhong, J Mu, L Zettlemoyer, E Grefenstette, T Rocktäschel
Advances in Neural Information Processing Systems 35, 12504-12515, 2022
Cited by: 21; Year: 2022
Learning to refer informatively by amortizing pragmatic reasoning
J White, J Mu, ND Goodman
arXiv preprint arXiv:2006.00418, 2020
Cited by: 20; Year: 2020
Do we need natural language? Exploring restricted language interfaces for complex domains
J Mu, A Sarkar
Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing …, 2019
Cited by: 17; Year: 2019
Learning outside the box: Discourse-level features improve metaphor identification
J Mu, H Yannakoudakis, E Shutova
arXiv preprint arXiv:1904.02246, 2019
Cited by: 17; Year: 2019
Calibrate your listeners! Robust communication-based training for pragmatic speakers
RE Wang, J White, J Mu, ND Goodman
arXiv preprint arXiv:2110.05422, 2021
Cited by: 8; Year: 2021
The meta-science of adult statistical word segmentation: Part 1
JK Hartshorne, L Skorb, SL Dietz, CR Garcia, GL Iozzo, KE Lamirato, ...
Collabra: Psychology 5 (1), 1, 2019
Cited by: 8*; Year: 2019
Multi-party referential communication in complex strategic games
J Mankewitz, V Boyce, B Waldon, G Loukatou, D Yu, J Mu, ND Goodman, ...
PsyArXiv, 2021
Cited by: 3; Year: 2021
Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems
D Yu, ND Goodman, J Mu
arXiv preprint arXiv:2305.11374, 2023
Cited by: 2; Year: 2023
Evaluating Hierarchies of Verb Argument Structure with Hierarchical Clustering
J Mu, JK Hartshorne, T O’Donnell
Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017
Cited by: 2; Year: 2017
Emergent Covert Signaling in Adversarial Reference Games
D Yu, J Mu, N Goodman
Emergent Communication Workshop at ICLR 2022, 0
Cited by: 2*
Articles 1–20