Alex Tamkin: Scholar Profile - Academic Resource Search

Cited by

	All	Since 2019
Citations	4736	4733
h-index	19	19
i10-index	23	23

Citations per year
Year	2019	2020	2021	2022	2023	2024
Citations	18	24	164	692	1866	1945

Public access

4 articles available
0 articles not available

Based on funding mandates

Co-authors

Noah D. Goodman, Stanford University. Verified email at stanford.edu
Deep Ganguli, Anthropic. Verified email at cns.nyu.edu
Emma Brunskill, Associate Professor of Computer Science, Stanford University. Verified email at cs.stanford.edu
Dan Jurafsky, Professor of Linguistics and Computer Science, Stanford University. Verified email at stanford.edu
James Landay, Professor of Computer Science, Stanford University. Verified email at cs.stanford.edu
Christopher Potts, Professor of Linguistics and, by courtesy, of Computer Science. Verified email at stanford.edu
Ignacio Cases, Postdoc at CSAIL, MIT. Verified email at stanford.edu
Christopher Shallue, Center for Astrophysics | Harvard & Smithsonian. Verified email at cfa.harvard.edu

Alex Tamkin

Research Scientist, Anthropic

Verified email at cs.stanford.edu

Machine Learning, Natural Language Processing, Computer Vision


Title	Cited by	Year
On the opportunities and risks of foundation models. R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2021	3496	2021
Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models. A Tamkin, M Brundage, J Clark, D Ganguli. arXiv preprint arXiv:2102.02503, https://arxiv.org/abs/2102.02503, 2021	272	2021
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning. T Bricken, A Templeton, J Batson, B Chen, A Jermyn, T Conerly, ... https://transformer-circuits.pub/2023/monosemantic-features/index.html, 2023	123	2023
Towards measuring the representation of subjective global opinions in language models. E Durmus, K Nguyen, TI Liao, N Schiefer, A Askell, A Bakhtin, C Chen, ... arXiv preprint arXiv:2306.16388, 2023	101	2023
Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy. R Keramati, C Dann, A Tamkin, E Brunskill. Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), 2020	87	2020
Studying large language model generalization with influence functions. R Grosse, J Bae, C Anil, N Elhage, A Tamkin, A Tajdini, B Steiner, D Li, ... arXiv preprint arXiv:2308.03296, 2023	70	2023
Viewmaker Networks: Learning Views for Unsupervised Representation Learning. A Tamkin, M Wu, N Goodman. ICLR 2021, 2020	68	2020
Drone.io: A Gestural and Visual Interface for Human-Drone Interaction. JR Cauchard, A Tamkin, CY Wang, L Vink, M Park, T Fang, JA Landay. 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI …, 2019	57	2019
Investigating transferability in pretrained language models. A Tamkin, T Singh, D Giovanardi, N Goodman. Findings of EMNLP 2020, 2020	44	2020
Scaling monosemanticity: Extracting interpretable features from claude 3 sonnet. A Templeton, T Conerly, J Marcus, J Lindsey, T Bricken, B Chen, ... Transformer Circuits Thread, 2024	41	2024
Language Through a Prism: A Spectral Approach for Multiscale Language Representations. A Tamkin, D Jurafsky, N Goodman. NeurIPS 2020, 2020	38	2020
Active Learning Helps Pretrained Models Learn the Intended Task. A Tamkin, D Nguyen, S Deshpande, J Mu, N Goodman. NeurIPS 2022, 2022	36	2022
DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning. A Tamkin, V Liu, R Lu, D Fein, C Schultz, N Goodman. NeurIPS 2021, 2021	36	2021
Distributionally-Aware Exploration for CVaR Bandits. A Tamkin, R Keramati, C Dann, E Brunskill. NeurIPS 2019 Workshop on Safety and Robustness in Decision Making, 2019	36	2019
Recursive Routing Networks: Learning to Compose Modules for Language Understanding. I Cases, C Rosenbaum, M Riemer, A Geiger, T Klinger, A Tamkin, O Li, ... NAACL 2019, 2019	29	2019
Evaluating and mitigating discrimination in language model decisions. A Tamkin, A Askell, L Lovitt, E Durmus, N Joseph, S Kravec, K Nguyen, ... arXiv preprint arXiv:2312.03689, 2023	27	2023
C5t5: Controllable generation of organic molecules with transformers. D Rothchild, A Tamkin, J Yu, U Misra, J Gonzalez. arXiv preprint arXiv:2108.10307, 2021	26	2021
Eliciting human preferences with language models. BZ Li, A Tamkin, N Goodman, J Andreas. arXiv preprint arXiv:2310.11589, 2023	24	2023
Task Ambiguity in Humans and Language Models. A Tamkin, K Handa, A Shrestha, N Goodman. ICLR 2023, 2023	23	2023
Many-shot jailbreaking. C Anil, E Durmus, M Sharma, J Benton, S Kundu, J Batson, N Rimsky, ... Anthropic, April, 2024	18	2024
