关注
Aaquib Syed
标题
引用次数
引用次数
年份
Attribution patching outperforms automated circuit discovery
A Syed, C Rager, A Conmy
arXiv preprint arXiv:2310.10348, 2023
162023
Machine learning with textural analysis of longitudinal multiparametric MRI and molecular subtypes accurately predicts pathologic complete response in patients with invasive …
A Syed, R Adam, T Ren, J Lu, T Maldjian, TQ Duong
PloS one 18 (1), e0280320, 2023
132023
Prune and tune: Improving efficient pruning techniques for massive language models
A Syed, PH Guo, V Sundarapandiyan
82023
Refusal in Language Models Is Mediated by a Single Direction
A Arditi, O Obeso, A Syed, D Paleka, N Rimsky, W Gurnee, N Nanda
arXiv preprint arXiv:2406.11717, 2024
32024
Robust Unlearning via Mechanistic Localizations
PH Guo, A Syed, A Sheshadri, A Ewart, GK Dziugaite
ICML 2024 Workshop on Mechanistic Interpretability, 0
系统目前无法执行此操作,请稍后再试。
文章 1–5