Distilling Reasoning Capabilities into Smaller Language Models | K Shridhar*, A Stolfo*, M Sachan | ACL 2023 (Findings), 7059-7073, 2023 | 125* | 2023 |
A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models | A Stolfo*, Z Jin*, K Shridhar, B Schölkopf, M Sachan | ACL 2023, 2022 | 38 | 2022 |
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis | A Stolfo, Y Belinkov, M Sachan | EMNLP 2023, 2023 | 35 | 2023 |
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models | Y Hou, J Li, Y Fei, A Stolfo, W Zhou, G Zeng, A Bosselut, M Sachan | EMNLP 2023, 2023 | 10 | 2023 |
A Simple Unsupervised Approach for Coreference Resolution using Rule-based Weak Supervision | A Stolfo, C Tanner, V Gupta, M Sachan | Proceedings of the 11th Joint Conference on Lexical and Computational …, 2022 | 6 | 2022 |
Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners? | A Opedal*, A Stolfo*, H Shirakami, Y Jiao, R Cotterell, B Schölkopf, ... | ICML 2024, 2024 | 4 | 2024 |
LongtoNotes: OntoNotes with Longer Coreference Chains | K Shridhar, N Monath, R Thirukovalluru, A Stolfo, M Zaheer, A McCallum, ... | EACL 2023 (Findings), 2022 | 3 | 2022 |
Confidence Regulation Neurons in Language Models | A Stolfo*, B Wu*, W Gurnee, Y Belinkov, X Song, M Sachan, N Nanda | Mechanistic Interpretability Workshop at ICML 2024, 2024 | | 2024 |
Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study | A Stolfo | NAACL 2024 (Findings), 2024 | | 2024 |