Bidirectional Attention Flow for Machine Comprehension M Seo, A Kembhavi, A Farhadi, H Hajishirzi ICLR, 2017 | 2373 | 2017 |
Self-instruct: Aligning language models with self-generated instructions Y Wang, Y Kordi, S Mishra, A Liu, NA Smith, D Khashabi, H Hajishirzi arXiv preprint arXiv:2212.10560, 2022 | 1196 | 2022 |
Espnet: Efficient spatial pyramid of dilated convolutions for semantic segmentation S Mehta, M Rastegari, A Caspi, L Shapiro, H Hajishirzi Proceedings of the european conference on computer vision (ECCV), 552-568, 2018 | 965 | 2018 |
Rethinking the role of demonstrations: What makes in-context learning work? S Min, X Lyu, A Holtzman, M Artetxe, M Lewis, H Hajishirzi, L Zettlemoyer arXiv preprint arXiv:2202.12837, 2022 | 923 | 2022 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 875 | 2022 |
Multi-task identification of entities, relations, and coreference for scientific knowledge graph construction Y Luan, L He, M Ostendorf, H Hajishirzi arXiv preprint arXiv:1808.09602, 2018 | 717 | 2018 |
Unifiedqa: Crossing format boundaries with a single qa system D Khashabi, S Min, T Khot, A Sabharwal, O Tafjord, P Clark, H Hajishirzi arXiv preprint arXiv:2005.00700, 2020 | 649* | 2020 |
Entity, relation, and event extraction with contextualized span representations D Wadden, U Wennberg, Y Luan, H Hajishirzi arXiv preprint arXiv:1909.03546, 2019 | 645 | 2019 |
Fine-tuning pretrained language models: Weight initializations, data orders, and early stopping J Dodge, G Ilharco, R Schwartz, A Farhadi, H Hajishirzi, N Smith arXiv preprint arXiv:2002.06305, 2020 | 585 | 2020 |
Cross-task generalization via natural language crowdsourcing instructions S Mishra, D Khashabi, C Baral, H Hajishirzi arXiv preprint arXiv:2104.08773, 2021 | 533 | 2021 |
ESPNetv2: A Light-weight, Power Efficient, and General Purpose Convolutional Neural Network S Mehta, M Rastegari, L Shapiro, H Hajishirzi arXiv preprint arXiv:1811.11431, 2018 | 522 | 2018 |
Robust fine-tuning of zero-shot models M Wortsman, G Ilharco, JW Kim, M Li, S Kornblith, R Roelofs, RG Lopes, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 489 | 2022 |
Super-naturalinstructions: Generalization via declarative instructions on 1600+ nlp tasks Y Wang, S Mishra, P Alipoormolabashi, Y Kordi, A Mirzaei, A Arunkumar, ... arXiv preprint arXiv:2204.07705, 2022 | 470* | 2022 |
Evaluating models' local decision boundaries via contrast sets M Gardner, Y Artzi, V Basmova, J Berant, B Bogin, S Chen, P Dasigi, ... arXiv preprint arXiv:2004.02709, 2020 | 452 | 2020 |
Openclip G Ilharco, M Wortsman, R Wightman, C Gordon, N Carlini, R Taori, ... | 432* | 2021 |
Mathqa: Towards interpretable math word problem solving with operation-based formalisms A Amini, S Gabriel, P Lin, R Koncel-Kedziorski, Y Choi, H Hajishirzi arXiv preprint arXiv:1905.13319, 2019 | 385 | 2019 |
Fact or fiction: Verifying scientific claims D Wadden, S Lin, K Lo, LL Wang, M van Zuylen, A Cohan, H Hajishirzi arXiv preprint arXiv:2004.14974, 2020 | 378 | 2020 |
A General Framework for Information Extraction using Dynamic Span Graphs Y Luan, D Wadden, L He, A Shah, M Ostendorf, H Hajishirzi | 377 | 2019 |
Learning to solve arithmetic word problems with verb categorization MJ Hosseini, H Hajishirzi, O Etzioni, N Kushman Proceedings of the 2014 Conference on Empirical Methods in Natural Language …, 2014 | 368 | 2014 |
Text generation from knowledge graphs with graph transformers R Koncel-Kedziorski, D Bekal, Y Luan, M Lapata, H Hajishirzi arXiv preprint arXiv:1904.02342, 2019 | 363 | 2019 |