GLUE: A multi-task benchmark and analysis platform for natural language understanding A Wang, A Singh, J Michael, F Hill, O Levy, SR Bowman Proceedings of the 7th annual International Conference on Learning …, 2019 | 6977 | 2019 |
Superglue: A stickier benchmark for general-purpose language understanding systems A Wang, Y Pruksachatkun, N Nangia, A Singh, J Michael, F Hill, O Levy, ... Advances in neural information processing systems 32, 2019 | 2115 | 2019 |
Supervised open information extraction G Stanovsky, J Michael, L Zettlemoyer, I Dagan Proceedings of the 2018 Conference of the North American Chapter of the …, 2018 | 354 | 2018 |
AmbigQA: Answering ambiguous open-domain questions S Min, J Michael, H Hajishirzi, L Zettlemoyer arXiv preprint arXiv:2004.10645, 2020 | 219 | 2020 |
Language models don't always say what they think: unfaithful explanations in chain-of-thought prompting M Turpin, J Michael, E Perez, S Bowman Advances in Neural Information Processing Systems 36, 2024 | 205 | 2024 |
Large-scale QA-SRL parsing N FitzGerald, J Michael, L He, L Zettlemoyer Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018 | 115 | 2018 |
Crowdsourcing question-answer meaning representations J Michael, G Stanovsky, L He, I Dagan, L Zettlemoyer Proceedings of the 16th Annual Conference of the North American Chapter of …, 2018 | 88 | 2018 |
Prompting contrastive explanations for commonsense reasoning tasks B Paranjape, J Michael, M Ghazvininejad, L Zettlemoyer, H Hajishirzi arXiv preprint arXiv:2106.06823, 2021 | 70 | 2021 |
The winograd schema challenge and reasoning about correlation D Bailey, AJ Harrison, Y Lierler, V Lifschitz, J Michael 2015 AAAI Spring Symposium Series, 2015 | 58 | 2015 |
Gpqa: A graduate-level google-proof q&a benchmark D Rein, BL Hou, AC Stickland, J Petty, RY Pang, J Dirani, J Michael, ... arXiv preprint arXiv:2311.12022, 2023 | 51 | 2023 |
Controlled crowdsourcing for high-quality QA-SRL annotation P Roit, A Klein, D Stepanov, J Mamou, J Michael, G Stanovsky, ... arXiv preprint arXiv:1911.03243, 2019 | 50 | 2019 |
We're afraid language models aren't modeling ambiguity A Liu, Z Wu, J Michael, A Suhr, P West, A Koller, S Swayamdipta, ... arXiv preprint arXiv:2304.14399, 2023 | 46 | 2023 |
Asking without telling: Exploring latent ontologies in contextual representations J Michael, JA Botha, I Tenney arXiv preprint arXiv:2004.14513, 2020 | 41 | 2020 |
kNN-Prompt: Nearest Neighbor Zero-Shot Inference W Shi, J Michael, S Gururangan, L Zettlemoyer arXiv preprint arXiv:2205.13792, 2022 | 40 | 2022 |
Human-in-the-loop parsing L He, J Michael, M Lewis, L Zettlemoyer Proceedings of the 2016 Conference on Empirical Methods in Natural Language …, 2016 | 39 | 2016 |
Asking it all: Generating contextualized questions for any semantic role V Pyatkin, P Roit, J Michael, R Tsarfaty, Y Goldberg, I Dagan arXiv preprint arXiv:2109.04832, 2021 | 37 | 2021 |
What do nlp researchers believe? results of the nlp community metasurvey J Michael, A Holtzman, A Parrish, A Mueller, A Wang, A Chen, D Madaan, ... arXiv preprint arXiv:2208.12852, 2022 | 25 | 2022 |
Debate helps supervise unreliable experts J Michael, S Mahdi, D Rein, J Petty, J Dirani, V Padmakumar, ... arXiv preprint arXiv:2311.08702, 2023 | 12 | 2023 |
Eliciting language model behaviors using reverse language models J Pfau, A Infanger, A Sheshadri, A Panda, J Michael, C Huebner Socially Responsible Language Modelling Research, 2023 | 6 | 2023 |
Inducing semantic roles without syntax J Michael, L Zettlemoyer Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021 | 5 | 2021 |