Cmmlu: Measuring massive multitask language understanding in chinese H Li, Y Zhang, F Koto, Y Yang, H Zhao, Y Gong, N Duan, T Baldwin arXiv preprint arXiv:2306.09212, 2023 | 97 | 2023 |
A framework for few-shot language model evaluation L Gao, J Tow, B Abbasi, S Biderman, S Black, A DiPofi, C Foster, ... URL https://zenodo. org/records/10256836 7, 2023 | 48* | 2023 |
Jais and jais-chat: Arabic-centric foundation and instruction-tuned open generative large language models N Sengupta, SK Sahu, B Jia, S Katipomu, H Li, F Koto, OM Afzal, ... arXiv preprint arXiv:2308.16149, 2023 | 43 | 2023 |
Do-not-answer: Evaluating safeguards in LLMs Y Wang, H Li, X Han, P Nakov, T Baldwin Findings of the Association for Computational Linguistics: EACL 2024, 896-911, 2024 | 41* | 2024 |
MultiSpanQA: A dataset for multi-span question answering H Li, M Tomko, M Vasardani, T Baldwin Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 40 | 2022 |
Bactrian-x: Multilingual replicable instruction-following models with low-rank adaptation H Li, F Koto, M Wu, AF Aji, T Baldwin arXiv preprint arXiv:2305.15011, 2023 | 39 | 2023 |
Neural character-level dependency parsing for Chinese H Li, Z Zhang, Y Ju, H Zhao Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 33 | 2018 |
Place questions and human-generated answers: A data analysis approach E Hamzei, H Li, M Vasardani, T Baldwin, S Winter, M Tomko Geospatial Technologies for Local and Regional Development: Proceedings of …, 2020 | 28 | 2020 |
Llm360: Towards fully transparent open-source llms Z Liu, A Qiao, W Neiswanger, H Wang, B Tan, T Tao, J Li, Y Wang, S Sun, ... arXiv preprint arXiv:2312.06550, 2023 | 27 | 2023 |
Kfcnet: Knowledge filtering and contrastive learning network for generative commonsense reasoning H Li, Y Gong, J Jiao, R Zhang, T Baldwin, N Duan arXiv preprint arXiv:2109.06704, 2021 | 24 | 2021 |
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis S Fan, C Lin, H Li, Z Lin, J Su, H Zhang, Y Gong, J Guo, N Duan EMNLP 2022, 2022 | 22 | 2022 |
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU F Koto, N Aisyah, H Li, T Baldwin EMNLP 2023, 2023 | 13 | 2023 |
Target word masking for location metonymy resolution H Li, M Vasardani, M Tomko, T Baldwin arXiv preprint arXiv:2010.16097, 2020 | 13 | 2020 |
The semantics of place-related questions W Kuhn, E Hamzei, M Tomko, S Winter, H Li Journal of Spatial Information Science, 157-168, 2021 | 12 | 2021 |
UniMelb at SemEval-2019 Task 12: Multi-model combination for toponym resolution H Li, M Wang, T Baldwin, M Tomko, M Vasardani Proceedings of the 13th International Workshop on Semantic Evaluation, 1313-1318, 2019 | 11 | 2019 |
Neural factoid geospatial question answering H Li, E Hamzei, I Majic, H Hua, J Renz, M Tomko, M Vasardani, S Winter, ... Journal of Spatial Information Science, 65-90, 2021 | 10 | 2021 |
Fact-checking the output of large language models via token-level uncertainty quantification E Fadeeva, A Rubashevskii, A Shelmanov, S Petrakov, H Li, H Mubarak, ... arXiv preprint arXiv:2403.04696, 2024 | 6 | 2024 |
Can Large Language Model Comprehend Ancient Chinese? A Preliminary Test on ACLUE Y Zhang, H Li Ancient Language Processing Workshop, 2023, 2023 | 6 | 2023 |
Arabicmmlu: Assessing massive multitask language understanding in arabic F Koto, H Li, S Shatnawi, J Doughman, AB Sadallah, A Alraeesi, ... arXiv preprint arXiv:2402.12840, 2024 | 4 | 2024 |
Lessons from the Trenches on Reproducible Evaluation of Language Models S Biderman, H Schoelkopf, L Sutawika, L Gao, J Tow, B Abbasi, AF Aji, ... arXiv preprint arXiv:2405.14782, 2024 | 3 | 2024 |