NusaCrowd: Open source initiative for Indonesian NLP resources S Cahyawijaya, H Lovenia, AF Aji, G Winata, B Wilie, F Koto, R Mahendra, ... Findings of the Association for Computational Linguistics: ACL 2023, 13745-13818, 2023 | 1047 | 2023 |
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP F Koto, A Rahimi, JH Lau, T Baldwin Proceedings of the 28th COLING 2020, 757-770, 2020 | 197 | 2020 |
InSet lexicon: Evaluation of a word list for Indonesian sentiment analysis in microblogs F Koto, GY Rahmaningtyas 2017 International Conference on Asian Language Processing (IALP), 391-394, 2017 | 128 | 2017 |
CMMLU: Measuring Massive Multitask Language Understanding in Chinese H Li, Y Zhang, F Koto, Y Yang, H Zhao, Y Gong, N Duan, T Baldwin Findings of ACL 2024, 2024 | 86 | 2024 |
A comparative study on Twitter sentiment analysis: Which features are good? F Koto, M Adriani Proceedings of the 20th NLDB 2015, 453-457, 2015 | 72 | 2015 |
SMOTE-Out, SMOTE-Cosine, and Selected-SMOTE: An Enhancement Strategy to Handle Imbalance in Data Level F Koto The 6th ICACSIS, 2014 | 63 | 2014 |
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization F Koto, JH Lau, T Baldwin Proceedings of EMNLP 2021, 2021 | 57 | 2021 |
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia AF Aji, GI Winata, F Koto, S Cahyawijaya, A Romadhony, R Mahendra, ... Proceedings of ACL 2022, 2022 | 52 | 2022 |
NusaX: Multilingual parallel sentiment dataset for 10 Indonesian local languages GI Winata, AF Aji, S Cahyawijaya, R Mahendra, F Koto, A Romadhony, ... Proceedings of the 17th EACL 2023, 2022 | 47 | 2022 |
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models N Sengupta, SK Sahu, B Jia, S Katipomu, H Li, F Koto, OM Afzal, ... Technical Report, 2023 | 42 | 2023 |
Discourse Probing of Pretrained Language Models F Koto, JH Lau, T Baldwin Proceedings of NAACL 2021, 2021 | 42 | 2021 |
Bactrian-X: Multilingual replicable instruction-following models with low-rank adaptation H Li, F Koto, M Wu, AF Aji, T Baldwin arXiv preprint arXiv:2305.15011, 2023 | 38 | 2023 |
Apparatus and method for sharing personal electronic-data of health A Kurniawan, O Abdillah, Fajri US Patent App. 15/221,140, 2017 | 36* | 2017 |
Liputan6: A Large-scale Indonesian Dataset for Text Summarization F Koto, JH Lau, T Baldwin Proceedings of AACL 2020, 2020 | 34 | 2020 |
Top-down Discourse Parsing via Sequence Labelling F Koto, JH Lau, T Baldwin Proceedings of the 16th EACL 2021, 2021 | 31 | 2021 |
FFCI: A framework for interpretable automatic evaluation of summarization F Koto, T Baldwin, JH Lau Journal of Artificial Intelligence Research (JAIR) 73, 1553–1607, 2022 | 29 | 2022 |
LLM360: Towards fully transparent open-source LLMs Z Liu, A Qiao, W Neiswanger, H Wang, B Tan, T Tao, J Li, Y Wang, S Sun, ... arXiv preprint arXiv:2312.06550, 2023 | 23 | 2023 |
A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization F Koto The 10th International Conference on Language Resources and Evaluation (LREC), 2016 | 23 | 2016 |
HBE: Hashtag-based emotion lexicons for Twitter sentiment analysis F Koto, M Adriani Proceedings of the 7th Forum for Information Retrieval Evaluation, 31-34, 2015 | 23 | 2015 |
The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis F Koto, M Adriani The 29th International Conference on Advanced Information Networking and …, 2015 | 17 | 2015 |