Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán Proceedings of the 16th Conference of the European Chapter of the …, 2021 | 316 | 2021 |
Unsupervised quality estimation for neural machine translation M Fomicheva, S Sun, L Yankovskaya, F Blain, F Guzmán, M Fishel, ... Transactions of the Association for Computational Linguistics 8, 539-555, 2020 | 138 | 2020 |
Cross-lingual learning-to-rank with shared representations S Sasaki, S Sun, S Schamoni, K Duh, K Inui Proceedings of the 2018 Conference of the North American Chapter of the …, 2018 | 61 | 2018 |
MLQE-PE: A multilingual quality estimation and post-editing dataset M Fomicheva, S Sun, E Fonseca, C Zerva, F Blain, V Chaudhary, ... arXiv preprint arXiv:2010.04480, 2020 | 55 | 2020 |
CLIRMatrix: A Massively Large Collection of Bilingual and Multilingual Datasets for Cross-Lingual Information Retrieval S Sun, K Duh Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 48 | 2020 |
Are we estimating or guesstimating translation quality? S Sun, F Guzmán, L Specia Proceedings of the 58th annual meeting of the association for computational …, 2020 | 27 | 2020 |
BERGAMOT-LATTE submissions for the WMT20 quality estimation shared task M Fomicheva, S Sun, L Yankovskaya, F Blain, V Chaudhary, M Fishel, ... Association for Computational Linguistics, 2020 | 25 | 2020 |
Collecting verified COVID-19 question answer pairs A Poliak, M Fleming, C Costello, K Murray, M Yarmohammadi, S Pandya, ... Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020, 2020 | 18 | 2020 |
AfriCLIRMatrix: Enabling cross-lingual information retrieval for african languages O Ogundepo, X Zhang, S Sun, K Duh, J Lin Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 12 | 2022 |
Modeling document interactions for learning to rank with regularized self-attention S Sun, K Duh arXiv preprint arXiv:2005.03932, 2020 | 8 | 2020 |
Clireval: Evaluating machine translation as a cross-lingual information retrieval task S Sun, S Sia, K Duh Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | 7 | 2020 |
Battle of the Large Language Models: Dolly vs LLaMA vs Vicuna vs Guanaco vs Bard vs ChatGPT--A Text-to-SQL Parsing Comparison S Sun, Y Zhang, J Yan, Y Gao, D Ong, B Chen, J Su arXiv preprint arXiv:2310.10190, 2023 | 6 | 2023 |
An exploratory study on multilingual quality estimation S Sun, M Fomicheva, F Blain, V Chaudhary, A El-Kishky, A Renduchintala, ... Proceedings of the 1st Conference of the Asia-Pacific Chapter of the …, 2020 | 5 | 2020 |
Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia. arXiv cs H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán CL, 1907 | 5 | 1907 |
An analysis of bert faq retrieval models for covid-19 infobot S Sun, J Sedoc | 4 | 2020 |
Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications S Sun, A El-Kishky, V Chaudhary, J Cross, F Guzmán, L Specia Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 2 | 2021 |
Interface display method and apparatus, device, and storage medium S Sun US Patent App. 18/197,207, 2023 | 1 | 2023 |
End-to-end Gated Self-attentive Memory Network for Dialog Response Selection S Sun, YC Tam, J Cao, C Yan, Z Fu, C Niu, J Zhou The 7th Dialog System Technology Challenge (DSTC7), 2019 | 1 | 2019 |
言語横断的情報検索の大規模データセットとパラメータ共有モデル 佐々木翔大, S Sun, S Schamoni, K Duh, 乾健太郎 言語処理学会第 24 回年次大会 3, 2018 | 1 | 2018 |
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages H Lovenia, R Mahendra, SM Akbar, LJV Miranda, J Santoso, E Aco, ... arXiv preprint arXiv:2406.10118, 2024 | | 2024 |