Adversarial deep averaging networks for cross-lingual sentiment classification X Chen, Y Sun, B Athiwaratkun, C Cardie, K Weinberger Transactions of the Association for Computational Linguistics 6, 557-570, 2018 | 348 | 2018 |
There are many consistent explanations of unlabeled data: Why you should average B Athiwaratkun, M Finzi, P Izmailov, AG Wilson arXiv preprint arXiv:1806.05594, 2018 | 292* | 2018 |
Malware classification with LSTM and GRU language models and a character-level CNN B Athiwaratkun, JW Stokes 2017 IEEE international conference on acoustics, speech and signal …, 2017 | 292 | 2017 |
Structured prediction as translation between augmented natural languages G Paolini, B Athiwaratkun, J Krone, J Ma, A Achille, R Anubhai, ... arXiv preprint arXiv:2101.05779, 2021 | 261 | 2021 |
Probabilistic fasttext for multi-sense word embeddings B Athiwaratkun, AG Wilson, A Anandkumar arXiv preprint arXiv:1806.02901, 2018 | 190 | 2018 |
Multimodal word distributions B Athiwaratkun, AG Wilson arXiv preprint arXiv:1704.08424, 2017 | 124 | 2017 |
Multi-lingual evaluation of code generation models B Athiwaratkun, SK Gouda, Z Wang, X Li, Y Tian, M Tan, WU Ahmad, ... arXiv preprint arXiv:2210.14868, 2022 | 65 | 2022 |
Hierarchical density order embeddings B Athiwaratkun, AG Wilson arXiv preprint arXiv:1804.09843, 2018 | 65 | 2018 |
Augmented natural language for generative sequence labeling B Athiwaratkun, CN Santos, J Krone, B Xiang arXiv preprint arXiv:2009.13272, 2020 | 56 | 2020 |
Improving stability in deep reinforcement learning with weight averaging E Nikishin, P Izmailov, B Athiwaratkun, D Podoprikhin, T Garipov, ... Uncertainty in artificial intelligence workshop on uncertainty in Deep learning, 2018 | 49 | 2018 |
Multi-lingual evaluation of code generation models B Athiwaratkun, SK Gouda, Z Wang, X Li, Y Tian, M Tan, WU Ahmad, ..., Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang 2022 | 21 | 2022 |
Towards greener yet powerful code generation via quantization: An empirical study X Wei, SK Gonugondla, S Wang, W Ahmad, B Ray, H Qian, X Li, V Kumar, ... Proceedings of the 31st ACM Joint European Software Engineering Conference …, 2023 | 5* | 2023 |
Generative context pair selection for multi-hop question answering D Dua, CN Santos, P Ng, B Athiwaratkun, B Xiang, M Gardner, S Singh arXiv preprint arXiv:2104.08744, 2021 | 5 | 2021 |
Infinite symmetric ergodic index and related examples in infinite measure I Loh, C Silva, B Athiwaratkun arXiv preprint arXiv:1702.01455, 2017 | 3 | 2017 |
Mixture-of-Agents Enhances Large Language Model Capabilities J Wang, J Wang, B Athiwaratkun, C Zhang, J Zou arXiv preprint arXiv:2406.04692, 2024 | 1 | 2024 |
Programmatically generating evaluation data sets for code generation models P Athiwaratkun, Z Lin, R Keerthi, Z Wang, T Yuchen, H Ding, SRA Bontala, ... US Patent App. 17/847,113, 2023 | 1 | 2023 |
On IO-efficient attention mechanisms: Context-aware bifurcated attention and the generalized multi-group attention B Athiwaratkun, SK Gonugondla, SK Gouda, H Qian, H Ding, Q Sun, ... Workshop on Efficient Systems for Foundation Models @ ICML 2023, 2023 | 1 | 2023 |
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies J Wang, S Jain, D Zhang, B Ray, V Kumar, B Athiwaratkun arXiv preprint arXiv:2406.06461, 2024 | | 2024 |
Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model K Chen, R Thapa, R Chalamala, B Athiwaratkun, SL Song, J Zou arXiv preprint arXiv:2406.00977, 2024 | | 2024 |
Bifurcated attention for single-context large-batch sampling B Athiwaratkun, SK Gonugondla, SK Gouda, H Qian, H Ding, Q Sun, ... arXiv preprint arXiv:2403.08845, 2024 | | 2024 |