S3eval: A synthetic, scalable, systematic evaluation suite for large language models F Lei*, Q Liu*, Y Huang*, S He, J Zhao, K Liu NAACL 2024, 2023 | 11 | 2023 |
S HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering F Lei, X Li, Y Wei, S He, Y Huang, J Zhao, K Liu ACL 2023, 2023 | 10 | 2023 |
Key-point-driven data synthesis with its enhancement on mathematical reasoning Y Huang, X Liu, Y Gong, Z Gou, Y Shen, N Duan, W Chen arXiv preprint arXiv:2403.02333, 2024 | 8 | 2024 |
Competition-level problems are effective llm evaluators Y Huang*, Z Lin*, X Liu, Y Gong, S Lu, F Lei, Y Liang, Y Shen, C Lin, ... ACL 2024, 2023 | 6 | 2023 |
TableQAKit: A Comprehensive and Practical Toolkit for Table-based Question Answering F Lei, T Luo, P Yang, W Liu, H Liu, J Lei, Y Huang, Y Wei, S He, J Zhao, ... arXiv preprint arXiv:2310.15075, 2023 | 5 | 2023 |
Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent X Yu, T Luo, Y Wei, F Lei, Y Huang, P Hao, L Zhu arXiv preprint arXiv:2402.13717, 2024 | 4 | 2024 |
Adfa: Attention-augmented differentiable top-k feature adaptation for unsupervised medical anomaly detection Y Huang, G Liu, Y Luo, G Yang ICIP 2023, 2023 | 2 | 2023 |
Spatial and Planar Consistency for Semi-Supervised Volumetric Medical Image Segmentation Y Zhou, Y Huang, G Yang BMVC 2023, 2023 | | 2023 |