Agieval: A human-centric benchmark for evaluating foundation models W Zhong, R Cui, Y Guo, Y Liang, S Lu, Y Wang, A Saied, W Chen, ... arXiv preprint arXiv:2304.06364, 2023 | 199 | 2023 |
Online continual learning through mutual information maximization Y Guo, B Liu, D Zhao International conference on machine learning, 8109-8126, 2022 | 98 | 2022 |
Adaptive orthogonal projection for batch and online continual learning Y Guo, W Hu, D Zhao, B Liu Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 6783-6791, 2022 | 34 | 2022 |
Learning to program with natural language Y Guo, Y Liang, C Wu, W Wu, D Zhao, N Duan arXiv preprint arXiv:2304.10464, 2023 | 14 | 2023 |
Dealing with cross-task class discrimination in online continual learning Y Guo, B Liu, D Zhao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 12 | 2023 |
AGIEval: A human-centric benchmark for evaluating foundation models. CoRR, abs/2304.06364, 2023. doi: 10.48550 W Zhong, R Cui, Y Guo, Y Liang, S Lu, Y Wang, A Saied, W Chen, ... arXiv preprint arXiv.2304.06364, 0 | 9 | |
Class-incremental learning based on label generation Y Shao, Y Guo, D Zhao, B Liu arXiv preprint arXiv:2306.12619, 2023 | 7 | 2023 |
AGIEval: A human-centric benchmark for evaluating foundation models (2023) W Zhong, R Cui, Y Guo, Y Liang, S Lu, Y Wang, A Saied, W Chen, ... arXiv preprint arXiv:2304.06364, 0 | 6 | |
EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation W You, W Wu, Y Liang, S Mao, C Wu, M Cao, Y Cai, Y Guo, Y Xia, F Wei, ... arXiv preprint arXiv:2310.08185, 2023 | 5 | 2023 |
Pptc benchmark: Evaluating large language models for powerpoint task completion Y Guo, Z Zhang, Y Liang, D Zhao, D Nan arXiv preprint arXiv:2311.01767, 2023 | 4 | 2023 |
Class incremental learning via likelihood ratio based task prediction H Lin, Y Shao, W Qian, N Pan, Y Guo, B Liu arXiv preprint arXiv:2309.15048, 2023 | 3 | 2023 |
Efficient Continual Pre-training by Mitigating the Stability Gap Y Guo, J Fu, H Zhang, D Zhao, Y Shen arXiv preprint arXiv:2406.14833, 2024 | | 2024 |
PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion Z Zhang, Y Guo, Y Liang, D Zhao, N Duan arXiv preprint arXiv:2403.03788, 2024 | | 2024 |
Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast Y Guo, Y Liang, D Zhao, B Liu, D Nan arXiv preprint arXiv:2305.11449, 2023 | | 2023 |
Learning to Plan with Natural Language Y Guo, Y Liang, C Wu, W Wu, D Zhao, N Duan arXiv preprint arXiv:2304.10464, 2023 | | 2023 |