Xiezhi: An ever-updating benchmark for holistic domain knowledge evaluation Z Gu, X Zhu, H Ye, L Zhang, J Wang, Y Zhu, S Jiang, Z Xiong, Z Li, W Wu, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18099 …, 2024 | 32 | 2024 |
Can Large Language Models Understand Real-World Complex Instructions? Q He, J Zeng, W Huang, L Chen, J Xiao, Q He, X Zhou, J Liang, Y Xiao Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18188 …, 2024 | 29 | 2024 |
Knowledgpt: Enhancing large language models with retrieval and storage access on knowledge bases X Wang, Q Yang, Y Qiu, J Liang, Q He, Z Gu, Y Xiao, W Wang arXiv preprint arXiv:2308.11761, 2023 | 26 | 2023 |
Parsing natural language into propositional and first-order logic with dual reinforcement learning X Lu, J Liu, Z Gu, H Tong, C Xie, J Huang, Y Xiao, W Wang Proceedings of the 29th International Conference on Computational …, 2022 | 14 | 2022 |
Learning what you need from what you did: Product taxonomy expansion with user behaviors supervision S Cheng, Z Gu, B Liu, R Xie, W Wu, Y Xiao 2022 IEEE 38th International Conference on Data Engineering (ICDE), 3280-3293, 2022 | 5 | 2022 |
Sem4SAP: Synonymous Expression Mining from Open Knowledge Graph for Language Model Synonym-Aware Pretraining Z Gu, S Jiang, W Huang, J Liang, H Feng, Y Xiao arXiv preprint arXiv:2303.14425, 2023 | 4 | 2023 |
DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence? Z Gu, L Zhang, X Zhu, J Chen, W Huang, Y Zhang, S Wang, Z Ye, Y Gao, ... arXiv preprint arXiv:2406.12641, 2024 | 2* | 2024 |
Agent Group Chat: An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior Z Gu, X Zhu, H Guo, L Zhang, Y Cai, H Shen, J Chen, Z Ye, Y Dai, Y Gao, ... arXiv preprint arXiv:2403.13433, 2024 | 2 | 2024 |
The missing piece in model editing: A deep dive into the hidden damage brought by model editing J Wang, Z Gu, Z Xiong, H Feng, Y Xiao arXiv preprint arXiv:2403.07825, 2024 | 2 | 2024 |
ConcEPT: Concept-Enhanced Pre-Training for Language Models X Wang, Z Gu, J Liang, D Lu, Y Xiao, W Wang arXiv preprint arXiv:2401.05669, 2024 | 2 | 2024 |
Evaluation of phase-adjusted interventions for COVID-19 using an improved SEIR model H Jiang, Z Gu, H Liu, J Huang, Z Wang, Y Xiong, Y Tong, J Yin, F Jiang, ... Epidemiology & Infection 152, e9, 2024 | 2 | 2024 |
VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It X Zhu, Z Gu, S Jiang, Z Li, H Feng, Y Xiao arXiv preprint arXiv:2407.12005, 2024 | | 2024 |
StructBench: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding Z Gu, H Ye, Z Zhou, H Feng, Y Xiao arXiv preprint arXiv:2406.10621, 2024 | | 2024 |
GANTEE: Generative Adversarial Network for Taxonomy Enterance Evaluation Z Gu, S Jiang, J Liu, Y Xiao, H Feng, Z Li, J Liang, Z Jian Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 6380-6388, 2023 | | 2023 |
我爱我家——上海石库门 顾洲洪 现代语文: 中旬. 教学研究, 17-17, 2011 | | 2011 |