Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective G Feng, Y Gu, B Zhang, H Ye, D He, L Wang Advances in Neural Information Processing Systems (Oral), 2023 | 70 | 2023 |
A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests B Zhang, G Feng, Y Du, D He, L Wang ICML, 2023 | 34 | 2023 |
DPO Meets PPO: Reinforced Token Optimization for RLHF H Zhong, G Feng, W Xiong, L Zhao, D He, J Bian, L Wang arXiv preprint arXiv:2404.18922, 2024 | 9 | 2024 |
Do Efficient Transformers Really Save Computation? K Yang, J Ackermann, Z He, G Feng, B Zhang, Y Feng, Q Ye, D He, ... arXiv preprint arXiv:2402.13934, 2024 | 5 | 2024 |
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation Z He, G Feng, S Luo, K Yang, D He, J Xu, Z Zhang, H Yang, L Wang arXiv preprint arXiv:2401.16421, 2024 | | 2024 |
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity G Feng, H Zhong arXiv preprint arXiv:2312.17248, 2023 | | 2023 |