On the convergence and sample efficiency of variance-reduced policy gradient method J Zhang, C Ni, C Szepesvari, M Wang Advances in Neural Information Processing Systems 34, 2228-2240, 2021 | 63 | 2021 |
Learning to control in metric space with optimal regret C Ni, LF Yang, M Wang 2019 57th Annual Allerton Conference on Communication, Control, and …, 2019 | 30 | 2019 |
Reward-directed conditional diffusion: Provable distribution estimation and reward improvement H Yuan, K Huang, C Ni, M Chen, M Wang Advances in Neural Information Processing Systems 36, 2024 | 22 | 2024 |
Representation learning for low-rank general-sum markov games C Ni, Y Song, X Zhang, Z Ding, C Jin, M Wang The Eleventh International Conference on Learning Representations, 2023 | 18* | 2023 |
Off-policy fitted q-evaluation with differentiable function approximators: Z-estimation and inference theory R Zhang, X Zhang, C Ni, M Wang International Conference on Machine Learning, 26713-26749, 2022 | 18 | 2022 |
Learning good state and action representations for Markov decision process via tensor decomposition C Ni, Y Duan, M Dahleh, M Wang, AR Zhang Journal of Machine Learning Research 24 (115), 1-53, 2023 | 10* | 2023 |
Diffusion model for data-driven black-box optimization Z Li, H Yuan, K Huang, C Ni, Y Ye, M Chen, M Wang arXiv preprint arXiv:2403.13219, 2024 | 5 | 2024 |
Optimal estimation of policy gradient via double fitted iteration C Ni, R Zhang, X Ji, X Zhang, M Wang International Conference on Machine Learning, 16724-16783, 2022 | 5* | 2022 |
Maximum likelihood tensor decomposition of Markov decision process C Ni, M Wang 2019 IEEE International Symposium on Information Theory (ISIT), 3062-3066, 2019 | 3 | 2019 |
Bandit theory and thompson sampling-guided directed evolution for sequence optimization H Yuan, C Ni, H Wang, X Zhang, L Cong, C Szepesvári, M Wang Advances in Neural Information Processing Systems 35, 38291-38304, 2022 | 2 | 2022 |
Cell2State: Learning Cell State Representations From Barcoded Single-Cell Gene-Expression Transitions Y Wu, JC Kim, C Ni, L Cong, M Wang | | |