A new algorithm for non-stationary contextual bandits: Efficient, optimal and parameter-free Y Chen, CW Lee, H Luo, CY Wei Conference on Learning Theory, 696-726, 2019 | 140* | 2019 |
Fair contextual multi-armed bandits: Theory and experiments Y Chen, A Cuellar, H Luo, J Modi, H Nemlekar, S Nikolaidis Conference on Uncertainty in Artificial Intelligence, 181-190, 2020 | 63 | 2020 |
Reward-free rl is no harder than reward-aware rl in linear markov decision processes AJ Wagenmaker, Y Chen, M Simchowitz, S Du, K Jamieson International Conference on Machine Learning, 22430-22456, 2022 | 54 | 2022 |
Multi-armed bandits with fairness constraints for distributing resources to human teammates H Claure, Y Chen, J Modi, M Jung, S Nikolaidis Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot …, 2020 | 48 | 2020 |
First-order regret in reinforcement learning with linear function approximation: A robust estimation approach AJ Wagenmaker, Y Chen, M Simchowitz, S Du, K Jamieson International Conference on Machine Learning, 22384-22429, 2022 | 30 | 2022 |
Improved corruption robust algorithms for episodic reinforcement learning Y Chen, S Du, K Jamieson International Conference on Machine Learning, 1561-1570, 2021 | 28 | 2021 |
Online and bandit algorithms for nonstationary stochastic saddle-point optimization A Roy, Y Chen, K Balasubramanian, P Mohapatra arXiv preprint arXiv:1912.01698, 2019 | 24 | 2019 |
Active multi-task representation learning Y Chen, K Jamieson, S Du International Conference on Machine Learning, 3271-3298, 2022 | 11 | 2022 |
Improved active multi-task representation learning via lasso Y Wang, Y Chen, K Jamieson, SS Du International Conference on Machine Learning, 35548-35578, 2023 | 8 | 2023 |
Corruption robust active learning Y Chen, SS Du, KG Jamieson Advances in Neural Information Processing Systems 34, 29643-29654, 2021 | 8 | 2021 |
More practical and adaptive algorithms for online quantum state learning Y Chen, X Wang arXiv preprint arXiv:2006.01013, 2020 | 6 | 2020 |
Causal bandits: Online decision-making in endogenous settings J Zhang, Y Chen, A Singh arXiv preprint arXiv:2211.08649, 2022 | 5 | 2022 |
Labelbench: A comprehensive framework for benchmarking adaptive label-efficient learning J Zhang, Y Chen, G Canal, AM Das, G Bhatt, S Mussmann, Y Zhu, ... Journal of Data-centric Machine Learning Research, 2024 | 4* | 2024 |
Active representation learning for general task space with applications in robotics Y Chen, Y Huang, SS Du, KG Jamieson, G Shi Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |
Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning Y Wang, Y Chen, W Yan, K Jamieson, SS Du arXiv preprint arXiv:2402.02055, 2024 | 1 | 2024 |
An experimental design framework for label-efficient supervised finetuning of large language models G Bhatt, Y Chen, AM Das, J Zhang, ST Truong, S Mussmann, Y Zhu, ... arXiv preprint arXiv:2401.06692, 2024 | 1 | 2024 |
Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler Y Chen, K Sankararaman, A Lazaric, M Pirotta, D Karamshuk, Q Wang, ... arXiv preprint arXiv:2211.02233, 2022 | | 2022 |
A Deep Bayesian Bandits Approach for Anticancer Therapy: Exploration via Functional Prior M Lu, Y Chen, SI Lee arXiv preprint arXiv:2205.02944, 2022 | | 2022 |