Dynamic clustering based contextual combinatorial multi-armed bandit for online recommendation | C Yan, H Han, Y Zhang, D Zhu, Y Wan | Knowledge-Based Systems 257, 109927, 2022 | 10 | 2022 |
Small language model can self-correct | H Han, J Liang, J Shi, Q He, Y Xiao | Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18162 …, 2024 | 2 | 2024 |
Two-phase multi-armed bandit for online recommendation | C Yan, H Han, Z Wang, Y Zhang | 2021 IEEE 8th International Conference on Data Science and Advanced …, 2021 | 2 | 2021 |
SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation | Z Luo, H Han, H Zhao, G Jiang, C Du, T Li, J Liang, D Yang, Y Xiao | arXiv preprint arXiv:2405.16552, 2024 | 1 | 2024 |
Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience | H Han, T Li, S Chen, J Shi, C Du, Y Xiao, J Liang, X Lin | arXiv preprint arXiv:2404.10315, 2024 | | 2024 |
Large Language Model Can Continue Evolving From Mistakes | H Zhao, H Han, J Shi, C Du, J Liang, Y Xiao | arXiv preprint arXiv:2404.08707, 2024 | | 2024 |
Enhancing Multi-Behavior Recommendations Through Capturing Dynamic Preferences | C Yan, X Guan, H Han, Z Zhang, Y Zhang | International Journal of Software Engineering and Knowledge Engineering 33 …, 2023 | | 2023 |
Thompson Sampling with Time-Varying Reward for Contextual Bandits | C Yan, H Xu, H Han, Y Zhang, Z Wang | International Conference on Database Systems for Advanced Applications, 54-63, 2023 | | 2023 |
MB-DP: A Multi-behavior Recommendation Model Integrating Dynamic Preferences | C Yan, X Guan, H Han, Z Zhang | SEKE, 608-613, 2023 | | 2023 |