SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations

C Gao, R Chen, S Yuan, K Huang, Y Yu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have attracted significant attention in recommendation
systems. Current LLM-based recommender systems primarily rely on supervised fine-tuning …

Multi-Response Preference Optimization with Augmented Ranking Dataset

H Gwon, I Ahn, YH Kim, S Park, TJ Jun - arXiv preprint arXiv:2412.07812, 2024 - arxiv.org
Recent advancements in Large Language Models (LLMs) have been remarkable, with new
models consistently surpassing their predecessors. These advancements are underpinned …