Offline prompt polishing for low quality instructions

J Yu, Z Zhou, L Li, L Li, Y Yan, R Xu, Z Lan - Neurocomputing, 2024 - Elsevier
Instruction-tuning is an effective avenue for making large language models (LLMs) better at
following real users' instructions. However, it's challenging in aligning to human preference …