Unveiling the generalization power of fine-tuned large language models

H Yang, Y Zhang, J Xu, H Lu, PA Heng… - arXiv preprint arXiv …, 2024 - arxiv.org
While Large Language Models (LLMs) have demonstrated exceptional multitasking abilities,
fine-tuning these models on downstream, domain-specific datasets is often necessary to …

Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

Q Guo, R Wang, J Guo, X Tan, J Bian… - Findings of the …, 2024 - aclanthology.org
While large language models (LLMs) have achieved impressive performance across diverse
tasks, recent studies showcase that causal LLMs suffer from the “reversal curse”. It is a …