Consecutive Batch Model Editing with HooK Layers

S Li, Y Deng, D Cai, H Lu, L Chen… - Proceedings of the 2024 …, 2024 - aclanthology.org
As the typical retraining paradigm is unacceptably time-and resource-consuming,
researchers are turning to model editing to find an effective way that supports both …

Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts

T Fu, Y Hou, J McAuley, R Yan - arXiv preprint arXiv:2408.05094, 2024 - arxiv.org
The task of multi-objective alignment aims at balancing and controlling the different
alignment objectives (eg, helpfulness, harmlessness and honesty) of large language models …