Consecutive Batch Model Editing with HooK Layers
As the typical retraining paradigm is unacceptably time- and resource-consuming,
researchers are turning to model editing to find an effective way that supports both …
Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts
The task of multi-objective alignment aims at balancing and controlling the different
alignment objectives (e.g., helpfulness, harmlessness and honesty) of large language models …