On the transformations across reward model, parameter update, and in-context prompt

文章

学术资源搜索

获得 2 条结果（用时0.03秒）

我的图书馆

On the transformations across reward model, parameter update, and in-context prompt

在引用文章中搜索

[PDF] aclanthology.org

Consecutive Batch Model Editing with HooK Layers

S Li, Y Deng, D Cai, H Lu, L Chen… - Proceedings of the 2024 …, 2024 - aclanthology.org

As the typical retraining paradigm is unacceptably time-and resource-consuming,
researchers are turning to model editing to find an effective way that supports both …

Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts

T Fu, Y Hou, J McAuley, R Yan - arXiv preprint arXiv:2408.05094, 2024 - arxiv.org

The task of multi-objective alignment aims at balancing and controlling the different
alignment objectives (eg, helpfulness, harmlessness and honesty) of large language models …