When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models
This paper studies in-context learning (ICL) by decomposing the output of large language
models into the individual contributions of attention heads and MLPs (components). We …
Learned feature representations are biased by complexity, learning order, position, and more
Representation learning, and interpreting learned representations, are key areas of focus in
machine learning and neuroscience. Both fields generally use representations as a means …