Knowledge mechanisms in large language models: A survey and perspective

M Wang, Y Yao, Z Xu, S Qiao, S Deng, P Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Understanding knowledge mechanisms in Large Language Models (LLMs) is crucial for
advancing towards trustworthy AGI. This paper reviews knowledge mechanism analysis …

A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions

O Shorinwa, Z Mei, J Lidard, AZ Ren… - arXiv preprint arXiv …, 2024 - arxiv.org
The remarkable performance of large language models (LLMs) in content generation,
coding, and common-sense reasoning has spurred widespread integration into many facets …

Shared Imagination: LLMs Hallucinate Alike

Y Zhou, C Xiong, S Savarese, CS Wu - arXiv preprint arXiv:2407.16604, 2024 - arxiv.org
Despite the recent proliferation of large language models (LLMs), their training recipes--
model architecture, pre-training data and optimization algorithm--are often very similar. This …

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

V Surkov, C Wendler, M Terekhov… - arXiv preprint arXiv …, 2024 - arxiv.org
Sparse autoencoders (SAEs) have become a core ingredient in the reverse engineering of
large language models (LLMs). For LLMs, they have been shown to decompose …

System 2 reasoning capabilities are nigh

SC Lowe - arXiv preprint arXiv:2410.03662, 2024 - arxiv.org
In recent years, machine learning models have made strides towards human-like reasoning
capabilities from several directions. In this work, we review the current state of the literature …

Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks

J Martinez-Gil - arXiv preprint arXiv:2410.05275, 2024 - arxiv.org
Assessing the degree of similarity of code fragments is crucial for ensuring software quality,
but it remains challenging due to the need to capture the deeper semantic aspects of code …

On the role of knowledge graphs in AI-based scientific discovery

M d'Aquin - Journal of Web Semantics, 2024 - Elsevier
Research and scientific activity are widely seen as an area where the current trends in
AI, namely the development of deep learning models (including large language models), are …

ElastiFormer: Learned Redundancy Reduction in Transformer via Self-Distillation

J Liu, T Liu, Y Sui, S Xia - arXiv preprint arXiv:2411.15281, 2024 - arxiv.org
We introduce ElastiFormer, a post-training technique that adapts pretrained Transformer
models into an elastic counterpart with variable inference time compute. ElastiFormer …

Interpretable Language Modeling via Induction-head Ngram Models

E Kim, S Mantena, W Yang, C Singh, S Yoon… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent large language models (LLMs) have excelled across a wide range of tasks, but their
use in high-stakes and compute-limited settings has intensified the demand for …

CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models

X Wang, J Pan, L Jiang, L Ding, X Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite their impressive capabilities, large language models (LLMs) often lack
interpretability and can generate toxic content. While using LLMs as foundation models and …