From anecdotal evidence to quantitative evaluation methods: A systematic review on evaluating explainable AI
The rising popularity of explainable artificial intelligence (XAI) to understand high-performing
black boxes raised the question of how to evaluate explanations of machine learning (ML) …
Towards human-centered explainable AI: A survey of user studies for model explanations
Explainable AI (XAI) is widely viewed as a sine qua non for ever-expanding AI research. A
better understanding of the needs of XAI users, as well as human-centered evaluations of …
Challenging big-bench tasks and whether chain-of-thought can solve them
BIG-Bench (Srivastava et al., 2022) is a diverse evaluation suite that focuses on tasks
believed to be beyond the capabilities of current language models. Language models have …
Chain-of-thought prompting elicits reasoning in large language models
We explore how generating a chain of thought---a series of intermediate reasoning steps---
significantly improves the ability of large language models to perform complex reasoning. In …
Can language models learn from explanations in context?
Language Models (LMs) can perform new tasks by adapting to a few in-context examples.
For humans, explanations that connect examples to task principles can improve learning …
STaR: Bootstrapping reasoning with reasoning
Generating step-by-step "chain-of-thought" rationales improves language model
performance on complex reasoning tasks like mathematics or commonsense question …
Self-evaluation guided beam search for reasoning
Breaking down a problem into intermediate steps has demonstrated impressive
performance in Large Language Model (LLM) reasoning. However, the growth of the …
Symbolic chain-of-thought distillation: Small models can also "think" step-by-step
Chain-of-thought prompting (e.g., "Let's think step-by-step") primes large language models to
verbalize rationalization for their predictions. While chain-of-thought can lead to dramatic …
When can models learn from explanations? A formal framework for understanding the roles of explanation data
Many methods now exist for conditioning model outputs on task instructions, retrieved
documents, and user-provided explanations and feedback. Rather than relying solely on …
Improved logical reasoning of language models via differentiable symbolic programming
Pre-trained large language models (LMs) struggle to perform logical reasoning reliably
despite advances in scale and compositionality. In this work, we tackle this challenge …