Towards explainable evaluation metrics for machine translation

C Leiter, P Lertvittayakumjorn, M Fomicheva… - Journal of Machine …, 2024 - jmlr.org
Unlike classical lexical overlap metrics such as BLEU, most current evaluation metrics for
machine translation (for example, COMET or BERTScore) are based on black-box large …

Recent Developments on Accountability and Explainability for Complex Reasoning Tasks

P Atanasova - Accountable and Explainable Methods for Complex …, 2024 - Springer
This chapter delves into the recent accountability tools tailored for the evolving landscape of
machine learning models for complex reasoning tasks. With the increasing integration of …

Towards explainable evaluation metrics for natural language generation

C Leiter, P Lertvittayakumjorn, M Fomicheva… - arXiv preprint arXiv …, 2022 - arxiv.org
Unlike classical lexical overlap metrics such as BLEU, most current evaluation metrics (such
as BERTScore or MoverScore) are based on black-box language models such as BERT or …

A comparison of explainable artificial intelligence methods in the phase classification of multi-principal element alloys

K Lee, MV Ayyasamy, Y Ji, PV Balachandran - Scientific Reports, 2022 - nature.com
We demonstrate the capabilities of two model-agnostic local post-hoc model interpretability
methods, namely breakDown (BD) and shapley (SHAP), to explain the predictions of a black …

Thermostat: A large collection of NLP model explanations and analysis tools

N Feldhus, R Schwarzenberg, S Möller - arXiv preprint arXiv:2108.13961, 2021 - arxiv.org
In the language domain, as in other domains, neural explainability takes an ever more
important role, with feature attribution methods on the forefront. Many such methods require …

Improve interpretability of neural networks via sparse contrastive coding

J Liu, Y Lin, L Jiang, J Liu, Z Wen… - Findings of the …, 2022 - aclanthology.org
Although explainable artificial intelligence (XAI) has achieved remarkable developments in
recent years, there are few efforts have been devoted to the following problems, namely, i) …

Local explanation of dialogue response generation

YL Tuan, C Pryor, W Chen, L Getoor… - Advances in Neural …, 2021 - proceedings.neurips.cc
In comparison to the interpretation of classification models, the explanation of sequence
generation models is also an important problem, however it has seen little attention. In this …

REV: information-theoretic evaluation of free-text rationales

H Chen, F Brahman, X Ren, Y Ji, Y Choi… - arXiv preprint arXiv …, 2022 - arxiv.org
Generating free-text rationales is a promising step towards explainable NLP, yet evaluating
such rationales remains a challenge. Existing metrics have mostly focused on measuring the …

A multi-facet analysis of BERT-based entity matching models

M Paganelli, D Tiano, F Guerra - The VLDB Journal, 2023 - Springer
State-of-the-art Entity Matching approaches rely on transformer architectures, such as BERT,
for generating highly contextualized embeddings of terms. The embeddings are then used to …

Explaining interactions between text spans

SR Choudhury, P Atanasova, I Augenstein - arXiv preprint arXiv …, 2023 - arxiv.org
Reasoning over spans of tokens from different parts of the input is essential for natural
language understanding (NLU) tasks such as fact-checking (FC), machine reading …