D-separation for causal self-explanation

Y Zhao, Z Wang, X Li, J Liang, R Li - Proceedings of the 62nd …, 2024 - aclanthology.org

Most existing rationalization approaches are susceptible to degeneration accumulation due
to a lack of effective control over the learning direction of the model during training. To …

Cited by 1 · Related articles · All 3 versions

MARE: Multi-aspect rationale extractor on unsupervised rationale extraction

H Jiang, J Duan, Z Qu, J Wang - arXiv preprint arXiv:2410.03531, 2024 - arxiv.org

Unsupervised rationale extraction aims to extract text snippets to support model predictions
without explicit rationale annotation. Researchers have made many efforts to solve this task …

Cited by 1 · Related articles · All 3 versions

Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization

W Liu, Z Deng, Z Niu, J Wang, H Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

An important line of research in the field of explainability is to extract a small subset of crucial
rationales from the full input. The most widely used criterion for rationale extraction is the …

A Unified Causal View of Instruction Tuning

L Chen, W Huang, R Zhang, W Chen, J Guo… - arXiv preprint arXiv …, 2024 - arxiv.org

Instruction tuning on a mixture of tasks has improved zero-shot capabilities in natural
language processing (NLP). Nevertheless, existing methods often learn features that exhibit …

Cited by 1 · Related articles · All 3 versions

Interlocking-free Selective Rationalization Through Genetic-based Learning

F Ruggeri, G Signorelli - arXiv preprint arXiv:2412.10312, 2024 - arxiv.org

A popular end-to-end architecture for selective rationalization is the select-then-predict
pipeline, comprising a generator to extract highlights fed to a predictor. Such a cooperative …

Adversarial Attack for Explanation Robustness of Rationalization Models

Y Zhang, L Kong, H Wang, R Li, J Wang… - arXiv preprint arXiv …, 2024 - ebooks.iospress.nl

Rationalization models, which select a subset of input text as rationale—crucial for humans
to understand and trust predictions—have recently emerged as a prominent research area …