Towards faithful model explanation in nlp: A survey

Q Lyu, M Apidianaki, C Callison-Burch - Computational Linguistics, 2024 - direct.mit.edu
End-to-end neural Natural Language Processing (NLP) models are notoriously difficult to
understand. This has given rise to numerous efforts towards model explainability in recent …

Grammar-constrained decoding for structured NLP tasks without finetuning

S Geng, M Josifoski, M Peyrard, R West - arXiv preprint arXiv:2305.13971, 2023 - arxiv.org
Despite their impressive performance, large language models (LMs) still struggle with
reliably generating complex output structures when not finetuned to follow the required …

Gluecons: A generic benchmark for learning under constraints

HR Faghihi, A Nafar, C Zheng, R Mirzaee… - Proceedings of the …, 2023 - ojs.aaai.org
Recent research has shown that integrating domain knowledge into deep learning
architectures is effective; It helps reduce the amount of required data, improves the accuracy …

Deal: Decoding-time alignment for large language models

JY Huang, S Sengupta, D Bonadiman, Y Lai… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) are nowadays expected to generate content aligned with
human preferences. Current work focuses on alignment at model training time, through …

Global constraints with prompting for zero-shot event argument classification

Z Lin, H Zhang, Y Song - arXiv preprint arXiv:2302.04459, 2023 - arxiv.org
Determining the role of event arguments is a crucial subtask of event extraction. Most
previous supervised models leverage costly annotations, which is not practical for open …

Constraint reasoning embedded structured prediction

N Jiang, M Zhang, WJ Van Hoeve, Y Xue - Journal of machine learning …, 2022 - jmlr.org
Many real-world structured prediction problems need machine learning to capture data
distribution and constraint reasoning to ensure structure validity. Nevertheless, constrained …

Long horizon forecasting with temporal point processes

P Deshpande, K Marathe, A De… - Proceedings of the 14th …, 2021 - dl.acm.org
In recent years, marked temporal point processes (MTPPs) have emerged as a powerful
modeling machinery to characterize asynchronous events in a wide variety of applications …

Automata-based constraints for language model decoding

T Koo, F Liu, L He - arXiv preprint arXiv:2407.08103, 2024 - arxiv.org
LMs are often expected to generate strings in some formal language; for example, structured
data, API calls, or code snippets. Although LMs can be tuned to improve their adherence to …

[HTML][HTML] Unleashing the true potential of sequence-to-sequence models for sequence tagging and structure parsing

H He, JD Choi - Transactions of the Association for Computational …, 2023 - direct.mit.edu
Abstract Sequence-to-Sequence (S2S) models have achieved remarkable success on
various text generation tasks. However, learning complex structures with S2S models …

Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models

AD McCarthy, H Zhang, S Kumar… - Findings of the …, 2023 - aclanthology.org
One challenge in speech translation is that plenty of spoken content is long-form, but short
units are necessary for obtaining high-quality translations. To address this mismatch, we …