Towards faithful model explanation in NLP: A survey
End-to-end neural Natural Language Processing (NLP) models are notoriously difficult to
understand. This has given rise to numerous efforts towards model explainability in recent …
Grammar-constrained decoding for structured NLP tasks without finetuning
Despite their impressive performance, large language models (LMs) still struggle with
reliably generating complex output structures when not finetuned to follow the required …
Gluecons: A generic benchmark for learning under constraints
Recent research has shown that integrating domain knowledge into deep learning
architectures is effective: it helps reduce the amount of required data, improves the accuracy …
Deal: Decoding-time alignment for large language models
Large Language Models (LLMs) are now expected to generate content aligned with
human preferences. Current work focuses on alignment at model training time, through …
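The smallest version of decoding-time (rather than training-time) alignment is reward-guided selection among sampled candidates, i.e. best-of-n. The candidates and the "politeness" reward below are toy assumptions standing in for an LLM and a reward model; the paper's own method is a heuristic-guided search, not this exact procedure.

```python
# Minimal decoding-time alignment sketch: rather than retraining the model,
# draw several candidate continuations and keep the one an alignment
# reward prefers (best-of-n selection).

def decode_aligned(candidates, reward):
    """Return the candidate the reward function ranks highest."""
    return max(candidates, key=reward)

def reward(text):
    # Hypothetical alignment criterion: count polite markers.
    lowered = text.lower()
    return lowered.count("please") + lowered.count("thank")

samples = [
    "Give me the file.",
    "Could you please share the file? Thank you!",
    "Send it now.",
]
print(decode_aligned(samples, reward))  # prints the polite candidate
```

Because the base model is untouched, the alignment criterion can be swapped at inference time without any retraining.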
Global constraints with prompting for zero-shot event argument classification
Determining the role of event arguments is a crucial subtask of event extraction. Most
previous supervised models leverage costly annotations, which is not practical for open …
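A global constraint in argument classification couples per-argument decisions that a zero-shot prompt would otherwise make independently. The sketch below enforces one such constraint, "each unique role is filled at most once", by greedy assignment over confidence scores; the event, the scores, and the constraint itself are illustrative assumptions, not the paper's constraint set.

```python
# Applying a global constraint across per-argument predictions: greedily
# commit the highest-confidence (argument, role) pairs while forbidding
# reuse of roles declared unique. Scores are toy stand-ins for prompted
# LLM confidences.

def assign_roles(scores, unique_roles):
    """scores: {(arg, role): confidence}. Returns {arg: role}."""
    assignment, used = {}, set()
    for (arg, role), _ in sorted(scores.items(), key=lambda kv: -kv[1]):
        if arg in assignment:
            continue                      # each argument gets one role
        if role in unique_roles and role in used:
            continue                      # global constraint: role already taken
        assignment[arg] = role
        used.add(role)
    return assignment

scores = {
    ("troops", "attacker"): 0.9,
    ("militants", "attacker"): 0.8,
    ("militants", "target"): 0.6,
}
print(assign_roles(scores, unique_roles={"attacker"}))
# {'troops': 'attacker', 'militants': 'target'}
```

Without the constraint, both arguments would independently be labeled "attacker"; the global view forces the weaker candidate onto its next-best role.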
Constraint reasoning embedded structured prediction
Many real-world structured prediction problems need machine learning to capture data
distribution and constraint reasoning to ensure structure validity. Nevertheless, constrained …
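The tension the snippet describes, learned scores versus structure validity, shows up in miniature in BIO tagging, where "I-X" may only continue a span of the same label. The paper embeds constraint reasoning inside learning; the sketch below deliberately does something simpler, a post-hoc repair, just to make the validity constraint concrete.

```python
# Post-hoc constraint repair for BIO tag sequences: an "I-X" tag that does
# not continue a B-X/I-X span is rewritten as "B-X", so every output is a
# structurally valid tagging. This repair heuristic is an illustration,
# not the paper's method.

def repair_bio(tags):
    fixed, prev = [], "O"
    for tag in tags:
        if tag.startswith("I-"):
            label = tag[2:]
            if prev not in ("B-" + label, "I-" + label):
                tag = "B-" + label        # illegal continuation -> open a new span
        fixed.append(tag)
        prev = tag
    return fixed

print(repair_bio(["O", "I-PER", "I-PER", "O", "I-LOC"]))
# ['O', 'B-PER', 'I-PER', 'O', 'B-LOC']
```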
Long horizon forecasting with temporal point processes
In recent years, marked temporal point processes (MTPPs) have emerged as a powerful
modeling machinery to characterize asynchronous events in a wide variety of applications …
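Rolling an MTPP forward over a long horizon is commonly done with Ogata-style thinning: sample candidate arrival times from an upper-bounding rate and accept each with probability intensity/bound. The exponential-kernel Hawkes intensity and all parameter values below are standard textbook choices, assumed for illustration rather than taken from the paper.

```python
import math
import random

# Ogata-style thinning for a Hawkes process with exponential kernel:
# lambda(t) = mu + alpha * sum_i exp(-beta * (t - t_i)) over past events.
# Between events the intensity only decays, so lambda just after the
# current time (plus alpha, covering a jump at the current time) is a
# valid local upper bound.

def hawkes_intensity(t, history, mu=0.5, alpha=0.8, beta=1.5):
    return mu + alpha * sum(math.exp(-beta * (t - ti)) for ti in history if ti < t)

def simulate(horizon, mu=0.5, alpha=0.8, beta=1.5, seed=0):
    rng = random.Random(seed)
    t, events = 0.0, []
    while t < horizon:
        bound = hawkes_intensity(t, events, mu, alpha, beta) + alpha
        t += rng.expovariate(bound)               # candidate next arrival
        if t < horizon and rng.random() < hawkes_intensity(t, events, mu, alpha, beta) / bound:
            events.append(t)                      # accepted event
    return events

events = simulate(horizon=50.0)
print(len(events), "events; increasing:", events == sorted(events))
```

Long-horizon forecasting with such models amounts to repeating this rollout many times and aggregating the sampled trajectories.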
Automata-based constraints for language model decoding
T Koo, F Liu, L He - arXiv preprint arXiv:2407.08103, 2024 - arxiv.org
LMs are often expected to generate strings in some formal language; for example, structured
data, API calls, or code snippets. Although LMs can be tuned to improve their adherence to …
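A central move in automata-based constrained decoding is lifting a character-level automaton to the token level: run each multi-character vocabulary token through the character DFA to decide, per DFA state, which tokens keep the output inside the formal language. The DFA (decimal digit strings) and the subword vocabulary below are toy assumptions.

```python
# Lifting a character-level DFA to token level: a token is legal in a
# state iff stepping the DFA through all of its characters never hits a
# dead state. DFA and vocabulary here are illustrative.

def step_char(state, ch):
    """Character DFA for digit strings: any digit is fine, anything else is dead."""
    return 1 if ch.isdigit() else None

def step_token(state, token):
    """Run a whole (multi-character) token through the character DFA."""
    for ch in token:
        state = step_char(state, ch)
        if state is None:
            return None                   # token would leave the language
    return state

VOCAB = ["1", "23", "4a", "x", "07"]

def allowed_tokens(state):
    return [t for t in VOCAB if step_token(t and state, t) is not None]

print(allowed_tokens(0))  # prints ['1', '23', '07']
```

Precomputing these per-state token sets once makes the per-step decoding mask a table lookup rather than a per-token scan.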
Unleashing the true potential of sequence-to-sequence models for sequence tagging and structure parsing
Sequence-to-Sequence (S2S) models have achieved remarkable success on
various text generation tasks. However, learning complex structures with S2S models …
Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models
AD McCarthy, H Zhang, S Kumar… - Findings of the …, 2023 - aclanthology.org
One challenge in speech translation is that plenty of spoken content is long-form, but short
units are necessary for obtaining high-quality translations. To address this mismatch, we …
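The segmentation-by-constrained-decoding idea can be sketched in a few lines: the only legal outputs are the source words in order, optionally interleaved with a break symbol, so decoding can decide only *where* to segment, never what to say. The punctuation-based break rule below is a toy stand-in (an assumption) for the LLM's scored choices.

```python
# Finite-state segmentation constraint in miniature: the output must copy
# the transcript words in order; the decoder's only free choice at each
# boundary is whether to emit a break symbol.

BRK = "<brk>"

def segment(words, wants_break):
    out = []
    for i, w in enumerate(words):
        out.append(w)                      # constraint: must copy next source word
        if i + 1 < len(words) and wants_break(w):
            out.append(BRK)                # the model's only free decision
    return out

transcript = "so we tried it . it worked well".split()
print(segment(transcript, lambda w: w in {".", "?", "!"}))
# ['so', 'we', 'tried', 'it', '.', '<brk>', 'it', 'worked', 'well']
```

Because every legal output is a copy of the input plus break markers, the constraint guarantees the segmentation never alters the transcript that downstream translation consumes.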