Data augmentation approaches in natural language processing: A survey

B Li, Y Hou, W Che - AI Open, 2022 - Elsevier
As an effective strategy, data augmentation (DA) alleviates data scarcity scenarios where
deep learning techniques may fail. It is widely applied in computer vision and then introduced to …

Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

Z Lin, S Guan, W Zhang, H Zhang, Y Li… - Artificial Intelligence …, 2024 - Springer
Recently, large language models (LLMs) have attracted considerable attention due to their
remarkable capabilities. However, LLMs' generation of biased or hallucinatory content …

Making language models better reasoners with step-aware verifier

Y Li, Z Lin, S Zhang, Q Fu, B Chen… - Proceedings of the …, 2023 - aclanthology.org
Few-shot learning is a challenging task that requires language models to generalize from
limited examples. Large language models like GPT-3 and PaLM have made impressive …

Active prompting with chain-of-thought for large language models

S Diao, P Wang, Y Lin, R Pan, X Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
The increasing scale of large language models (LLMs) brings emergent abilities to various
complex tasks requiring reasoning, such as arithmetic and commonsense reasoning. It is …

A survey of data augmentation approaches for NLP

SY Feng, V Gangal, J Wei, S Chandar… - arXiv preprint arXiv …, 2021 - arxiv.org
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …

Measuring and improving consistency in pretrained language models

Y Elazar, N Kassner, S Ravfogel… - Transactions of the …, 2021 - direct.mit.edu
Consistency of a model—that is, the invariance of its behavior under meaning-preserving
alternations in its input—is a highly desirable property in natural language processing. In …

Boosting language models reasoning with chain-of-knowledge prompting

J Wang, Q Sun, X Li, M Gao - arXiv preprint arXiv:2306.06427, 2023 - arxiv.org
Recently, Chain-of-Thought (CoT) prompting has delivered success on complex reasoning
tasks, which aims at designing a simple prompt like "Let's think step by step" or multiple in …

Consistency analysis of ChatGPT

ME Jang, T Lukasiewicz - arXiv preprint arXiv:2303.06273, 2023 - arxiv.org
ChatGPT has gained huge popularity since its introduction. Its positive aspects have been
reported through many media platforms, and some analyses even showed that ChatGPT …

Mutant: A training paradigm for out-of-distribution generalization in visual question answering

T Gokhale, P Banerjee, C Baral, Y Yang - arXiv preprint arXiv:2009.08566, 2020 - arxiv.org
While progress has been made on the visual question answering leaderboards, models
often utilize spurious correlations and priors in datasets under the i.i.d. setting. As such …

Reasoning like program executors

X Pi, Q Liu, B Chen, M Ziyadi, Z Lin, Q Fu, Y Gao… - arXiv preprint arXiv …, 2022 - arxiv.org
Reasoning over natural language is a long-standing goal for the research community.
However, studies have shown that existing language models are inadequate in reasoning …