A survey of deep learning for mathematical reasoning

P Lu, L Qiu, W Yu, S Welleck, KW Chang - arXiv preprint arXiv:2212.10535, 2022 - arxiv.org
Mathematical reasoning is a fundamental aspect of human intelligence and is applicable in
various fields, including science, engineering, finance, and everyday life. The development …

A survey on text-to-sql parsing: Concepts, methods, and future directions

B Qin, B Hui, L Wang, M Yang, J Li, B Li… - arXiv preprint arXiv …, 2022 - arxiv.org
Text-to-SQL parsing is an essential and challenging task. The goal of text-to-SQL parsing is
to convert a natural language (NL) question to its corresponding structured query language …
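For illustration only (not drawn from the survey itself), a minimal runnable sketch of the text-to-SQL task, with a hypothetical schema, question, and predicted query:

```python
import sqlite3

# Hypothetical illustration of the text-to-SQL task: a natural language
# question is mapped to an SQL query over a known schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, department TEXT, salary REAL)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [("Ada", "Research", 120000.0), ("Grace", "Research", 130000.0), ("Alan", "Sales", 90000.0)],
)

nl_question = "What is the average salary in the Research department?"
# The SQL a text-to-SQL parser would be expected to produce for the question above.
predicted_sql = "SELECT AVG(salary) FROM employees WHERE department = 'Research'"

print(nl_question)
print(conn.execute(predicted_sql).fetchone()[0])  # -> 125000.0
```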

Chameleon: Plug-and-play compositional reasoning with large language models

P Lu, B Peng, H Cheng, M Galley… - Advances in …, 2024 - proceedings.neurips.cc
Large language models (LLMs) have achieved remarkable progress in solving various
natural language processing tasks due to emergent reasoning abilities. However, LLMs …

Lever: Learning to verify language-to-code generation with execution

A Ni, S Iyer, D Radev, V Stoyanov… - International …, 2023 - proceedings.mlr.press
The advent of large language models trained on code (code LLMs) has led to significant
progress in language-to-code generation. State-of-the-art approaches in this area combine …
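For illustration only, a generic sketch of the execution-based filtering idea that verification approaches in this area build on; it is not the LEVER model itself (which learns a verifier over the question, the program, and its execution result), and the candidate programs are invented:

```python
# Generic sketch of execution-guided filtering/reranking of generated code.
# Candidates below are hypothetical; a majority vote over execution results
# stands in for a learned verifier's score.
from collections import Counter

candidates = [
    "result = sum(range(1, 11))",   # correct: 55
    "result = sum(range(10))",      # off by one: 45
    "result = sum(range(1, 11))",   # duplicate of the correct program
    "result = sum(rang(1, 11))",    # typo: raises NameError
]

def execute(program: str):
    """Run a candidate program in an isolated namespace and return its result."""
    env: dict = {}
    try:
        exec(program, env)
        return env.get("result")
    except Exception:
        return None  # execution error -> candidate is rejected

# Keep only candidates that execute, then pick the most common execution result.
results = [r for r in (execute(p) for p in candidates) if r is not None]
best = Counter(results).most_common(1)[0][0]
print(best)  # -> 55
```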

Dynamic prompt learning via policy gradient for semi-structured mathematical reasoning

P Lu, L Qiu, KW Chang, YN Wu, SC Zhu… - arXiv preprint arXiv …, 2022 - arxiv.org
Mathematical reasoning, a core ability of human intelligence, presents unique challenges for
machines in abstract thinking and logical reasoning. Recent large pre-trained language …

Lift: Language-interfaced fine-tuning for non-language machine learning tasks

T Dinh, Y Zeng, R Zhang, Z Lin… - Advances in …, 2022 - proceedings.neurips.cc
Fine-tuning pretrained language models (LMs) without making any architectural changes
has become a norm for learning various language downstream tasks. However, for non …

Large language models are few(1)-shot table reasoners

W Chen - arXiv preprint arXiv:2210.06710, 2022 - arxiv.org
Recent literature has shown that large language models (LLMs) are generally excellent few-
shot reasoners to solve text reasoning tasks. However, the capability of LLMs on table …
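For illustration only, a hypothetical sketch of a few-shot table-reasoning prompt, with the table linearized to text; no model is called and all table contents are invented:

```python
# Hypothetical sketch of a few-shot table-reasoning prompt: the table is
# serialized to text and paired with a question; the resulting string would
# be sent to an LLM (no model call is made here).
header = ["city", "population_millions"]
rows = [["Tokyo", 37.4], ["Delhi", 31.0], ["Shanghai", 27.8]]

def linearize(header, rows):
    lines = [" | ".join(header)]
    lines += [" | ".join(str(c) for c in r) for r in rows]
    return "\n".join(lines)

demonstration = (
    "Table:\ncountry | gold_medals\nUSA | 39\nChina | 38\n"
    "Question: Which country won more gold medals?\nAnswer: USA\n\n"
)
prompt = (
    demonstration
    + "Table:\n" + linearize(header, rows) + "\n"
    + "Question: Which city has the largest population?\nAnswer:"
)
print(prompt)
```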

To repeat or not to repeat: Insights from scaling LLM under token-crisis

F Xue, Y Fu, W Zhou, Z Zheng… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent research has highlighted the importance of dataset size in scaling language models.
However, large language models (LLMs) are notoriously token-hungry during pre-training …

A survey on stance detection for mis- and disinformation identification

M Hardalov, A Arora, P Nakov, I Augenstein - arXiv preprint arXiv …, 2021 - arxiv.org
Understanding attitudes expressed in texts, also known as stance detection, plays an
important role in systems for detecting false information online, be it misinformation …

Tool documentation enables zero-shot tool-usage with large language models

CY Hsieh, SA Chen, CL Li, Y Fujii, A Ratner… - arXiv preprint arXiv …, 2023 - arxiv.org
Today, large language models (LLMs) are taught to use new tools by providing a few
demonstrations of the tool's usage. Unfortunately, demonstrations are hard to acquire, and …
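For illustration only, a hypothetical sketch of a zero-shot prompt that supplies tool documentation instead of usage demonstrations; the tool names and signatures are invented:

```python
# Hypothetical sketch of zero-shot tool use from documentation alone: the
# prompt carries each tool's documentation rather than worked demonstrations.
tool_docs = {
    "search(query: str) -> str": "Returns a short web snippet answering the query.",
    "calculator(expression: str) -> float": "Evaluates an arithmetic expression.",
}

docs_block = "\n".join(f"- {sig}: {desc}" for sig, desc in tool_docs.items())
prompt = (
    "You can call the following tools. Use their documentation to decide which to call.\n"
    f"{docs_block}\n\n"
    "Task: What is 17% of 2,340?\n"
    "Tool call:"
)
print(prompt)  # a zero-shot prompt; no demonstrations of tool usage are included
```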