A survey on text-to-sql parsing: Concepts, methods, and future directions

B Qin, B Hui, L Wang, M Yang, J Li, B Li… - arXiv preprint arXiv …, 2022 - arxiv.org
Text-to-SQL parsing is an essential and challenging task. The goal of text-to-SQL parsing is
to convert a natural language (NL) question to its corresponding structured query language …

Can llm already serve as a database interface? a big bench for large-scale database grounded text-to-sqls

J Li, B Hui, G Qu, J Yang, B Li, B Li… - Advances in …, 2024 - proceedings.neurips.cc
Text-to-SQL parsing, which aims at converting natural language instructions into executable
SQLs, has gained increasing attention in recent years. In particular, GPT-4 and Claude-2 …

A survey of data augmentation approaches for NLP

SY Feng, V Gangal, J Wei, S Chandar… - arXiv preprint arXiv …, 2021 - arxiv.org
Data augmentation has recently seen increased interest in NLP due to more work in low-
resource domains, new tasks, and the popularity of large-scale neural networks that require …

Graphix-t5: Mixing pre-trained transformers with graph-aware layers for text-to-sql parsing

J Li, B Hui, R Cheng, B Qin, C Ma, N Huo… - Proceedings of the …, 2023 - ojs.aaai.org
The task of text-to-SQL parsing, which aims at converting natural language questions into
executable SQL queries, has garnered increasing attention in recent years. One of the major …

TAPEX: Table pre-training via learning a neural SQL executor

Q Liu, B Chen, J Guo, M Ziyadi, Z Lin, W Chen… - arXiv preprint arXiv …, 2021 - arxiv.org
Recent progress in language model pre-training has achieved a great success via
leveraging large-scale unstructured textual data. However, it is still a challenge to apply pre …

LGESQL: line graph enhanced text-to-SQL model with mixed local and non-local relations

R Cao, L Chen, Z Chen, Y Zhao, S Zhu, K Yu - arXiv preprint arXiv …, 2021 - arxiv.org
This work aims to tackle the challenging heterogeneous graph encoding problem in the text-
to-SQL task. Previous methods are typically node-centric and merely utilize different weight …

Rasat: Integrating relational structures into pretrained seq2seq model for text-to-sql

J Qi, J Tang, Z He, X Wan, Y Cheng, C Zhou… - arXiv preprint arXiv …, 2022 - arxiv.org
Relational structures such as schema linking and schema encoding have been validated as
a key component to qualitatively translating natural language into SQL queries. However …

Learning contextual representations for semantic parsing with generation-augmented pre-training

P Shi, P Ng, Z Wang, H Zhu, AH Li, J Wang… - Proceedings of the …, 2021 - ojs.aaai.org
Most recently, there has been significant interest in learning contextual representations for
various NLP tasks, by leveraging large scale text corpora to train powerful language models …

Natural SQL: Making SQL easier to infer from natural language specifications

Y Gan, X Chen, J Xie, M Purver, JR Woodward… - arXiv preprint arXiv …, 2021 - arxiv.org
Addressing the mismatch between natural language descriptions and the corresponding
SQL queries is a key challenge for text-to-SQL translation. To bridge this gap, we propose …

Improving compositional generalization with latent structure and data augmentation

L Qiu, P Shaw, P Pasupat, PK Nowak, T Linzen… - arXiv preprint arXiv …, 2021 - arxiv.org
Generic unstructured neural networks have been shown to struggle on out-of-distribution
compositional generalization. Compositional data augmentation via example recombination …