Language model behavior: A comprehensive survey
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …
generated text is often surprising even to NLP researchers. In this survey, we discuss over …
Testing the general deductive reasoning capacity of large language models using ood examples
Given the intractably large size of the space of proofs, any model that is capable of general
deductive reasoning must generalize to proofs of greater complexity. Recent studies have …
deductive reasoning must generalize to proofs of greater complexity. Recent studies have …
How Do In-Context Examples Affect Compositional Generalization?
Compositional generalization--understanding unseen combinations of seen primitives--is an
essential reasoning capability in human intelligence. The AI community mainly studies this …
essential reasoning capability in human intelligence. The AI community mainly studies this …
Instruct me more! random prompting for visual in-context learning
Large-scale models trained on extensive datasets, have emerged as the preferred approach
due to their high generalizability across various tasks. In-context learning (ICL), a popular …
due to their high generalizability across various tasks. In-context learning (ICL), a popular …
How capable can a transformer become? a study on synthetic, interpretable tasks
Transformers trained on huge text corpora exhibit a remarkable set of capabilities, eg,
performing simple logical operations. Given the inherent compositional nature of language …
performing simple logical operations. Given the inherent compositional nature of language …
Leveraging code to improve in-context learning for semantic parsing
In-context learning (ICL) is an appealing approach for semantic parsing due to its few-shot
nature and improved generalization. However, learning to parse to rare domain-specific …
nature and improved generalization. However, learning to parse to rare domain-specific …
The validity of evaluation results: Assessing concurrence across compositionality benchmarks
NLP models have progressed drastically in recent years, according to numerous datasets
proposed to evaluate performance. Questions remain, however, about how particular …
proposed to evaluate performance. Questions remain, however, about how particular …
Adapt and Decompose: Efficient Generalization of Text-to-SQL via Domain Adapted Least-To-Most Prompting
Cross-domain and cross-compositional generalization of Text-to-SQL semantic parsing is a
challenging task. Existing Large Language Model (LLM) based solutions rely on inference …
challenging task. Existing Large Language Model (LLM) based solutions rely on inference …
Power-up! what can generative models do for human computation workflows?
We are amidst an explosion of artificial intelligence research, particularly around large
language models (LLMs). These models have a range of applications across domains like …
language models (LLMs). These models have a range of applications across domains like …
Attention as a Hypernetwork
Transformers can under some circumstances generalize to novel problem instances whose
constituent parts might have been encountered during training but whose compositions …
constituent parts might have been encountered during training but whose compositions …