Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in Cognitive …, 2024 - cell.com
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

A comprehensive overview of knowledge graph completion

T Shen, F Zhang, J Cheng - Knowledge-Based Systems, 2022 - Elsevier
Knowledge Graph (KG) provides high-quality structured knowledge for various
downstream knowledge-aware tasks (such as recommendation and intelligent question …

Safe latent diffusion: Mitigating inappropriate degeneration in diffusion models

P Schramowski, M Brack… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-conditioned image generation models have recently achieved astonishing results in
image quality and text alignment and are consequently employed in a fast-growing number …

MERLOT Reserve: Neural script knowledge through vision and language and sound

R Zellers, J Lu, X Lu, Y Yu, Y Zhao… - Proceedings of the …, 2022 - openaccess.thecvf.com
As humans, we navigate a multimodal world, building a holistic understanding from all our
senses. We introduce MERLOT Reserve, a model that represents videos jointly over time …

MERLOT: Multimodal neural script knowledge models

R Zellers, X Lu, J Hessel, Y Yu… - Advances in neural …, 2021 - proceedings.neurips.cc
As humans, we understand events in the visual world contextually, performing multimodal
reasoning across time to make inferences about the past, present, and future. We introduce …

WinoGrande: An adversarial Winograd schema challenge at scale

K Sakaguchi, RL Bras, C Bhagavatula… - Communications of the …, 2021 - dl.acm.org
Commonsense reasoning remains a major challenge in AI, and yet, recent progresses on
benchmarks may seem to suggest otherwise. In particular, the recent neural language …

HellaSwag: Can a machine really finish your sentence?

R Zellers, A Holtzman, Y Bisk, A Farhadi… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent work by Zellers et al. (2018) introduced a new task of commonsense natural
language inference: given an event description such as "A woman sits at a piano," a …

ChatGPT is a knowledgeable but inexperienced solver: An investigation of commonsense problem in large language models

N Bian, X Han, L Sun, H Lin, Y Lu, B He, S Jiang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have made significant progress in NLP. However, their
ability to memorize, represent, and leverage commonsense knowledge has been a well …

Clever Hans or neural theory of mind? Stress testing social reasoning in large language models

N Shapira, M Levy, SH Alavi, X Zhou, Y Choi… - arXiv preprint arXiv …, 2023 - arxiv.org
The escalating debate on AI's capabilities warrants developing reliable metrics to assess
machine "intelligence". Recently, many anecdotal examples were used to suggest that …

CommonsenseQA: A question answering challenge targeting commonsense knowledge

A Talmor, J Herzig, N Lourie, J Berant - arXiv preprint arXiv:1811.00937, 2018 - arxiv.org
When answering a question, people often draw upon their rich world knowledge in addition
to the particular context. Recent work has focused primarily on answering questions given …