Unifying large language models and knowledge graphs: A roadmap

S Pan, L Luo, Y Wang, C Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Large language models (LLMs), such as ChatGPT and GPT4, are making new waves in the
field of natural language processing and artificial intelligence, due to their emergent ability …

Autoregressive search engines: Generating substrings as document identifiers

M Bevilacqua, G Ottaviano, P Lewis… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Knowledge-intensive language tasks require NLP systems to both provide the
correct answer and retrieve supporting evidence for it in a given corpus. Autoregressive …

Editing factual knowledge in language models

N De Cao, W Aziz, I Titov - arXiv preprint arXiv:2104.08164, 2021 - arxiv.org
The factual knowledge acquired during pre-training and stored in the parameters of
Language Models (LMs) can be useful in downstream tasks (eg, question answering or …

A neural corpus indexer for document retrieval

Y Wang, Y Hou, H Wang, Z Miao… - Advances in …, 2022 - proceedings.neurips.cc
Current state-of-the-art document retrieval solutions mainly follow an index-retrieve
paradigm, where the index is hard to be directly optimized for the final retrieval target. In this …

Open-domain visual entity recognition: Towards recognizing millions of wikipedia entities

H Hu, Y Luan, Y Chen, U Khandelwal… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large-scale multi-modal pre-training models such as CLIP and PaLI exhibit strong
generalization on various visual domains and tasks. However, existing image classification …

Multilingual generative language models for zero-shot cross-lingual event argument extraction

KH Huang, I Hsu, P Natarajan, KW Chang… - arXiv preprint arXiv …, 2022 - arxiv.org
We present a study on leveraging multilingual pre-trained generative language models for
zero-shot cross-lingual event argument extraction (EAE). By formulating EAE as a language …

GenIE: Generative information extraction

M Josifoski, N De Cao, M Peyrard, F Petroni… - arXiv preprint arXiv …, 2021 - arxiv.org
Structured and grounded representation of text is typically formalized by closed information
extraction, the problem of extracting an exhaustive set of (subject, relation, object) triplets …

One question answering model for many languages with cross-lingual dense passage retrieval

A Asai, X Yu, J Kasai… - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract We present Cross-lingual Open-Retrieval Answer Generation (CORA), the first
unified many-to-many question answering (QA) model that can answer questions across …

A survey on challenges and advances in natural language processing with a focus on legal informatics and low-resource languages

P Krasadakis, E Sakkopoulos, VS Verykios - Electronics, 2024 - mdpi.com
The field of Natural Language Processing (NLP) has experienced significant growth in
recent years, largely due to advancements in Deep Learning technology and especially …

Bridging the gap between indexing and retrieval for differentiable search index with query generation

S Zhuang, H Ren, L Shou, J Pei, M Gong… - arXiv preprint arXiv …, 2022 - arxiv.org
The Differentiable Search Index (DSI) is an emerging paradigm for information retrieval.
Unlike traditional retrieval architectures where index and retrieval are two different and …