UDAPDR: unsupervised domain adaptation via LLM prompting and distillation of rerankers

Y Zhu, H Yuan, S Wang, J Liu, W Liu, C Deng… - arXiv preprint arXiv …, 2023 - arxiv.org

As a primary means of information acquisition, information retrieval (IR) systems, such as
search engines, have integrated themselves into our daily lives. These systems also serve …

被引用次数：263 相关文章所有 3 个版本

[PDF] arxiv.org

Retrieval-augmented generation for ai-generated content: A survey

P Zhao, H Zhang, Q Yu, Z Wang, Y Geng, F Fu… - arXiv preprint arXiv …, 2024 - arxiv.org

The development of Artificial Intelligence Generated Content (AIGC) has been facilitated by
advancements in model algorithms, scalable foundation model architectures, and the …

被引用次数：145 相关文章所有 4 个版本

[PDF] arxiv.org

A survey on knowledge distillation of large language models

X Xu, M Li, C Tao, T Shen, R Cheng, J Li, C Xu… - arXiv preprint arXiv …, 2024 - arxiv.org

This survey presents an in-depth exploration of knowledge distillation (KD) techniques
within the realm of Large Language Models (LLMs), spotlighting the pivotal role of KD in …

被引用次数：99 相关文章所有 2 个版本

[PDF] arxiv.org

JaColBERTv2. 5: Optimising Multi-Vector Retrievers to Create State-of-the-Art Japanese Retrievers with Constrained Resources

B Clavié - arXiv preprint arXiv:2407.20750, 2024 - arxiv.org

Neural Information Retrieval has advanced rapidly in high-resource languages, but progress
in lower-resource ones such as Japanese has been hindered by data scarcity, among other …

被引用次数：4 相关文章

[PDF] arxiv.org

Benchmarking and building long-context retrieval models with loco and m2-bert

J Saad-Falcon, DY Fu, S Arora, N Guha… - arXiv preprint arXiv …, 2024 - arxiv.org

Retrieval pipelines-an integral component of many machine learning systems-perform
poorly in domains where documents are long (eg, 10K tokens or more) and where …

被引用次数：11 相关文章所有 4 个版本

被引用次数：3 相关文章

[PDF] arxiv.org

KVPruner: Structural Pruning for Faster and Memory-Efficient Large Language Models

B Lv, Q Zhou, X Ding, Y Wang, Z Ma - arXiv preprint arXiv:2409.11057, 2024 - arxiv.org

The bottleneck associated with the key-value (KV) cache presents a significant challenge
during the inference processes of large language models. While depth pruning accelerates …