Evaluating retrieval quality in retrieval-augmented generation
Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for
retrieval models within these systems. Traditional end-to-end evaluation methods are …
retrieval models within these systems. Traditional end-to-end evaluation methods are …
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization
H Zamani, M Bendersky - Proceedings of the 47th International ACM …, 2024 - dl.acm.org
This paper introduces Stochastic RAG--a novel approach for end-to-end optimization of
retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of …
retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of …
Towards a search engine for machines: Unified ranking for multiple retrieval-augmented large language models
This paper introduces uRAG-a framework with a unified retrieval engine that serves multiple
downstream retrieval-augmented generation (RAG) systems. Each RAG system consumes …
downstream retrieval-augmented generation (RAG) systems. Each RAG system consumes …
Planning ahead in generative retrieval: Guiding autoregressive generation through simultaneous decoding
This paper introduces PAG-a novel optimization and decoding approach that guides
autoregressive generation of document identifiers in generative retrieval models through …
autoregressive generation of document identifiers in generative retrieval models through …
Comparing retrieval-augmentation and parameter-efficient fine-tuning for privacy-preserving personalization of large language models
Privacy-preserving methods for personalizing large language models (LLMs) are relatively
under-explored. There are two schools of thought on this topic:(1) generating personalized …
under-explored. There are two schools of thought on this topic:(1) generating personalized …
LongLaMP: A Benchmark for Personalized Long-form Text Generation
I Kumar, S Viswanathan, S Yerra, A Salemi… - arXiv preprint arXiv …, 2024 - arxiv.org
Long-text generation is seemingly ubiquitous in real-world applications of large language
models such as generating an email or writing a review. Despite the fundamental …
models such as generating an email or writing a review. Despite the fundamental …
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
In the field of language modeling, models augmented with retrieval components have
emerged as a promising solution to address several challenges faced in the natural …
emerged as a promising solution to address several challenges faced in the natural …
HYDRA: Model Factorization Framework for Black-Box LLM Personalization
Personalization has emerged as a critical research area in modern intelligent systems,
focusing on mining users' behavioral history and adapting to their preferences for delivering …
focusing on mining users' behavioral history and adapting to their preferences for delivering …
Understanding the role of user profile in the personalization of large language models
B Wu, Z Shi, HA Rahmani, V Ramineni… - arXiv preprint arXiv …, 2024 - arxiv.org
Utilizing user profiles to personalize Large Language Models (LLMs) has been shown to
enhance the performance on a wide range of tasks. However, the precise role of user …
enhance the performance on a wide range of tasks. However, the precise role of user …
A Review on Edge Large Language Models: Design, Execution, and Applications
Large language models (LLMs) have revolutionized natural language processing with their
exceptional capabilities. However, deploying LLMs on resource-constrained edge devices …
exceptional capabilities. However, deploying LLMs on resource-constrained edge devices …