Evaluating retrieval quality in retrieval-augmented generation

A Salemi, H Zamani - Proceedings of the 47th International ACM SIGIR …, 2024 - dl.acm.org
Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for
retrieval models within these systems. Traditional end-to-end evaluation methods are …

Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization

H Zamani, M Bendersky - Proceedings of the 47th International ACM …, 2024 - dl.acm.org
This paper introduces Stochastic RAG--a novel approach for end-to-end optimization of
retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of …

Towards a search engine for machines: Unified ranking for multiple retrieval-augmented large language models

A Salemi, H Zamani - Proceedings of the 47th International ACM SIGIR …, 2024 - dl.acm.org
This paper introduces uRAG-a framework with a unified retrieval engine that serves multiple
downstream retrieval-augmented generation (RAG) systems. Each RAG system consumes …

Planning ahead in generative retrieval: Guiding autoregressive generation through simultaneous decoding

H Zeng, C Luo, H Zamani - Proceedings of the 47th International ACM …, 2024 - dl.acm.org
This paper introduces PAG-a novel optimization and decoding approach that guides
autoregressive generation of document identifiers in generative retrieval models through …

Comparing retrieval-augmentation and parameter-efficient fine-tuning for privacy-preserving personalization of large language models

A Salemi, H Zamani - arXiv preprint arXiv:2409.09510, 2024 - arxiv.org
Privacy-preserving methods for personalizing large language models (LLMs) are relatively
under-explored. There are two schools of thought on this topic:(1) generating personalized …

LongLaMP: A Benchmark for Personalized Long-form Text Generation

I Kumar, S Viswanathan, S Yerra, A Salemi… - arXiv preprint arXiv …, 2024 - arxiv.org
Long-text generation is seemingly ubiquitous in real-world applications of large language
models such as generating an email or writing a review. Despite the fundamental …

Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

TE Kim, A Salemi, A Drozdov, F Diaz… - arXiv preprint arXiv …, 2024 - arxiv.org
In the field of language modeling, models augmented with retrieval components have
emerged as a promising solution to address several challenges faced in the natural …

HYDRA: Model Factorization Framework for Black-Box LLM Personalization

Y Zhuang, H Sun, Y Yu, Q Wang, C Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Personalization has emerged as a critical research area in modern intelligent systems,
focusing on mining users' behavioral history and adapting to their preferences for delivering …

Understanding the role of user profile in the personalization of large language models

B Wu, Z Shi, HA Rahmani, V Ramineni… - arXiv preprint arXiv …, 2024 - arxiv.org
Utilizing user profiles to personalize Large Language Models (LLMs) has been shown to
enhance the performance on a wide range of tasks. However, the precise role of user …

A Review on Edge Large Language Models: Design, Execution, and Applications

Y Zheng, Y Chen, B Qian, X Shi, Y Shu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have revolutionized natural language processing with their
exceptional capabilities. However, deploying LLMs on resource-constrained edge devices …