Optimization methods for personalizing large language models through retrieval augmentation

A Salemi, H Zamani - Proceedings of the 47th International ACM SIGIR …, 2024 - dl.acm.org

Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for
retrieval models within these systems. Traditional end-to-end evaluation methods are …

被引用次数：25 相关文章所有 2 个版本

[PDF] acm.org

Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization

H Zamani, M Bendersky - Proceedings of the 47th International ACM …, 2024 - dl.acm.org

This paper introduces Stochastic RAG--a novel approach for end-to-end optimization of
retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of …

被引用次数：6 相关文章所有 2 个版本

[PDF] acm.org

Towards a search engine for machines: Unified ranking for multiple retrieval-augmented large language models

A Salemi, H Zamani - Proceedings of the 47th International ACM SIGIR …, 2024 - dl.acm.org

This paper introduces uRAG-a framework with a unified retrieval engine that serves multiple
downstream retrieval-augmented generation (RAG) systems. Each RAG system consumes …

被引用次数：4 相关文章所有 2 个版本

[PDF] acm.org

Planning ahead in generative retrieval: Guiding autoregressive generation through simultaneous decoding

H Zeng, C Luo, H Zamani - Proceedings of the 47th International ACM …, 2024 - dl.acm.org

This paper introduces PAG-a novel optimization and decoding approach that guides
autoregressive generation of document identifiers in generative retrieval models through …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Comparing retrieval-augmentation and parameter-efficient fine-tuning for privacy-preserving personalization of large language models

A Salemi, H Zamani - arXiv preprint arXiv:2409.09510, 2024 - arxiv.org

Privacy-preserving methods for personalizing large language models (LLMs) are relatively
under-explored. There are two schools of thought on this topic:(1) generating personalized …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

LongLaMP: A Benchmark for Personalized Long-form Text Generation

I Kumar, S Viswanathan, S Yerra, A Salemi… - arXiv preprint arXiv …, 2024 - arxiv.org

Long-text generation is seemingly ubiquitous in real-world applications of large language
models such as generating an email or writing a review. Despite the fundamental …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

TE Kim, A Salemi, A Drozdov, F Diaz… - arXiv preprint arXiv …, 2024 - arxiv.org

In the field of language modeling, models augmented with retrieval components have
emerged as a promising solution to address several challenges faced in the natural …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

HYDRA: Model Factorization Framework for Black-Box LLM Personalization

Y Zhuang, H Sun, Y Yu, Q Wang, C Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Personalization has emerged as a critical research area in modern intelligent systems,
focusing on mining users' behavioral history and adapting to their preferences for delivering …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Understanding the role of user profile in the personalization of large language models

B Wu, Z Shi, HA Rahmani, V Ramineni… - arXiv preprint arXiv …, 2024 - arxiv.org

Utilizing user profiles to personalize Large Language Models (LLMs) has been shown to
enhance the performance on a wide range of tasks. However, the precise role of user …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

A Review on Edge Large Language Models: Design, Execution, and Applications

Y Zheng, Y Chen, B Qian, X Shi, Y Shu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models (LLMs) have revolutionized natural language processing with their
exceptional capabilities. However, deploying LLMs on resource-constrained edge devices …