A comprehensive tutorial on retrieval, augmentation, and generation strategies that ground large language models in external knowledge for better recommendations.
Large language models are increasingly adopted for recommendation tasks thanks to their reasoning capabilities and effectiveness with cold-start items. This tutorial presents a comprehensive taxonomy of retrieval-augmented generation techniques tailored for LLM-based recommender systems — covering the types of input information, design choices at each RAG pipeline stage (retrieval, augmentation, generation), and open research questions.
Target audience: researchers and practitioners interested in recommender systems, information retrieval, and large language models. Basic familiarity with RecSys and LLMs is helpful but not required.
LLMs offer strong reasoning and generation abilities, but often lack personalized, up-to-date, and domain-specific knowledge. RAG bridges this gap by grounding recommendations in retrieved user, item, and contextual information.
How can retrieval improve LLM-based recommendation quality?
What should be retrieved — users, items, reviews, knowledge graphs, or demonstrations?
How should retrieved information be represented and injected into LLMs?
How can we evaluate RAG-based recommender systems effectively?
A structured walk-through of the RAG pipeline for recommendation, from fundamentals to open problems.
A team of researchers from KAIST and Snap Inc. with deep expertise in recommender systems, graph mining, and LLMs.



* Equal contribution