CIKM 2025 Tutorial

Towards Large Generative Recommendation:
A Tokenization Perspective


1University of California San Diego, 2University of Science and Technology of China, 3National University of Singapore

Monday, November 10, 2025, 13:45 - 18:00 PM, in Room 203

About This Tutorial

The emergence of large generative models is transforming the landscape of recommender systems. One of the most fundamental components in building these models is action tokenization, the process of converting human-readable data (e.g., user-item interactions) into machine-readable formats (e.g., discrete token sequences). In this tutorial, we present a comprehensive overview of existing action tokenization techniques, converting actions to (1) item IDs, (2) textual descriptions, and (3) semantic IDs, and explore how they relate to the development of large generative recommendation models. We then make an in-depth discussion on the challenges, open questions, and potential future directions from the perspective of action tokenization, aiming to inspire the design of next-generation recommender systems.

Tutorial Program

Time Session
13:45 - 14:15 Part 1: Background and Introduction
14:15 - 15:00 Part 2: LLM-based Generative Recommendation
15:00 - 15:30 Part 3: Semantic IDs
15:30 - 16:00 Coffee Break
16:00 - 17:15 Part 4: Semantic ID-based Generative Recommendation
17:15 - 17:45 Part 5: Open Challenges and Beyond
17:45 - 18:00 Q&A Discussion

BibTeX

@inproceedings{cikm25-gen-rec-tutorial,
  author = {Hou, Yupeng and Zhang, An and Sheng, Leheng and Wu, Jiancan and Wang, Xiang and Chua, Tat-Seng and McAuley, Julian},
  title = {Towards Large Generative Recommendation: A Tokenization Perspective},
  year = {2025},
  booktitle = {Proceedings of the 34th ACM International Conference on Information and Knowledge Management},
  pages = {6821-6824},
  numpages = {4}
}