[2509.13648] Sequential Data Augmentation for Generative Recommendation

[2509.13648] Sequential Data Augmentation for Generative Recommendation

arXiv - Machine Learning 4 min read Article

Summary

This article introduces GenPAS, a novel framework for data augmentation in generative recommendation systems, emphasizing its impact on model performance and generalization.

Why It Matters

As generative recommendation systems become increasingly integral to personalized user experiences, understanding the role of data augmentation is crucial. This research provides a systematic approach to improve model training, which can lead to better user engagement and satisfaction in various applications.

Key Takeaways

  • GenPAS offers a principled framework for data augmentation in generative recommendation systems.
  • Different augmentation strategies can significantly affect model performance and generalization.
  • The framework allows for flexible control of training distributions, enhancing data efficiency.

Computer Science > Machine Learning arXiv:2509.13648 (cs) [Submitted on 17 Sep 2025 (v1), last revised 22 Feb 2026 (this version, v2)] Title:Sequential Data Augmentation for Generative Recommendation Authors:Geon Lee, Bhuvesh Kumar, Clark Mingxuan Ju, Tong Zhao, Kijung Shin, Neil Shah, Liam Collins View a PDF of the paper titled Sequential Data Augmentation for Generative Recommendation, by Geon Lee and 6 other authors View PDF HTML (experimental) Abstract:Generative recommendation plays a crucial role in personalized systems, predicting users' future interactions from their historical behavior sequences. A critical yet underexplored factor in training these models is data augmentation, the process of constructing training data from user interaction histories. By shaping the training distribution, data augmentation directly and often substantially affects model generalization and performance. Nevertheless, in much of the existing work, this process is simplified, applied inconsistently, or treated as a minor design choice, without a systematic and principled understanding of its effects. Motivated by our empirical finding that different augmentation strategies can yield large performance disparities, we conduct an in-depth analysis of how they reshape training distributions and influence alignment with future targets and generalization to unseen inputs. To systematize this design space, we propose GenPAS, a generalized and principled framework that models augmentation as a...

Related Articles

Machine Learning

[R], 31 MILLIONS High frequency data, Light GBM worked perfectly

We just published a paper on predicting adverse selection in high-frequency crypto markets using LightGBM, and I wanted to share it here ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Those of you with 10+ years in ML — what is the public completely wrong about?

For those of you who've been in ML/AI research or applied ML for 10+ years — what's the gap between what the public thinks AI is doing vs...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

AI assistants are optimized to seem helpful. That is not the same thing as being helpful.

RLHF trains models on human feedback. Humans rate responses they like. And it turns out humans consistently rate confident, fluent, agree...

Reddit - Artificial Intelligence · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime