[2602.18907] DeepInterestGR: Mining Deep Multi-Interest Using Multi-Modal LLMs for Generative Recommendation

Summary

The paper presents DeepInterestGR, a framework for generative recommendation that uses multi-modal large language models (LLMs) to mine deep, multi-faceted user interests. It targets a limitation of existing methods, which encode items only through surface-level text such as titles and descriptions and so miss the latent interests behind user interactions.

Why It Matters

As recommendation systems become more central to user engagement, modeling the latent interests behind user behavior can improve personalization. The techniques introduced here could lead to more accurate and more interpretable recommendations, which matters for businesses that depend on user data.

Key Takeaways

  • DeepInterestGR uses multi-modal LLMs to extract deeper user interests.
  • The framework addresses the 'Shallow Interest' problem in existing recommendation systems.
  • It employs a two-stage training pipeline combining supervised fine-tuning and reinforcement learning.
  • The authors report significant performance improvements over state-of-the-art methods in their experiments.
  • The approach enhances both personalization depth and recommendation interpretability.

Computer Science > Machine Learning

arXiv:2602.18907 (cs)

[Submitted on 21 Feb 2026]

Title: DeepInterestGR: Mining Deep Multi-Interest Using Multi-Modal LLMs for Generative Recommendation

Authors: Yangchen Zeng

Abstract: Recent generative recommendation frameworks have demonstrated remarkable scaling potential by reformulating item prediction as autoregressive Semantic ID (SID) generation. However, existing methods primarily rely on shallow behavioral signals, encoding items solely through surface-level textual features such as titles and descriptions. This reliance results in a critical Shallow Interest problem: the model fails to capture the latent, semantically rich interests underlying user interactions, limiting both personalization depth and recommendation interpretability. DeepInterestGR introduces three key innovations:

  (1) Multi-LLM Interest Mining (MLIM): We leverage multiple frontier LLMs along with their multi-modal variants to extract deep textual and visual interest representations through Chain-of-Thought prompting.
  (2) Reward-Labeled Deep Interest (RLDI): We employ a lightweight binary classifier to assign reward labels to mined interests, enabling effective supervision signals for reinforcement learning.
  (3) Interest-Enhanced Item Discretization (IEID): The curated deep interests are enco...
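To make the reward-labeling idea in (2) more concrete, here is a minimal sketch of how a lightweight binary classifier could turn mined interest descriptions into 0/1 reward labels. It is not the paper's implementation: the bag-of-words features, the hand-written logistic regression, the training sentences, and every name (`TRAIN`, `featurize`, `RewardLabeler`) are assumptions chosen only for illustration.

```python
import math
from collections import Counter

# Hypothetical hand-labeled examples: 1 = deep interest, 0 = shallow signal.
TRAIN = [
    ("enjoys minimalist design for a calm home office", 1),
    ("trains for trail marathons and values durability", 1),
    ("bought a black chair", 0),
    ("clicked a product title", 0),
]


def featurize(text):
    """Bag-of-words token counts (an assumed, deliberately simple feature set)."""
    return Counter(text.lower().split())


class RewardLabeler:
    """Tiny logistic-regression classifier trained with plain SGD."""

    def __init__(self, lr=0.5, epochs=50):
        self.lr = lr
        self.epochs = epochs
        self.weights = {}
        self.bias = 0.0

    def probability(self, text):
        feats = featurize(text)
        z = self.bias + sum(self.weights.get(tok, 0.0) * n for tok, n in feats.items())
        z = max(-30.0, min(30.0, z))  # clamp to keep exp() numerically safe
        return 1.0 / (1.0 + math.exp(-z))

    def fit(self, examples):
        for _ in range(self.epochs):
            for text, label in examples:
                # Gradient of the log loss with respect to the logit.
                error = self.probability(text) - label
                self.bias -= self.lr * error
                for tok, n in featurize(text).items():
                    self.weights[tok] = self.weights.get(tok, 0.0) - self.lr * error * n
        return self

    def reward(self, text):
        """Binary reward label: 1 if the text is classified as a deep interest."""
        return 1 if self.probability(text) >= 0.5 else 0


labeler = RewardLabeler().fit(TRAIN)
rewards = [labeler.reward(text) for text, _ in TRAIN]
print(rewards)  # [1, 1, 0, 0]
```

In a reinforcement learning stage, labels like these could serve as the scalar reward attached to each mined interest. How the paper actually builds features and trains its classifier is not stated in the excerpt above.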
