[2602.23228] MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction

[2602.23228] MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction

arXiv - AI 4 min read Article

Summary

The paper presents MovieTeller, a novel framework for generating movie synopses using tool-augmented progressive abstraction to enhance character consistency and narrative coherence in automated video summarization.

Why It Matters

As digital entertainment grows, effective automated video summarization becomes crucial for indexing and recommendations. MovieTeller addresses limitations in existing models, improving factual accuracy and narrative coherence, which is vital for content creators and consumers alike.

Key Takeaways

  • MovieTeller enhances movie synopsis generation through tool-augmented methods.
  • The framework avoids costly model fine-tuning by using off-the-shelf models.
  • It improves character identification and narrative coherence in long-form videos.
  • Progressive abstraction helps manage context length limitations of current models.
  • Experiments show significant improvements over traditional end-to-end approaches.

Computer Science > Computer Vision and Pattern Recognition arXiv:2602.23228 (cs) [Submitted on 26 Feb 2026] Title:MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction Authors:Yizhi Li, Xiaohan Chen, Miao Jiang, Wentao Tang, Gaoang Wang View a PDF of the paper titled MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction, by Yizhi Li and 3 other authors View PDF HTML (experimental) Abstract:With the explosive growth of digital entertainment, automated video summarization has become indispensable for applications such as content indexing, personalized recommendation, and efficient media archiving. Automatic synopsis generation for long-form videos, such as movies and TV series, presents a significant challenge for existing Vision-Language Models (VLMs). While proficient at single-image captioning, these general-purpose models often exhibit critical failures in long-duration contexts, primarily a lack of ID-consistent character identification and a fractured narrative coherence. To overcome these limitations, we propose MovieTeller, a novel framework for generating movie synopses via tool-augmented progressive abstraction. Our core contribution is a training-free, tool-augmented, fact-grounded generation process. Instead of requiring costly model fine-tuning, our framework directly leverages off-the-shelf models in a plug-and-play manner. We first invoke a specialized face recognition model as an external "tool" ...

Related Articles

Llms

What I learned about multi-agent coordination running 9 specialized Claude agents

I've been experimenting with multi-agent AI systems and ended up building something more ambitious than I originally planned: a fully ope...

Reddit - Artificial Intelligence · 1 min ·
Llms

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

I've been reviewing how various AI memory systems evaluate their performance and noticed a fundamental issue with cross-system comparison...

Reddit - Machine Learning · 1 min ·
Shifting to AI model customization is an architectural imperative | MIT Technology Review
Llms

Shifting to AI model customization is an architectural imperative | MIT Technology Review

In the early days of large language models (LLMs), we grew accustomed to massive 10x jumps in reasoning and coding capability with every ...

MIT Technology Review · 6 min ·
Llms

Artificial intelligence will always depends on human otherwise it will be obsolete.

I was looking for a tool for my specific need. There was not any. So i started to write the program in python, just basic structure. Then...

Reddit - Artificial Intelligence · 1 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime