[2509.06415] Index-Preserving Lightweight Token Pruning for Efficient Document Understanding in Vision-Language Models
Computer Science > Computer Vision and Pattern Recognition
arXiv:2509.06415 (cs)
[Submitted on 8 Sep 2025 (v1), last revised 4 Mar 2026 (this version, v2)]

Title: Index-Preserving Lightweight Token Pruning for Efficient Document Understanding in Vision-Language Models
Authors: Jaemin Son, Sujin Choi, Inyong Yun

Abstract: Recent progress in vision-language models (VLMs) has led to impressive results on document understanding tasks, but their high computational demands remain a challenge. To mitigate this compute burden, we propose a lightweight token pruning framework that filters out non-informative background regions from document images before VLM processing. A binary patch-level classifier removes non-text areas, and a max-pooling refinement step recovers fragmented text regions to enhance spatial coherence. Experiments on real-world document datasets demonstrate that our approach substantially lowers computational costs while maintaining comparable accuracy.

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as: arXiv:2509.06415 [cs.CV] (or arXiv:2509.06415v2 [cs.CV] for this version)
DOI: https://doi.org/10.48550/arXiv.2509.06415
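The pipeline the abstract describes — classify patches as text/background, dilate the resulting mask with max pooling to reconnect fragmented text regions, then keep only the surviving tokens along with their original positions — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the patch classifier is abstracted as a precomputed binary mask, and the function names and 3x3 pooling window are assumptions for the example.

```python
# Hedged sketch of index-preserving patch pruning with max-pooling
# refinement. The binary patch classifier from the paper is replaced
# here by a precomputed mask; kernel size k=3 is illustrative.
import numpy as np

def refine_mask(mask: np.ndarray, k: int = 3) -> np.ndarray:
    """Max-pool (dilate) a binary patch mask with a k x k window,
    so isolated text patches pull in their neighbors and fragmented
    text regions become spatially coherent."""
    h, w = mask.shape
    pad = k // 2
    padded = np.pad(mask, pad, mode="constant")
    out = np.zeros_like(mask)
    for i in range(h):
        for j in range(w):
            out[i, j] = padded[i:i + k, j:j + k].max()
    return out

def prune_tokens(tokens: np.ndarray, mask: np.ndarray):
    """Drop tokens whose patch is marked background. Returns the kept
    tokens together with their original flattened patch indices, so
    positional information is preserved for the downstream VLM."""
    keep = np.flatnonzero(refine_mask(mask).ravel())
    return tokens[keep], keep

# Toy usage: a 4x4 patch grid with one "text" patch at (1, 1).
mask = np.zeros((4, 4), dtype=np.uint8)
mask[1, 1] = 1
tokens = np.arange(16).reshape(16, 1)       # one dummy token per patch
kept, indices = prune_tokens(tokens, mask)  # 9 tokens survive dilation
```

Because the returned `indices` refer to positions in the original patch grid, the pruned token sequence can still be paired with the correct positional embeddings — the "index-preserving" property in the title.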