[2603.26498] Rocks, Pebbles and Sand: Modality-aware Scheduling for Multimodal Large Language Model Inference
Computer Science > Distributed, Parallel, and Cluster Computing
arXiv:2603.26498 (cs)
[Submitted on 27 Mar 2026]

Title: Rocks, Pebbles and Sand: Modality-aware Scheduling for Multimodal Large Language Model Inference
Authors: Konstantinos Papaioannou, Thaleia Dimitra Doudali

Abstract: Multimodal Large Language Models (MLLMs) power platforms like ChatGPT, Gemini, and Copilot, enabling richer interactions with text, images, and videos. These heterogeneous workloads introduce additional inference stages, such as vision preprocessing and encoding, that inflate latency and memory demand. Existing LLM serving systems, optimized for text-only workloads, fail under multimodality: large requests (e.g., videos) monopolize resources, causing severe head-of-line blocking and performance degradation. Our key insight is that multimodal requests differ by orders of magnitude in resource demands, which we capture through a simple abstraction: videos behave like rocks, images like pebbles, and text like sand. We design RPS-Serve, a modality-aware scheduler that lets sand flow quickly through pebbles and rocks, preserving interactive responsiveness. RPS-Serve classifies requests by modality, prioritizes them dynamically, and applies aging to avoid starvation. Evaluation a...
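The abstract describes the scheduler at a high level: classify requests by modality, assign dynamic priorities, and apply aging so large requests are not starved. The following is a minimal, hypothetical sketch of that idea, not the paper's actual implementation; the class name `ModalityAwareScheduler`, the `aging_step` parameter, and the numeric base priorities are all illustrative assumptions.

```python
import itertools

# Assumed base priorities: sand (text) is most urgent, rocks (video) least.
BASE_PRIORITY = {"sand": 0, "pebble": 1, "rock": 2}

def classify(modality):
    """Map a request's modality to the rocks/pebbles/sand abstraction."""
    return {"video": "rock", "image": "pebble"}.get(modality, "sand")

class ModalityAwareScheduler:
    """Toy priority scheduler with aging to prevent starvation of large requests."""

    def __init__(self, aging_step=1):
        self.aging_step = aging_step  # priority boost per scheduling round waited
        self.round = 0
        self._seq = itertools.count()  # FIFO tie-breaker among equal priorities
        self._pending = []  # entries of (arrival_round, seq, request)

    def submit(self, request):
        self._pending.append((self.round, next(self._seq), request))

    def next_request(self):
        # Effective priority = base priority minus accrued age, so a rock that
        # has waited long enough eventually outranks freshly arrived sand.
        def effective_priority(entry):
            arrival, seq, req = entry
            base = BASE_PRIORITY[classify(req["modality"])]
            age = self.round - arrival
            return (base - self.aging_step * age, seq)

        entry = min(self._pending, key=effective_priority)
        self._pending.remove(entry)
        self.round += 1
        return entry[2]
```

Under this sketch, a video request submitted alongside a stream of text requests is deferred for a few rounds (sand flows first) but is eventually scheduled once its age cancels its higher base priority.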