[2511.22599] DisCEdge: Distributed Context Management for Large Language Models at the Edge
Computer Science > Distributed, Parallel, and Cluster Computing
arXiv:2511.22599 (cs)
[Submitted on 27 Nov 2025 (v1), last revised 8 Apr 2026 (this version, v2)]

Title: DisCEdge: Distributed Context Management for Large Language Models at the Edge
Authors: Mohammadreza Malekabbasi, Minghe Wang, David Bermbach

Abstract: Deploying Large Language Model (LLM) services at the edge benefits latency-sensitive and privacy-aware applications. However, the stateless nature of LLMs makes managing user context (e.g., sessions, preferences) across geo-distributed edge nodes challenging. Existing solutions, such as client-side context storage, introduce network latency and bandwidth overhead, undermining edge deployment advantages. We propose DisCEdge, a distributed context management system that stores and replicates user context in tokenized form across edge nodes. By maintaining context as token sequences, our system avoids redundant computation and enables efficient data replication. We evaluate an open-source prototype in a realistic edge environment. DisCEdge improves median response times by up to 14.46% and lowers median inter-node synchronization overhead by up to 15% compared to a raw-text-based system. It also reduces client request sizes by a median of 90% compared to client-side context manag...
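The abstract's core idea, keeping per-user context as token sequences on edge nodes so that clients and peers exchange only deltas rather than raw text, can be illustrated with a minimal sketch. This is a hypothetical illustration, not the paper's actual API or implementation; the class and method names are invented for exposition, and real token IDs would come from the model's tokenizer.

```python
# Hypothetical sketch of token-based context storage at an edge node
# (not DisCEdge's actual implementation). Each user's context is held
# as a list of token IDs; appending a turn returns only the new tokens,
# which is the delta a node would replicate to its peers.

class EdgeContextStore:
    def __init__(self):
        self._contexts = {}  # user_id -> list of token IDs

    def append(self, user_id, new_tokens):
        """Append newly tokenized input and return the replication delta."""
        self._contexts.setdefault(user_id, []).extend(new_tokens)
        return list(new_tokens)  # only the delta crosses the network

    def context(self, user_id):
        """Full tokenized context, ready to feed the model without re-tokenizing."""
        return list(self._contexts.get(user_id, []))


store = EdgeContextStore()
store.append("alice", [101, 2023, 2003])        # first turn: 3 token IDs
delta = store.append("alice", [1037, 3231])     # later turn: only 2 new IDs sent
print(len(store.context("alice")), len(delta))  # 5 2
```

Because the stored context is already tokenized, a node serving a follow-up request can skip re-tokenization, and inter-node synchronization ships compact integer deltas instead of full raw-text histories, which is the intuition behind the reported reductions in request size and sync overhead.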