[2604.01989] Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation
Abstract page for arXiv paper 2604.01989: Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation
GPUs, training clusters, MLOps, and deployment
Abstract page for arXiv paper 2604.01989: Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation
Abstract page for arXiv paper 2512.18809: FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation
Abstract page for arXiv paper 2512.08980: Training Multi-Image Vision Agents via End2End Reinforcement Learning
The paper presents UrbanFM, a novel framework for scaling urban spatio-temporal foundation models, addressing challenges in generalizabil...
The paper presents PRECTR-V2, an advanced framework for improving search relevance and click-through rate (CTR) prediction by addressing ...
The paper presents Agile V, a framework integrating AI in engineering workflows to ensure compliance and verification at machine-speed de...
The paper presents Dataset Color Quantization (DCQ), a framework designed to compress large-scale image datasets by reducing color-space ...
The paper presents OptiLeak, a framework utilizing reinforcement learning to enhance prompt reconstruction efficiency in multi-tenant LLM...
The paper introduces LESA, a framework for accelerating diffusion models using learnable stage-aware predictors, achieving significant sp...
This paper introduces a novel pruning method for fully-connected neural networks, which compensates for the removal of weights by adjusti...
This article explores the differences between protein language models (PLMs) and natural language models, highlighting how these distinct...
The paper presents a novel algorithm for imputing unknown missing values in sparse electronic health records (EHRs) using a transformer-b...
This article discusses three significant challenges and two potential solutions for improving the safety of unsupervised elicitation in l...
This paper presents a continual learning framework for neural OFDM receivers that allows for real-time adaptation to changing communicati...
The paper introduces QueryBandits, a model-agnostic framework designed to mitigate hallucinations in large language models (LLMs) by opti...
This article examines the expectation-realisation gap in agentic AI systems, revealing discrepancies between anticipated productivity gai...
This paper presents a multi-task deep learning model for predicting delivery delay durations in logistics, addressing challenges posed by...
The paper introduces KnapSpec, a framework for self-speculative decoding that optimizes layer selection in LLMs as a knapsack problem, en...
This paper proposes the 'Right to History,' a principle ensuring individuals have a verifiable record of AI agent actions on personal har...
This paper presents ESM, a novel framework for merging multiple task-specific models into a single multi-task model, addressing inter-tas...
This article discusses the concept of 'golden layers' in large language models (LLMs) and presents a novel method, Layer Gradient Analysi...
This paper analyzes the effectiveness of latency hiding and parallelism techniques in an MLIR-based AI kernel compiler, focusing on vecto...
The paper presents OptimusVLA, a dual-memory framework for robotic manipulation that enhances efficiency and robustness in action generat...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime