[2603.01581] KERV: Kinematic-Rectified Speculative Decoding for

[2603.01581] KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

arXiv - Machine Learning March 03, 2026 3 min read

About this article

Abstract page for arXiv paper 2603.01581: KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

Computer Science > Robotics arXiv:2603.01581 (cs) [Submitted on 2 Mar 2026] Title:KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models Authors:Zihao Zheng, Zhihao Mao, Maoliang Li, Jiayu Chen, Xinhao Sun, Zhaobo Zhang, Donggang Cao, Hong Mei, Xiang Chen View a PDF of the paper titled KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models, by Zihao Zheng and 7 other authors View PDF HTML (experimental) Abstract:Vision-Language-Action (VLA) models build a token-domain robot control paradigm, yet suffer from low speed. Speculative Decoding (SD) is an optimization strategy that can boost inference speed. Two key issues emerge when integrating VLA and SD: first, SD relies on re-inference to address token errors, which is computationally expensive; second, to mitigate token errors, the acceptance threshold in SD requires careful adjustment. Existing works fail to address the above two issues effectively. Meanwhile, as the bridge between AI and the physical world, existing embodied intelligence has overlooked the application of robotic kinematics. To address these issues, we innovatively combine token-domain VLA models with kinematic-domain prediction for SD, proposing a kinematic-rectified SD framework named KERV. We employ a kinematics-based Kalman Filter to predict actions and compensate for SD errors, avoiding costly re-inference. Moreover, we design a kinematics-based adjustment strategy to dynamically rectify the acceptance threshold, addre...

Originally published on March 03, 2026. Curated by AI News.

Machine Learning

[2603.23899] SM-Net: Learning a Continuous Spectral Manifold from Multiple Stellar Libraries

Abstract page for arXiv paper 2603.23899: SM-Net: Learning a Continuous Spectral Manifold from Multiple Stellar Libraries

arXiv - AI · 4 min · 8 minutes ago

Llms

[2603.16629] MLLM-based Textual Explanations for Face Comparison

Abstract page for arXiv paper 2603.16629: MLLM-based Textual Explanations for Face Comparison

arXiv - AI · 4 min · 8 minutes ago

Llms

[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

Abstract page for arXiv paper 2603.15159: To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

arXiv - AI · 4 min · 8 minutes ago

Machine Learning

[2603.14375] The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

Abstract page for arXiv paper 2603.14375: The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

arXiv - AI · 4 min · 8 minutes ago

[2603.01581] KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

About this article

Related Articles

[2603.23899] SM-Net: Learning a Continuous Spectral Manifold from Multiple Stellar Libraries

[2603.16629] MLLM-based Textual Explanations for Face Comparison

[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

[2603.14375] The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

No comments

Stay updated with AI News