[2603.27533] DeMo-Pose: Depth-Monocular Modality Fusion For Object Pose Estimation
Computer Science > Computer Vision and Pattern Recognition
arXiv:2603.27533 (cs)
[Submitted on 29 Mar 2026]

Title: DeMo-Pose: Depth-Monocular Modality Fusion For Object Pose Estimation
Authors: Rachit Agarwal, Abhishek Joshi, Sathish Chalasani, Woo Jin Kim

Abstract: Object pose estimation is a fundamental task in 3D vision with applications in robotics, AR/VR, and scene understanding. We address the challenge of category-level 9-DoF pose estimation (6D pose + 3D size) from RGB-D input, without relying on CAD models during inference. Existing depth-only methods achieve strong results but ignore semantic cues from RGB, while many RGB-D fusion models underperform due to suboptimal cross-modal fusion that fails to align semantic RGB cues with 3D geometric representations. We propose DeMo-Pose, a hybrid architecture that fuses monocular semantic features with depth-based graph convolutional representations via a novel multimodal fusion strategy. To further improve geometric reasoning, we introduce a Mesh-Point Loss (MPL) that leverages mesh structure during training without adding inference overhead. Our approach achieves real-time inference and significantly improves over state-of-the-art methods across object categories, outperforming the strong GPV-Pose baseline by 3.2% on 3D IoU and 11.1% on pose accuracy…
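The abstract does not specify the fusion mechanism beyond combining monocular semantic features with depth-based graph-convolutional point features. As a minimal sketch of one plausible realization (not the paper's architecture), assuming a PyTorch setting where RGB backbone features are bilinearly sampled at each depth point's 2D projection and merged with the per-point geometric features by a learned gate; all module names, dimensions, and the gating scheme are illustrative assumptions:

```python
import torch
import torch.nn as nn

class GatedRGBDFusion(nn.Module):
    """Hypothetical per-point fusion of RGB semantics and point-cloud geometry."""

    def __init__(self, rgb_dim=64, geo_dim=128, out_dim=128):
        super().__init__()
        self.proj_rgb = nn.Linear(rgb_dim, out_dim)
        self.proj_geo = nn.Linear(geo_dim, out_dim)
        self.gate = nn.Sequential(nn.Linear(2 * out_dim, out_dim), nn.Sigmoid())

    def forward(self, rgb_feat_map, geo_feat, uv):
        # rgb_feat_map: (B, C_rgb, H, W) semantic features from a monocular backbone
        # geo_feat:     (B, N, C_geo) per-point features from graph convolutions
        # uv:           (B, N, 2) pixel coords of each 3D point, normalized to [-1, 1]
        # Bilinearly sample RGB features at each point's 2D projection.
        sampled = nn.functional.grid_sample(
            rgb_feat_map, uv.unsqueeze(2), align_corners=False
        )  # (B, C_rgb, N, 1)
        rgb_per_point = sampled.squeeze(-1).transpose(1, 2)  # (B, N, C_rgb)
        r = self.proj_rgb(rgb_per_point)
        g = self.proj_geo(geo_feat)
        # Learned gate decides, per point, how much semantic vs. geometric signal to keep.
        a = self.gate(torch.cat([r, g], dim=-1))
        return a * r + (1 - a) * g

# Usage with random tensors standing in for backbone outputs:
fusion = GatedRGBDFusion()
rgb = torch.randn(2, 64, 60, 80)
geo = torch.randn(2, 1024, 128)
uv = torch.rand(2, 1024, 2) * 2 - 1
fused = fusion(rgb, geo, uv)  # (2, 1024, 128)
```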
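Likewise, the Mesh-Point Loss is described only as leveraging mesh structure during training with no inference overhead. A hedged sketch consistent with that description, assuming an ADD-style per-vertex comparison of mesh vertices transformed by the predicted and ground-truth 9-DoF (rotation, translation, size) poses; the paper's actual formulation may differ:

```python
import torch

def mesh_point_loss(verts, R_pred, t_pred, s_pred, R_gt, t_gt, s_gt):
    # verts: (B, V, 3) canonical mesh vertices, needed only at training time
    # R_*:   (B, 3, 3) rotations; t_*: (B, 3) translations; s_*: (B, 3) per-axis sizes
    # Transform the mesh by both poses and penalize per-vertex displacement,
    # so supervision uses the mesh without any inference-time mesh dependency.
    p_pred = (s_pred.unsqueeze(1) * verts) @ R_pred.transpose(1, 2) + t_pred.unsqueeze(1)
    p_gt = (s_gt.unsqueeze(1) * verts) @ R_gt.transpose(1, 2) + t_gt.unsqueeze(1)
    return (p_pred - p_gt).norm(dim=-1).mean()
```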