[2602.20409] CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation
Computer Science > Computer Vision and Pattern Recognition

arXiv:2602.20409 (cs)

[Submitted on 23 Feb 2026 (v1), last revised 21 Apr 2026 (this version, v2)]

Title: CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation

Authors: Mainak Singha, Sarthak Mehrotra, Paolo Casari, Subhasis Chaudhuri, Elisa Ricci, Biplab Banerjee

Abstract: Recent vision-language models (VLMs) such as CLIP demonstrate impressive cross-modal reasoning, extending beyond images to 3D perception. Yet these models remain fragile under domain shift, especially when adapting from synthetic to real-world point clouds. Conventional 3D domain adaptation approaches rely on heavy trainable encoders, yielding strong accuracy at the cost of efficiency. We introduce CLIPoint3D, the first framework for few-shot unsupervised 3D point cloud domain adaptation built upon CLIP. Our approach projects 3D samples into multiple depth maps and exploits the frozen CLIP backbone, refined through a knowledge-driven prompt tuning scheme that integrates high-level language priors with geometric cues from a lightweight 3D encoder. To adapt task-specific features effectively, we apply parameter-efficient fine-tuning to CLIP's encoders and design an entropy-guided view sampling strategy for selecting confident projections.
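As an illustration of the multi-view projection step, below is a minimal sketch of rendering a point cloud into depth maps from several azimuths. The paper's actual camera model, view count, and resolution are not given in this abstract, so the orthographic projection, six evenly spaced views, and 224x224 output assumed here are illustrative only.

import numpy as np

def depth_map(points: np.ndarray, azimuth: float, size: int = 224) -> np.ndarray:
    """Render a (size x size) depth map of a unit-normalized point cloud
    seen from the given azimuth (radians) around the z-axis."""
    c, s = np.cos(azimuth), np.sin(azimuth)
    rot = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    pts = points @ rot.T  # rotate the cloud into the view frame
    # Orthographic projection: x and z become pixel coordinates, y is depth.
    u = np.clip(((pts[:, 0] + 1.0) / 2.0 * (size - 1)).astype(int), 0, size - 1)
    v = np.clip(((pts[:, 2] + 1.0) / 2.0 * (size - 1)).astype(int), 0, size - 1)
    depth = np.clip((pts[:, 1] + 1.0) / 2.0, 0.0, 1.0)
    img = np.zeros((size, size))
    # Z-buffer: keep the nearest point per pixel (largest 1 - depth).
    np.maximum.at(img, (v, u), 1.0 - depth)
    return img

rng = np.random.default_rng(0)
cloud = rng.uniform(-1.0, 1.0, size=(2048, 3))  # stand-in for a real point cloud
views = [depth_map(cloud, a) for a in np.linspace(0.0, 2.0 * np.pi, 6, endpoint=False)]

Each resulting map could then be fed to the frozen CLIP image encoder as an ordinary image (a single-channel depth map is typically replicated to three channels first).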
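The entropy-guided view sampling can likewise be sketched: score each view's predicted class distribution by its Shannon entropy and keep the lowest-entropy (most confident) views. The select_confident_views helper, the softmax inputs, and the choice of k below are assumptions for illustration; the abstract does not specify the exact criterion.

import numpy as np

def entropy(p: np.ndarray, eps: float = 1e-12) -> float:
    """Shannon entropy of a probability vector; lower means more confident."""
    p = np.clip(p, eps, 1.0)
    return float(-(p * np.log(p)).sum())

def select_confident_views(view_probs: np.ndarray, k: int = 3) -> list[int]:
    """Return indices of the k views with the lowest predictive entropy."""
    scores = np.array([entropy(p) for p in view_probs])
    return np.argsort(scores)[:k].tolist()

rng = np.random.default_rng(0)
logits = rng.normal(size=(6, 10))            # hypothetical per-view class logits
logits -= logits.max(axis=1, keepdims=True)  # numerically stable softmax
probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
keep = select_confident_views(probs, k=3)    # indices of the 3 most confident of 6 views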