[2603.04158] GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning
Computer Science > Robotics

arXiv:2603.04158 (cs) [Submitted on 4 Mar 2026]

Title: GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning

Authors: Mingleyang Li, Yuran Wang, Yue Chen, Tianxing Chen, Jiaqi Liang, Zishun Shen, Haoran Lu, Ruihai Wu, Hao Dong

Abstract: Garment manipulation has attracted increasing attention due to its critical role in home-assistant robotics. However, most existing garment manipulation works assume an initial state consisting of only one garment, while piled garments are far more common in real-world settings. To bridge this gap, we propose a novel garment retrieval pipeline that not only follows language instructions to execute safe and clean retrieval but also guarantees that exactly one garment is retrieved per attempt, establishing a robust foundation for the execution of downstream tasks (e.g., folding, hanging, wearing). Our pipeline seamlessly integrates vision-language reasoning with visual affordance perception, fully leveraging the high-level reasoning and planning capabilities of VLMs alongside the generalization power of visual affordance for low-level actions. To enhance the VLM's comprehensive awareness of each garment's state within a garment pile, we employ a visual segmentation model (SAM2) to execut...
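The two-stage pipeline described in the abstract, in which a VLM first reasons over segmented garment candidates to pick a target and a visual affordance map then selects the low-level grasp point, could be sketched roughly as follows. This is a minimal illustration under stated assumptions: the function name `choose_grasp`, the dictionary of per-garment affordance grids, and the toy scores are all hypothetical and not taken from the paper's implementation.

```python
def choose_grasp(affordance_maps, target_id):
    """Pick the highest-affordance pixel on the VLM-selected garment.

    affordance_maps: dict mapping garment id -> 2D grid of affordance
        scores in [0, 1], one grid per segmented garment (hypothetical
        stand-in for the paper's affordance perception output).
    target_id: the garment chosen by the vision-language reasoning step.
    Returns (row, col) of the best grasp point.
    """
    grid = affordance_maps[target_id]
    best_score, best_rc = float("-inf"), None
    for r, row in enumerate(grid):
        for c, score in enumerate(row):
            if score > best_score:
                best_score, best_rc = score, (r, c)
    return best_rc

# Toy example: two segmented garments; the VLM has selected "shirt_1".
maps = {
    "shirt_1": [[0.1, 0.9], [0.3, 0.2]],
    "towel_2": [[0.5, 0.4], [0.6, 0.1]],
}
print(choose_grasp(maps, "shirt_1"))  # -> (0, 1)
```

The split mirrors the division of labor the abstract emphasizes: high-level target selection by the VLM, low-level grasp selection by per-garment affordance, so each component can be improved independently.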