[2411.16196] Learn from Foundation Model: Fruit Detection Model without Manual Annotation
Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.16196 (cs)

[Submitted on 25 Nov 2024 (v1), last revised 22 Mar 2026 (this version, v2)]

Title: Learn from Foundation Model: Fruit Detection Model without Manual Annotation

Authors: Yanan Wang, Zhenghao Fei, Ruichen Li, Yibin Ying

Abstract: Recent breakthroughs in large foundation models have made it possible to transfer knowledge pre-trained on vast datasets to domains with limited data availability. Agriculture is one of the domains that lack sufficient data. This study proposes a framework for training effective, domain-specific, small models from foundation models without manual annotation. Our approach begins with SDM (Segmentation-Description-Matching), a stage that leverages two foundation models: SAM2 (Segment Anything in Images and Videos) for segmentation and OpenCLIP (Open Contrastive Language-Image Pretraining) for zero-shot open-vocabulary classification. In the second stage, a novel knowledge distillation mechanism distills compact, edge-deployable models from SDM, improving both inference speed and perception accuracy. The complete method, termed SDM-D (Segmentation-Description-Matching-Distilling), demonstrates strong performance across various fruit detection tasks object de...
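
A minimal sketch of the SDM stage as described in the abstract, assuming the public SAM2 and OpenCLIP APIs: SAM2's automatic mask generator proposes class-agnostic masks, OpenCLIP scores each masked crop against a set of text descriptions, and the best-matching description becomes the pseudo-label. The checkpoint paths, config names, and prompt wording below are illustrative assumptions, not the authors' released code.

```python
# Sketch of Segmentation-Description-Matching (SDM) pseudo-labelling.
# Paths, configs, and prompts are placeholders, not the paper's code.
import numpy as np
import torch
import open_clip
from PIL import Image
from sam2.build_sam import build_sam2
from sam2.automatic_mask_generator import SAM2AutomaticMaskGenerator

device = "cuda" if torch.cuda.is_available() else "cpu"

# Foundation model 1: SAM2 proposes class-agnostic segmentation masks.
sam2 = build_sam2("configs/sam2.1/sam2.1_hiera_l.yaml",  # placeholder config
                  "checkpoints/sam2.1_hiera_large.pt",   # placeholder weights
                  device=device)
mask_generator = SAM2AutomaticMaskGenerator(sam2)

# Foundation model 2: OpenCLIP matches each mask to a text description.
clip_model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k")
clip_model = clip_model.to(device).eval()
tokenizer = open_clip.get_tokenizer("ViT-B-32")

# Hypothetical open-vocabulary descriptions for a strawberry scene.
descriptions = ["a ripe strawberry", "an unripe strawberry",
                "a leaf", "background"]
with torch.no_grad():
    text_feat = clip_model.encode_text(tokenizer(descriptions).to(device))
    text_feat /= text_feat.norm(dim=-1, keepdim=True)

def pseudo_label(image_path):
    """Return (mask, bbox, label, score) pseudo-annotations for one image."""
    image = np.array(Image.open(image_path).convert("RGB"))
    annotations = []
    for m in mask_generator.generate(image):
        x, y, w, h = map(int, m["bbox"])
        crop = image[y:y + h, x:x + w].copy()
        crop[~m["segmentation"][y:y + h, x:x + w]] = 0  # zero out background
        img_in = preprocess(Image.fromarray(crop)).unsqueeze(0).to(device)
        with torch.no_grad():
            img_feat = clip_model.encode_image(img_in)
            img_feat /= img_feat.norm(dim=-1, keepdim=True)
            sims = (img_feat @ text_feat.T).squeeze(0)
        best = int(sims.argmax())
        annotations.append((m["segmentation"], m["bbox"],
                            descriptions[best], float(sims[best])))
    return annotations
```

Masking out the background before encoding each crop keeps OpenCLIP's similarity score focused on the segmented object rather than its surroundings, which is what makes the zero-shot matching usable as an annotation signal.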
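The abstract describes the second-stage distillation mechanism only at a high level, so the following is a generic soft-target distillation loss (Hinton-style) shown purely to illustrate what distilling a compact student from SDM's outputs could look like; the function name and hyperparameters are placeholders, not the paper's specific mechanism.

```python
# Generic soft-target knowledge distillation loss, for illustration only.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, targets,
                      temperature=2.0, alpha=0.5):
    """Blend hard pseudo-label supervision with softened teacher matching."""
    # Hard loss against SDM's pseudo-labels (targets).
    hard = F.cross_entropy(student_logits, targets)
    # Soft loss: KL divergence between temperature-softened distributions,
    # scaled by T^2 to keep gradient magnitudes comparable.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    return alpha * hard + (1 - alpha) * soft
```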