Computer Vision

Image recognition, detection, and visual AI

Top This Week

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Machine Learning

Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

arXiv - Machine Learning · 4 min ·
[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Machine Learning

Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

arXiv - AI · 4 min ·
[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild
Computer Vision

Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...

arXiv - AI · 4 min ·

All Content

[2603.23020] Concept-based explanations of Segmentation and Detection models in Natural Disaster Management
Machine Learning

Abstract page for arXiv paper 2603.23020: Concept-based explanations of Segmentation and Detection models in Natural Disaster Management

arXiv - AI · 4 min ·
[2603.23037] YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception
LLMs

Abstract page for arXiv paper 2603.23037: YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable...

arXiv - AI · 4 min ·
[2603.22855] TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture Co-design
Computer Vision

Abstract page for arXiv paper 2603.22855: TorR: Towards Brain-Inspired Task-Oriented Reasoning via Cache-Oriented Algorithm-Architecture ...

arXiv - Machine Learning · 4 min ·
[2603.22624] Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion
Machine Learning

Abstract page for arXiv paper 2603.22624: Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion

arXiv - AI · 4 min ·
[2603.22593] Language Models Can Explain Visual Features via Steering
LLMs

Abstract page for arXiv paper 2603.22593: Language Models Can Explain Visual Features via Steering

arXiv - AI · 3 min ·
[2603.22942] Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning
LLMs

Abstract page for arXiv paper 2603.22942: Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning

arXiv - AI · 3 min ·
[2603.22721] HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment
Machine Learning

Abstract page for arXiv paper 2603.22721: HyFI: Hyperbolic Feature Interpolation for Brain-Vision Alignment

arXiv - AI · 4 min ·
LLMs

[For Hire] Full-Stack AI/ML Engineer | Agentic AI · RAG · Computer Vision · Voice AI · LangGraph · FastAPI | Remote

Hey everyone, I'm a Full-Stack AI/ML Engineer with 3+ years of production experience across Agentic AI, RAG pipelines, Computer Vision, a...

Reddit - ML Jobs · 1 min ·
[2411.16196] Learn from Foundation Model: Fruit Detection Model without Manual Annotation
LLMs

Abstract page for arXiv paper 2411.16196: Learn from Foundation Model: Fruit Detection Model without Manual Annotation

arXiv - Machine Learning · 4 min ·
[2603.21377] HamVision: Hamiltonian Dynamics as Inductive Bias for Medical Image Analysis
Computer Vision

Abstract page for arXiv paper 2603.21377: HamVision: Hamiltonian Dynamics as Inductive Bias for Medical Image Analysis

arXiv - Machine Learning · 4 min ·
[2603.20711] RoboECC: Multi-Factor-Aware Edge-Cloud Collaborative Deployment for VLA Models
Machine Learning

Abstract page for arXiv paper 2603.20711: RoboECC: Multi-Factor-Aware Edge-Cloud Collaborative Deployment for VLA Models

arXiv - Machine Learning · 3 min ·
[2603.20921] Discriminative Representation Learning for Clinical Prediction
LLMs

Abstract page for arXiv paper 2603.20921: Discriminative Representation Learning for Clinical Prediction

arXiv - Machine Learning · 3 min ·
[2512.00065] Satellite to Street : Disaster Impact Estimator
Machine Learning

Abstract page for arXiv paper 2512.00065: Satellite to Street : Disaster Impact Estimator

arXiv - AI · 4 min ·
[2511.18493] SAGE: Shape-Adapting Gated Experts for Adaptive Histopathology Image Segmentation
Machine Learning

Abstract page for arXiv paper 2511.18493: SAGE: Shape-Adapting Gated Experts for Adaptive Histopathology Image Segmentation

arXiv - AI · 4 min ·
[2510.13232] What "Not" to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging
LLMs

Abstract page for arXiv paper 2510.13232: What "Not" to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging

arXiv - AI · 4 min ·
[2506.13925] Segmenting Visuals With Querying Words: Language Anchors For Semi-Supervised Image Segmentation
LLMs

Abstract page for arXiv paper 2506.13925: Segmenting Visuals With Querying Words: Language Anchors For Semi-Supervised Image Segmentation

arXiv - AI · 4 min ·
[2504.14636] AlphaZero-Edu: Democratizing Access to AlphaZero
LLMs

Abstract page for arXiv paper 2504.14636: AlphaZero-Edu: Democratizing Access to AlphaZero

arXiv - Machine Learning · 3 min ·
[2603.22002] SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation
Machine Learning

Abstract page for arXiv paper 2603.22002: SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation

arXiv - AI · 4 min ·
[2603.21904] SHAPE: Structure-aware Hierarchical Unsupervised Domain Adaptation with Plausibility Evaluation for Medical Image Segmentation
Machine Learning

Abstract page for arXiv paper 2603.21904: SHAPE: Structure-aware Hierarchical Unsupervised Domain Adaptation with Plausibility Evaluation...

arXiv - AI · 4 min ·
[2603.21824] SteelDefectX: A Coarse-to-Fine Vision-Language Dataset and Benchmark for Generalizable Steel Surface Defect Detection
Machine Learning

Abstract page for arXiv paper 2603.21824: SteelDefectX: A Coarse-to-Fine Vision-Language Dataset and Benchmark for Generalizable Steel Su...

arXiv - AI · 4 min ·