Computer Vision

Image recognition, detection, and visual AI

Top This Week

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Machine Learning

arXiv - Machine Learning · 4 min ·
[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Machine Learning

arXiv - AI · 4 min ·
[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild
Computer Vision

arXiv - AI · 4 min ·

All Content

[2603.21661] Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-Stage Pseudo-Rain Synthesis
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.21566] CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.21213] Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis
Machine Learning

arXiv - AI · 3 min ·
[2603.21071] CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels
Computer Vision

arXiv - AI · 4 min ·
[2603.20920] Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computational Processing
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.20898] Natural Gradient Descent for Online Continual Learning
Machine Learning

arXiv - Machine Learning · 3 min ·
[2603.20860] Restoring Neural Network Plasticity for Faster Transfer Learning
Machine Learning

arXiv - AI · 4 min ·
[2603.20836] MERIT: Multi-domain Efficient RAW Image Translation
Machine Learning

arXiv - AI · 4 min ·
[2603.20777] OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation
Machine Learning

arXiv - Machine Learning · 3 min ·
[2603.20729] Weakly supervised multimodal segmentation of acoustic borehole images with depth-aware cross-attention
Computer Vision

arXiv - AI · 4 min ·
[2603.20697] Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models
Machine Learning

arXiv - AI · 4 min ·
[2603.20292] HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting
Machine Learning

arXiv - Machine Learning · 3 min ·
Machine Learning

I curated an 'Awesome List' for Generative AI in Jewelry- papers, datasets, open-source models and tools included!

Jewelry is one of the, if not the, hardest categories for AI image generation. Reflective metals, facet edges, prong geometry, and gemsto...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[N] Understanding & Fine-tuning Vision Transformers

A neat blog post by Mayank Pratap Singh with excellent visuals introducing ViTs from the ground up. The post covers: Patch embedding Posi...

Reddit - Machine Learning · 1 min ·
[2603.14579] Medical Image Spatial Grounding with Semantic Sampling
LLMs

arXiv - Machine Learning · 4 min ·
[2603.19759] Growing Networks with Autonomous Pruning
Machine Learning

arXiv - Machine Learning · 3 min ·
[2603.20021] ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severity Estimation of Lesions in Invasive Coronary Angiography Images
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.17470] VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection
Machine Learning

arXiv - AI · 4 min ·
[2507.16214] Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers
Machine Learning

arXiv - AI · 4 min ·
[2402.01703] Community-Informed AI Models for Police Accountability
Machine Learning

arXiv - Machine Learning · 4 min ·
