Computer Vision

Image recognition, detection, and visual AI

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

arXiv - Machine Learning · 4 min · about 8 hours ago

Machine Learning

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

arXiv - AI · 4 min · about 8 hours ago

Computer Vision

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...

arXiv - AI · 4 min · about 8 hours ago

All Content

Machine Learning

[2603.21661] Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-Stage Pseudo-Rain Synthesis

Abstract page for arXiv paper 2603.21661: Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-...

arXiv - Machine Learning · 4 min · 3 days ago

Machine Learning

[2603.21566] CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation

Abstract page for arXiv paper 2603.21566: CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Gr...

arXiv - Machine Learning · 4 min · 3 days ago

Machine Learning

[2603.21213] Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis

Abstract page for arXiv paper 2603.21213: Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis

arXiv - AI · 3 min · 3 days ago

Computer Vision

[2603.21071] CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels

Abstract page for arXiv paper 2603.21071: CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation wi...

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.20920] Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computational Processing

Abstract page for arXiv paper 2603.20920: Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computat...

arXiv - Machine Learning · 4 min · 3 days ago

Machine Learning

[2603.20898] Natural Gradient Descent for Online Continual Learning

Abstract page for arXiv paper 2603.20898: Natural Gradient Descent for Online Continual Learning

arXiv - Machine Learning · 3 min · 3 days ago

Machine Learning

[2603.20860] Restoring Neural Network Plasticity for Faster Transfer Learning

Abstract page for arXiv paper 2603.20860: Restoring Neural Network Plasticity for Faster Transfer Learning

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.20836] MERIT: Multi-domain Efficient RAW Image Translation

Abstract page for arXiv paper 2603.20836: MERIT: Multi-domain Efficient RAW Image Translation

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.20777] OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation

Abstract page for arXiv paper 2603.20777: OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Se...

arXiv - Machine Learning · 3 min · 3 days ago

Computer Vision

[2603.20729] Weakly supervised multimodal segmentation of acoustic borehole images with depth-aware cross-attention

Abstract page for arXiv paper 2603.20729: Weakly supervised multimodal segmentation of acoustic borehole images with depth-aware cross-at...

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.20697] Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models

Abstract page for arXiv paper 2603.20697: Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vis...

arXiv - AI · 4 min · 3 days ago

Machine Learning

[2603.20292] HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting

Abstract page for arXiv paper 2603.20292: HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting

arXiv - Machine Learning · 3 min · 3 days ago

Machine Learning

I curated an 'Awesome List' for Generative AI in Jewelry- papers, datasets, open-source models and tools included!

Jewelry is one of the, if not the, hardest categories for AI image generation. Reflective metals, facet edges, prong geometry, and gemsto...

Reddit - Artificial Intelligence · 1 min · 4 days ago

Machine Learning

[N] Understanding & Fine-tuning Vision Transformers

A neat blog post by Mayank Pratap Singh with excellent visuals introducing ViTs from the ground up. The post covers: Patch embedding Posi...

Reddit - Machine Learning · 1 min · 4 days ago

Llms

[2603.14579] Medical Image Spatial Grounding with Semantic Sampling

Abstract page for arXiv paper 2603.14579: Medical Image Spatial Grounding with Semantic Sampling

arXiv - Machine Learning · 4 min · 4 days ago

Machine Learning

[2603.19759] Growing Networks with Autonomous Pruning

Abstract page for arXiv paper 2603.19759: Growing Networks with Autonomous Pruning

arXiv - Machine Learning · 3 min · 4 days ago

Machine Learning

[2603.20021] ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severity Estimation of Lesions in Invasive Coronary Angiography Images

Abstract page for arXiv paper 2603.20021: ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severit...

arXiv - Machine Learning · 4 min · 4 days ago

Machine Learning

[2603.17470] VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection

Abstract page for arXiv paper 2603.17470: VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection

arXiv - AI · 4 min · 4 days ago

Machine Learning

[2507.16214] Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers

Abstract page for arXiv paper 2507.16214: Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers

arXiv - AI · 4 min · 4 days ago

Machine Learning

[2402.01703] Community-Informed AI Models for Police Accountability

Abstract page for arXiv paper 2402.01703: Community-Informed AI Models for Police Accountability

arXiv - Machine Learning · 4 min · 4 days ago

Previous Page 3 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Computer Vision

Top This Week

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

All Content

[2603.21661] Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-Stage Pseudo-Rain Synthesis

[2603.21566] CataractSAM-2: A Domain-Adapted Model for Anterior Segment Surgery Segmentation and Scalable Ground-Truth Annotation

[2603.21213] Positional Segmentor-Guided Counterfactual Fine-Tuning for Spatially Localized Image Synthesis

[2603.21071] CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels

[2603.20920] Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computational Processing

[2603.20898] Natural Gradient Descent for Online Continual Learning

[2603.20860] Restoring Neural Network Plasticity for Faster Transfer Learning

[2603.20836] MERIT: Multi-domain Efficient RAW Image Translation

[2603.20777] OmniPatch: A Universal Adversarial Patch for ViT-CNN Cross-Architecture Transfer in Semantic Segmentation

[2603.20729] Weakly supervised multimodal segmentation of acoustic borehole images with depth-aware cross-attention

[2603.20697] Satellite-to-Street: Synthesizing Post-Disaster Views from Satellite Imagery via Generative Vision Models

[2603.20292] HSI Image Enhancement Classification Based on Knowledge Distillation: A Study on Forgetting

I curated an 'Awesome List' for Generative AI in Jewelry- papers, datasets, open-source models and tools included!

[N] Understanding & Fine-tuning Vision Transformers

[2603.14579] Medical Image Spatial Grounding with Semantic Sampling

[2603.19759] Growing Networks with Autonomous Pruning

[2603.20021] ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severity Estimation of Lesions in Invasive Coronary Angiography Images

[2603.17470] VirPro: Visual-referred Probabilistic Prompt Learning for Weakly-Supervised Monocular 3D Detection

[2507.16214] Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers

[2402.01703] Community-Informed AI Models for Police Accountability

Related Topics

Stay updated with AI News