Computer Vision

Image recognition, detection, and visual AI

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

arXiv - AI · 4 min · about 5 hours ago

Computer Vision

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...

arXiv - AI · 4 min · about 5 hours ago

All Content

Machine Learning

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

arXiv - AI · 4 min · about 5 hours ago

Computer Vision

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2603.25170] Knowledge-Guided Adversarial Training for Infrared Object Detection via Thermal Radiation Modeling

Abstract page for arXiv paper 2603.25170: Knowledge-Guided Adversarial Training for Infrared Object Detection via Thermal Radiation Modeling

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

Abstract page for arXiv paper 2603.25109: MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

arXiv - AI · 4 min · about 5 hours ago

Computer Vision

[2603.25091] Pixelis: Reasoning in Pixels, from Seeing to Acting

Abstract page for arXiv paper 2603.25091: Pixelis: Reasoning in Pixels, from Seeing to Acting

arXiv - AI · 4 min · about 5 hours ago

Llms

[2603.25687] On Neural Scaling Laws for Weather Emulation through Continual Training

Abstract page for arXiv paper 2603.25687: On Neural Scaling Laws for Weather Emulation through Continual Training

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2603.24801] Dissecting Model Failures in Abdominal Aortic Aneurysm Segmentation through Explainability-Driven Analysis

Abstract page for arXiv paper 2603.24801: Dissecting Model Failures in Abdominal Aortic Aneurysm Segmentation through Explainability-Driv...

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

[2603.24753] Light Cones For Vision: Simple Causal Priors For Visual Hierarchy

Abstract page for arXiv paper 2603.24753: Light Cones For Vision: Simple Causal Priors For Visual Hierarchy

arXiv - Machine Learning · 3 min · about 5 hours ago

Machine Learning

[2603.24695] Amplified Patch-Level Differential Privacy for Free via Random Cropping

Abstract page for arXiv paper 2603.24695: Amplified Patch-Level Differential Privacy for Free via Random Cropping

arXiv - Machine Learning · 4 min · about 5 hours ago

Machine Learning

I built a real-time pipeline that reads game subtitles and converts them into dynamic voice acting (OCR → TTS → RVC) [P]

I've been experimenting with real-time pipelines that combine OCR + TTS + voice conversion, and I ended up building a desktop app that ca...

Reddit - Machine Learning · 1 min · 1 day ago

Machine Learning

[2411.15087] Phrase-Instance Alignment for Generalized Referring Segmentation

Abstract page for arXiv paper 2411.15087: Phrase-Instance Alignment for Generalized Referring Segmentation

arXiv - Machine Learning · 3 min · 1 day ago

Machine Learning

[2511.06767] QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

Abstract page for arXiv paper 2511.06767: QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common P...

arXiv - Machine Learning · 4 min · 1 day ago

Llms

[2603.23831] Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

Abstract page for arXiv paper 2603.23831: Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

arXiv - Machine Learning · 3 min · 1 day ago

Computer Vision

[2603.23574] PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning

Abstract page for arXiv paper 2603.23574: PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning

arXiv - Machine Learning · 4 min · 1 day ago

Computer Vision

Senate Democrats are trying to ‘codify’ Anthropic’s red lines on autonomous weapons and mass surveillance | The Verge

Sen. Adam Schiff (D-CA) is drafting a bill to codify safeguards around the use of AI for autonomous weapons and mass domestic surveillanc...

The Verge - AI · 7 min · 2 days ago

Machine Learning

[2509.02419] From Noisy Labels to Intrinsic Structure: A Geometric-Structural Dual-Guided Framework for Noise-Robust Medical Image Segmentation

Abstract page for arXiv paper 2509.02419: From Noisy Labels to Intrinsic Structure: A Geometric-Structural Dual-Guided Framework for Nois...

arXiv - AI · 4 min · 2 days ago

Machine Learning

[2503.14553] Redefining non-IID Data in Federated Learning for Computer Vision Tasks: Migrating from Labels to Embeddings for Task-Specific Data Distributions

Abstract page for arXiv paper 2503.14553: Redefining non-IID Data in Federated Learning for Computer Vision Tasks: Migrating from Labels ...

arXiv - Machine Learning · 4 min · 2 days ago

Machine Learning

[2603.23356] Contrastive Metric Learning for Point Cloud Segmentation in Highly Granular Detectors

Abstract page for arXiv paper 2603.23356: Contrastive Metric Learning for Point Cloud Segmentation in Highly Granular Detectors

arXiv - AI · 4 min · 2 days ago

Machine Learning

[2603.23030] Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation

Abstract page for arXiv paper 2603.23030: Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic...

arXiv - AI · 4 min · 2 days ago

Page 1 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Computer Vision

Top This Week

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

All Content

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

[2603.25170] Knowledge-Guided Adversarial Training for Infrared Object Detection via Thermal Radiation Modeling

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

[2603.25091] Pixelis: Reasoning in Pixels, from Seeing to Acting

[2603.25687] On Neural Scaling Laws for Weather Emulation through Continual Training

[2603.24801] Dissecting Model Failures in Abdominal Aortic Aneurysm Segmentation through Explainability-Driven Analysis

[2603.24753] Light Cones For Vision: Simple Causal Priors For Visual Hierarchy

[2603.24695] Amplified Patch-Level Differential Privacy for Free via Random Cropping

I built a real-time pipeline that reads game subtitles and converts them into dynamic voice acting (OCR → TTS → RVC) [P]

[2411.15087] Phrase-Instance Alignment for Generalized Referring Segmentation

[2511.06767] QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

[2603.23831] Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

[2603.23574] PoiCGAN: A Targeted Poisoning Based on Feature-Label Joint Perturbation in Federated Learning

Senate Democrats are trying to ‘codify’ Anthropic’s red lines on autonomous weapons and mass surveillance | The Verge

[2509.02419] From Noisy Labels to Intrinsic Structure: A Geometric-Structural Dual-Guided Framework for Noise-Robust Medical Image Segmentation

[2503.14553] Redefining non-IID Data in Federated Learning for Computer Vision Tasks: Migrating from Labels to Embeddings for Task-Specific Data Distributions

[2603.23356] Contrastive Metric Learning for Point Cloud Segmentation in Highly Granular Detectors

[2603.23030] Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation

Related Topics

Stay updated with AI News