Computer Vision

Image recognition, detection, and visual AI

Top This Week

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Machine Learning

arXiv - Machine Learning · 4 min ·
[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Machine Learning

arXiv - AI · 4 min ·
[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild
Computer Vision

arXiv - AI · 4 min ·

All Content

[2510.27315] CASR-Net: An Image Processing-focused Deep Learning-based Coronary Artery Segmentation and Refinement Network for X-ray Coronary Angiogram
Machine Learning

arXiv - AI · 4 min ·
[2603.03075] TinyIceNet: Low-Power SAR Sea Ice Segmentation for On-Board FPGA Inference
Machine Learning

arXiv - AI · 4 min ·
[2603.02958] Layer-wise QUBO-Based Training of CNN Classifiers for Quantum Annealing
Machine Learning

arXiv - AI · 4 min ·
[2603.02533] Functional Properties of the Focal-Entropy
Computer Vision

arXiv - Machine Learning · 3 min ·
[2603.02789] OCR or Not? Rethinking Document Information Extraction in the MLLMs Era with Real-World Large-Scale Datasets
LLMs

arXiv - AI · 3 min ·
[2603.02483] Geometric structures and deviations on James' symmetric positive-definite matrix bicone domain
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.02475] Large-Scale Dataset and Benchmark for Skin Tone Classification in the Wild
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.02704] Intelligent Pathological Diagnosis of Gestational Trophoblastic Diseases via Visual-Language Deep Learning Model
Machine Learning

arXiv - AI · 4 min ·
[2603.03234] Guiding Sparse Neural Networks with Neurobiological Principles to Elicit Biologically Plausible Representations
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.02286] Beyond Prompt Degradation: Prototype-guided Dual-pool Prompting for Incremental Object Detection
Computer Vision

arXiv - AI · 4 min ·
[2603.03043] IoUCert: Robustness Verification for Anchor-based Object Detectors
Computer Vision

arXiv - Machine Learning · 3 min ·
Computer Vision

[R] Boundary-Metric Evaluation for Thin-Structure Segmentation under 2% Foreground Sparsity

Hey! I'm currently an undergrad student graduating in May and soon starting my Master's in AI. I've wanted to write a research paper to sta...

Reddit - Machine Learning · 1 min ·
LLMs

[D] frontier models are a zero sum game for a few tasks - what they gain in reasoning they lose in your specific thing

when Google shipped Gemini 3 last November, it set new benchmarks on reasoning and coding. but it also removed pixel-level image segmenta...

Reddit - Machine Learning · 1 min ·
[2509.22240] COMPASS: Robust Feature Conformal Prediction for Medical Segmentation Metrics
Machine Learning

arXiv - Machine Learning · 4 min ·
[2601.08133] How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation?
Computer Vision

arXiv - AI · 4 min ·
[2601.04786] AgentOCR: Reimagining Agent History via Optical Self-Compression
LLMs

arXiv - Machine Learning · 4 min ·
[2507.15852] Advancing Complex Video Object Segmentation via Progressive Concept Construction
LLMs

arXiv - AI · 4 min ·
[2506.10941] VINCIE: Unlocking In-context Image Editing from Video
Machine Learning

arXiv - Machine Learning · 4 min ·
[2505.16017] GradPCA: Leveraging NTK Alignment for Reliable Out-of-Distribution Detection
Machine Learning

arXiv - Machine Learning · 3 min ·
[2506.06719] Improving Wildlife Out-of-Distribution Detection: Africas Big Five
Machine Learning

arXiv - AI · 4 min ·