Computer Vision

Image recognition, detection, and visual AI

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

arXiv - Machine Learning · 4 min · about 12 hours ago

Machine Learning

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

arXiv - AI · 4 min · about 12 hours ago

Computer Vision

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...

arXiv - AI · 4 min · about 12 hours ago

All Content

Machine Learning

[2311.16157] GeoTop: Advancing Image Classification with Geometric-Topological Analysis

Abstract page for arXiv paper 2311.16157: GeoTop: Advancing Image Classification with Geometric-Topological Analysis

arXiv - Machine Learning · 4 min · 22 days ago

Machine Learning

[2505.16985] Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

Abstract page for arXiv paper 2505.16985: Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

arXiv - AI · 4 min · 22 days ago

Machine Learning

[2510.15202] A Geometry-Based View of Mahalanobis OOD Detection

Abstract page for arXiv paper 2510.15202: A Geometry-Based View of Mahalanobis OOD Detection

arXiv - Machine Learning · 4 min · 22 days ago

Machine Learning

[2312.17505] Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion

Abstract page for arXiv paper 2312.17505: Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion

arXiv - AI · 4 min · 22 days ago

Nlp

[2603.04321] SPRINT: Semi-supervised Prototypical Representation for Few-Shot Class-Incremental Tabular Learning

Abstract page for arXiv paper 2603.04321: SPRINT: Semi-supervised Prototypical Representation for Few-Shot Class-Incremental Tabular Lear...

arXiv - AI · 3 min · 22 days ago

Machine Learning

[2603.04024] Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus for Ambiguous Medical Image Segmentation

Abstract page for arXiv paper 2603.04024: Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus ...

arXiv - AI · 4 min · 22 days ago

Llms

[2603.04002] Discriminative Perception via Anchored Description for Reasoning Segmentation

Abstract page for arXiv paper 2603.04002: Discriminative Perception via Anchored Description for Reasoning Segmentation

arXiv - AI · 4 min · 22 days ago

Machine Learning

[2603.03989] When Visual Evidence is Ambiguous: Pareidolia as a Diagnostic Probe for Vision Models

Abstract page for arXiv paper 2603.03989: When Visual Evidence is Ambiguous: Pareidolia as a Diagnostic Probe for Vision Models

arXiv - AI · 4 min · 22 days ago

Llms

[2603.03983] GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

Abstract page for arXiv paper 2603.03983: GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

arXiv - AI · 3 min · 22 days ago

Llms

[2603.03583] ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer

Abstract page for arXiv paper 2603.03583: ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer

arXiv - Machine Learning · 3 min · 22 days ago

Generative Ai

[2603.03971] Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI

Abstract page for arXiv paper 2603.03971: Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI

arXiv - AI · 4 min · 22 days ago

Machine Learning

[2603.03350] Automated Measurement of Geniohyoid Muscle Thickness During Speech Using Deep Learning and Ultrasound

Abstract page for arXiv paper 2603.03350: Automated Measurement of Geniohyoid Muscle Thickness During Speech Using Deep Learning and Ultr...

arXiv - Machine Learning · 3 min · 22 days ago

Machine Learning

[2603.03806] Separators in Enhancing Autoregressive Pretraining for Vision Mamba

Abstract page for arXiv paper 2603.03806: Separators in Enhancing Autoregressive Pretraining for Vision Mamba

arXiv - AI · 3 min · 22 days ago

Machine Learning

[2603.04359] Dissecting Quantization Error: A Concentration-Alignment Perspective

Abstract page for arXiv paper 2603.04359: Dissecting Quantization Error: A Concentration-Alignment Perspective

arXiv - AI · 3 min · 22 days ago

Computer Vision

[2603.03654] Field imaging framework for morphological characterization of aggregates with computer vision: Algorithms and applications

Abstract page for arXiv paper 2603.03654: Field imaging framework for morphological characterization of aggregates with computer vision: ...

arXiv - AI · 4 min · 22 days ago

Llms

[2603.03637] Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial Instructions

Abstract page for arXiv paper 2603.03637: Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial I...

arXiv - AI · 3 min · 22 days ago

Llms

[2603.03371] Sleeper Cell: Injecting Latent Malice Temporal Backdoors into Tool-Using LLMs

Abstract page for arXiv paper 2603.03371: Sleeper Cell: Injecting Latent Malice Temporal Backdoors into Tool-Using LLMs

arXiv - AI · 4 min · 22 days ago

Computer Vision

[2603.03342] Cryo-SWAN: the Multi-Scale Wavelet-decomposition-inspired Autoencoder Network for molecular density representation of molecular volumes

Abstract page for arXiv paper 2603.03342: Cryo-SWAN: the Multi-Scale Wavelet-decomposition-inspired Autoencoder Network for molecular den...

arXiv - AI · 4 min · 22 days ago

Computer Vision

[2603.03315] M-QUEST -- Meme Question-Understanding Evaluation on Semantics and Toxicity

Abstract page for arXiv paper 2603.03315: M-QUEST -- Meme Question-Understanding Evaluation on Semantics and Toxicity

arXiv - Machine Learning · 4 min · 22 days ago

Machine Learning

[2601.15133] Graph Recognition via Subgraph Prediction

Abstract page for arXiv paper 2601.15133: Graph Recognition via Subgraph Prediction

arXiv - Machine Learning · 3 min · 23 days ago

Previous Page 5 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Computer Vision

Top This Week

[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection

[2508.00307] Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD

[2603.25524] CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations in the wild

All Content

[2311.16157] GeoTop: Advancing Image Classification with Geometric-Topological Analysis

[2505.16985] Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

[2510.15202] A Geometry-Based View of Mahalanobis OOD Detection

[2312.17505] Catch Me If You Can Describe Me: Open-Vocabulary Camouflaged Instance Segmentation with Diffusion

[2603.04321] SPRINT: Semi-supervised Prototypical Representation for Few-Shot Class-Incremental Tabular Learning

[2603.04024] Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus for Ambiguous Medical Image Segmentation

[2603.04002] Discriminative Perception via Anchored Description for Reasoning Segmentation

[2603.03989] When Visual Evidence is Ambiguous: Pareidolia as a Diagnostic Probe for Vision Models

[2603.03983] GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

[2603.03583] ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer

[2603.03971] Upholding Epistemic Agency: A Brouwerian Assertibility Constraint for Responsible AI

[2603.03350] Automated Measurement of Geniohyoid Muscle Thickness During Speech Using Deep Learning and Ultrasound

[2603.03806] Separators in Enhancing Autoregressive Pretraining for Vision Mamba

[2603.04359] Dissecting Quantization Error: A Concentration-Alignment Perspective

[2603.03654] Field imaging framework for morphological characterization of aggregates with computer vision: Algorithms and applications

[2603.03637] Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial Instructions

[2603.03371] Sleeper Cell: Injecting Latent Malice Temporal Backdoors into Tool-Using LLMs

[2603.03342] Cryo-SWAN: the Multi-Scale Wavelet-decomposition-inspired Autoencoder Network for molecular density representation of molecular volumes

[2603.03315] M-QUEST -- Meme Question-Understanding Evaluation on Semantics and Toxicity

[2601.15133] Graph Recognition via Subgraph Prediction

Related Topics

Stay updated with AI News