[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...
This study presents a methodology for mapping and predicting chlorophyll-a levels in the Mar Menor Lagoon using C2RCC-processed Sentinel ...
This paper presents a novel approach to image transmission using multi-hop deep joint source-channel coding (DeepJSCC) combined with deep...
This paper evaluates the robustness of Vision-Language-Action (VLA) models against various multi-modal perturbations, proposing a new met...
The paper introduces Proportionate Credit Policy Optimization (PCPO), a novel framework aimed at improving the stability and quality of t...
This paper presents a novel approach to infrared small target detection and segmentation (IRSTDS) by introducing a noise-suppression feat...
This article presents a novel approach to active view selection (AVS) for 3D reconstruction using neural uncertainty maps, significantly ...
HoloLLM introduces a Multimodal Large Language Model that enhances human sensing and reasoning by integrating diverse sensory inputs, out...
The paper presents a novel LiDAR-camera fusion framework for real-time 3D dynamic object detection and trajectory prediction, enhancing s...
The paper presents MoEMba, a novel framework utilizing Mamba-based Mixture of Experts for enhancing high-density EMG-based hand gesture r...
The paper presents CS-Aligner, a novel framework for vision-language alignment that integrates Cauchy-Schwarz divergence with mutual info...
XMorph presents a novel framework for explainable brain tumor analysis, achieving 96% accuracy while addressing interpretability and comp...
The paper introduces VAUQ, a framework for vision-aware uncertainty quantification in large vision-language models (LVLMs), enhancing sel...
MIP Candy is a modular framework built on PyTorch for medical image processing, offering a flexible pipeline for data handling, training,...
The paper presents CrystaL, a novel framework for Multimodal Large Language Models (MLLMs) that enhances visual understanding by crystall...
This paper presents MMHNet, a novel multimodal hierarchical network that enhances video-to-audio generation by enabling models to general...
This paper presents a novel approach to brain lesion segmentation in MRI scans using report-supervised learning, enhancing accuracy by in...
This paper presents a novel system that integrates depth camera measurements and deep learning for accurate distance estimation in UAV-as...
This paper presents ArtiAgent, a novel approach to automate the creation of artifact-annotated datasets for training visual language mode...
Airavat introduces an innovative framework for automating Internet measurement workflows, ensuring both generation and verification again...
OrthoDiffusion is a novel diffusion-based model designed for multi-task interpretation of musculoskeletal MRI scans, improving diagnostic...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime