[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...
AeroDGS presents a novel framework for 4D reconstruction from monocular UAV videos, addressing challenges in depth ambiguity and motion e...
This article presents a novel deep learning approach for accurately solving the geodesic problem on continuous surfaces, achieving third-...
This article discusses the application of foundation models in histopathology, highlighting a novel approach that improves robustness and...
This paper presents a novel approach to reconstruct audio and images from clipped measurements using self-supervised learning, addressing...
The paper introduces SOTAlign, a semi-supervised framework for aligning unimodal vision and language models using minimal paired data and...
CryoNet.Refine introduces a one-step diffusion model for efficiently refining structural models using cryo-EM density maps, offering a si...
This article presents a novel approach for unsupervised denoising of diffusion-weighted images (dMRI) by addressing noise bias and varian...
RhythmBERT is a novel self-supervised language model designed for ECG waveform analysis, enhancing heart disease detection by treating EC...
The CXReasonAgent integrates large language models with diagnostic tools for improved reasoning in chest X-ray interpretations, addressin...
This paper presents a novel approach to semantic image communication in IoT networks using a doubly adaptive channel and spatial attentio...
The paper presents GeoPerceive, a benchmark for evaluating geometric perception in vision-language models (VLMs), and introduces GeoDPO, ...
The paper introduces Certified Circuits, a framework that enhances the stability and accuracy of circuit discovery in neural networks, ad...
FactGuard introduces an innovative framework for detecting video misinformation using reinforcement learning, enhancing the capabilities ...
The paper presents pQuant, a novel approach for low-bit language models that utilizes decoupled linear quantization-aware training to enh...
LUMOS introduces an innovative framework for scientific machine learning (SciML) that simplifies model design by integrating feature sele...
This paper introduces Space Syntax-guided Post-training (SSPT) for enhancing residential floor plan generation by integrating architectur...
BrepCoder is a unified multimodal large language model designed for multi-task reasoning in Computer-Aided Design (CAD), specifically uti...
The paper introduces Entropy-Controlled Flow Matching (ECFM), a method that optimizes flow matching in machine learning by controlling in...
Google has launched the Nano Banana 2 model, enhancing image generation capabilities with faster processing and improved realism, now def...
Google's Nano Banana 2 introduces advanced AI image generation tools to free users, enhancing capabilities previously exclusive to paid s...