[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...
Niantic is leveraging spatial intelligence to enhance industrial applications, focusing on integrating augmented reality with real-world ...
The article discusses the author's successful use of an AI geolocation tool to pinpoint the exact coordinates of a blurry image from the ...
This paper examines the effectiveness of visual navigation policies trained in simulation versus those trained with real-world data, high...
MedicalPatchNet introduces a self-explainable AI architecture for chest X-ray classification, enhancing interpretability while maintainin...
WebGym is an innovative open-source environment designed for training visual web agents, featuring nearly 300,000 tasks and a high-throug...
The paper presents NTK-Guided Implicit Neural Teaching (NINT), a method that accelerates training of Implicit Neural Representations (INR...
The paper presents OC-STORM, an object-centric model-based reinforcement learning framework that enhances sample efficiency by leveraging...
The GFPL framework enhances federated learning by addressing data imbalance and communication overhead in resource-constrained vision tas...
This paper presents a novel approach to image reconstruction using spatially adaptive sparsity level maps within convolutional dictionari...
This article presents a U-Net based deep learning architecture for segmenting brain tumors in MRI scans, focusing on the often-overlooked...
The paper introduces PSF-Med, a benchmark assessing paraphrase sensitivity in medical vision language models, revealing significant varia...
The paper presents MMLoP, a framework for efficient vision-language adaptation using low-rank prompting, achieving high accuracy with sig...
This paper explores the robustness of sparse artificial neural networks with adaptive topology, demonstrating their competitive performan...
This article presents a novel approach using a Deep Clustering based Boundary-Decoder Net for predicting inter and intra-layer stress in ...
The paper introduces a novel attack method, Coherence-Preserving Semantic Injection (CSI), that exploits vulnerabilities in semantic-awar...
This study evaluates the performance of foundation models in detecting abdominal trauma, revealing that specificity deficits are influenc...
This paper presents a novel approach to monocular normal estimation by reformulating the problem as shading sequence estimation, enhancin...
The paper introduces LatentLens, a method for mapping visual tokens to natural language descriptions in Vision-Language Models (VLMs), en...
The paper presents FigEx2, a framework for detecting and captioning panels in scientific compound figures, enhancing understanding and ac...
This article presents a unified framework for Aerial Vision-Language Navigation (VLN), enabling UAVs to interpret natural language and na...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime