[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...
The paper presents FairQuant, a framework for fairness-aware mixed-precision quantization in medical image classification, optimizing bot...
The paper presents AMLRIS, a novel training strategy for Referring Image Segmentation (RIS) that enhances object segmentation through ali...
The paper introduces SubspaceAD, a training-free method for few-shot anomaly detection that utilizes subspace modeling to achieve state-o...
The paper presents SoPE, a novel Spherical Coordinate-Based Positional Embedding method aimed at improving the spatial perception capabil...
The paper presents pMoE, a novel Mixture-of-Experts prompt tuning method that enhances visual adaptation by integrating diverse domain kn...
The paper introduces SUPERGLASSES, a benchmark for evaluating Vision Language Models (VLMs) in AI smart glasses, addressing the limitatio...
ViCLIP-OT introduces a novel vision-language model tailored for Vietnamese image-text retrieval, outperforming existing models in low-res...
This paper presents a novel approach to instruction-based image editing by integrating planning, reasoning, and generation through a mult...
The paper presents CGSA, a novel framework for Source-Free Domain Adaptive Object Detection that integrates object-centric learning to en...
BetterScene introduces an innovative approach to 3D scene synthesis, enhancing novel view synthesis quality using sparse photos and a rep...
The paper discusses the evaluation challenges in text-to-image generation, focusing on classifier-free guidance (CFG) and proposing a new...
The paper presents Quality-Aware Robust Multi-View Clustering (QARMVC), a novel framework addressing the challenges of heterogeneous obse...
DrivePTS introduces a progressive learning framework for generating diverse driving scenes, enhancing fidelity and controllability in aut...
DisQ-HNet introduces a novel framework for synthesizing tau-PET images from MRI scans, enhancing interpretability and preserving anatomic...
HARU-Net introduces a novel deep learning architecture for denoising cone-beam computed tomography (CBCT) images, enhancing edge preserva...
The paper presents SignVLA, a novel gloss-free Vision-Language-Action framework for real-time robotic manipulation guided by sign languag...
This paper introduces Spatial Credit Redistribution (SCR) to address hallucinations in vision-language models by redistributing activatio...
TopoEdit presents a novel approach for fast post-optimization editing of topology optimized structures, enhancing mechanical performance ...
The paper introduces SimpleOCR, a method to enhance Multimodal Large Language Models (MLLMs) by rendering visualized questions, addressin...
This article presents a novel deep learning framework for predicting malignancy in renal tumors using 3D CT images, eliminating the need ...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime