[2506.22504] Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2506.22504: Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion Detection
Abstract page for arXiv paper 2508.00307: Acoustic Imaging for Low-SNR UAV Detection: Dense Beamformed Energy Maps and U-Net SELD
Abstract page for arXiv paper 2603.25524: CHIRP dataset: towards long-term, individual-level, behavioral monitoring of bird populations i...
This article presents PathVis, a mixed-reality platform designed to enhance digital pathology workflows by integrating multimodal AI and ...
The paper introduces ViT-Linearizer, a framework that distills knowledge from Vision Transformers (ViTs) into efficient linear-time model...
This paper presents MixCache, a novel caching framework designed to enhance the efficiency of text-to-video diffusion models, significant...
LayerT2V presents a novel framework for multi-layer video generation, enabling the creation of editable video layers that enhance profess...
The paper presents Dual-IPO, a novel framework for optimizing text-to-video generation by iteratively improving both the reward and video...
LinGuinE introduces a novel framework for longitudinal volumetric tumor segmentation, enhancing tracking and mask generation across multi...
The paper presents MomentMix, a novel augmentation technique using Length-Aware DETR to enhance video moment retrieval, particularly for ...
This paper introduces a framework for open vocabulary object detection that allows vision language models to identify and learn novel obj...
This paper presents a novel framework for one-shot learning in computer vision, utilizing Abstracted Gaussian Prototypes to enhance image...
The paper introduces SeeThrough3D, a model for occlusion-aware 3D control in text-to-image generation, enhancing the realism of synthesiz...
This paper presents a novel bitwise systolic array architecture designed for runtime-reconfigurable multi-precision quantized multiplicat...
The paper presents GUIPruner, a framework for enhancing the efficiency of high-resolution GUI agents by addressing spatiotemporal redunda...
ColoDiff introduces a novel framework for generating colonoscopy videos that ensures dynamic consistency and content awareness, addressin...
The paper presents a novel framework, Motif-based Continuous Dynamics (MCD), to model animal behavior by identifying continuous motor mot...
The paper presents Latent Gaussian Splatting (LaGS) for 4D panoptic occupancy tracking, enhancing robot perception in dynamic environment...
This article presents Fase3D, an innovative encoder-free Fourier-based model for processing 3D multimodal data, enhancing efficiency and ...
This article reviews adversarial transferability in image classification, proposing a standardized framework for evaluating transfer-base...
The article presents MM-NeuroOnco, a comprehensive dataset aimed at improving MRI-based brain tumor diagnosis through multimodal instruct...
This paper presents Stepwise Diffusion Policy Optimization (SDPO), a novel reinforcement learning framework designed to enhance few-step ...
This paper presents a novel approach to medical image reconstruction using Dual-Coupled Plug-and-Play Diffusion, addressing limitations i...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime