[2602.09678] Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2601.13622: CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language...
Abstract page for arXiv paper 2603.26551: Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones
The MMS-VPR paper introduces a comprehensive multimodal dataset for street-level visual place recognition, addressing gaps in existing da...
This paper presents a novel 3D data Analysis Optimization Pipeline that utilizes Bayesian Optimization to enhance segmentation and classi...
The paper introduces MINT, a framework for optimizing large language models (LLMs) using multimodal biomedical data to enhance predictive...
This paper presents a machine learning-based pipeline for automated segmentation and classification of vessels in Intracoronary Optical C...
The paper presents a novel dynamic training-free fusion framework for combining subject and style LoRAs in generative models, enhancing c...
The paper presents RPT-SR, a novel transformer architecture designed for infrared image super-resolution, addressing inefficiencies in ex...
The paper introduces Terminal Velocity Matching (TVM), a novel approach to generative modeling that enhances performance in one- and few-...
This article evaluates self-supervised learning models for cardiac ultrasound view classification, comparing USF-MAE and MoCo v3 using th...
The paper introduces Sparrow, a novel framework designed to enhance speculative decoding in Video Large Language Models (Vid-LLMs) by opt...
This article explores how visual-language models (VLMs) make decisions based on image inputs, introducing a framework to analyze their pr...
This article presents a comprehensive study on training long-context visual document models, achieving state-of-the-art performance in vi...
The paper presents a novel framework, Just KIDDIN, that combines Knowledge Distillation and knowledge infusion to improve the detection o...
This paper presents a novel approach for classifying and localizing ovarian cancer subtypes using weakly supervised learning techniques, ...
The Dex4D framework enables task-agnostic dexterous manipulation by using simulation to learn generalist policies that can be applied to ...
GRAFNet introduces a novel architecture for polyp segmentation in colonoscopy, enhancing accuracy through biologically inspired multi-sca...
The paper presents LoRWeB, a novel approach to visual analogy learning that enhances image manipulation by dynamically selecting and weig...
The paper introduces the Vision Wormhole, a framework for enabling efficient latent-space communication in heterogeneous multi-agent syst...
The paper presents GMAIL, a novel framework for aligning generated images with real images in machine learning, enhancing performance in ...
The article presents CARE Drive, a framework for evaluating the reason-responsiveness of vision language models in automated driving, add...
This paper analyzes how multimodal Transformers integrate visual and linguistic information, revealing a layer-wise evolution of predicti...