[2602.09678] Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2601.13622: CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language...
Abstract page for arXiv paper 2603.26551: Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones
This paper presents Exploration-Exploitation Distillation (E^2D), a method for efficient large-scale dataset distillation that balances a...
This paper presents a novel approach to camera virtualization for sports and visual performances, enabling photorealistic rendering from ...
The paper presents a novel method for detecting annotation errors in video datasets by analyzing loss trajectories, enhancing model train...
The paper introduces Experiment Automation Agents (EAA), a system leveraging vision-language models to automate complex microscopy workfl...
StrokeNeXt introduces a Siamese-encoder model for classifying brain strokes in CT images, achieving high accuracy and low misclassificati...
This paper presents a mathematical model for accurate 2D reconstruction in PET scanners, utilizing an Analytical White Image Model to enh...
The article presents an Attention-Gated U-Net model for semantic segmentation of brain tumors, enhancing treatment planning through impro...
This paper presents a novel method for inverse material design using guided diffusion and optimized loss functions, addressing challenges...
This paper evaluates the out-of-distribution generalization of reasoning in multimodal large language models (LLMs) through a grid-based ...
The paper presents Doubly Stochastic Mean-Shift (DSMS), an innovative clustering algorithm that enhances standard Mean-Shift methods by i...
The paper presents COMPOT, a novel framework for compressing Transformer models using Calibration-Optimized Matrix Procrustes Orthogonali...
This article explores how Vision Language Models (VLMs) enhance performance on text-only tasks by correcting binding shortcuts through vi...
The paper presents a Decoupled Representation Refinement (DRR) paradigm for Implicit Neural Representations (INRs), enhancing speed and f...
Apple is set to launch AI-powered smart glasses, a pendant, and upgraded AirPods, enhancing its AI hardware lineup with features like cam...
This article explores the effectiveness of various AI headshot generators, focusing on the author's experience with Headshot Kiwi, highli...
The paper presents Region-to-Image Distillation, a novel approach to enhance fine-grained multimodal perception in MLLMs by internalizing...
The paper presents HyperDet, a novel framework for 3D object detection using hyper 4D radar point clouds, addressing limitations of tradi...
The paper presents ReaDy-Go, a novel simulation pipeline that enhances visual navigation in dynamic environments by integrating 3D Gaussi...
The paper presents C^2ROPE, an advanced positional encoding method for 3D Large Multimodal Models, addressing limitations of existing Rot...
The paper introduces ShapBPT, a novel method for image feature attributions using data-aware binary partition trees, enhancing interpreta...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime