[2602.09678] Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2601.13622: CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language...
Abstract page for arXiv paper 2603.26551: Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones
This article evaluates how Text-to-Image diffusion models represent historical contexts, introducing a benchmark to assess their accuracy...
This article explores how AI agents imitating human content affect information diversity, revealing context-dependent outcomes in homogen...
The paper introduces Soft-CAM, a method that enhances the interpretability of convolutional neural networks (CNNs) in medical image analy...
The paper introduces 'Visual Planning', a new paradigm that utilizes images for reasoning in spatial tasks, enhancing planning capabiliti...
This article presents a novel approach to data-efficient inference of neural fluid fields using SciML foundation models, demonstrating si...
The paper introduces SUNLayer, a theoretical framework for stable denoising using generative networks, focusing on activation functions a...
The paper discusses Latent Equivariant Operators as a novel approach to enhance object recognition in computer vision, addressing challen...
This paper presents a novel approach to generating causal explanations for image classifiers, introducing a black-box algorithm grounded ...
This article presents a theoretical analysis of Quantum Extreme Learning Machines (QELMs) using the Pauli-transfer matrix approach, highl...
This paper presents a quantum feature extraction method that enhances multi-class image classification for satellite applications, achiev...
The paper presents Zero-Shot Interactive Perception (ZS-IP), a framework that enhances robotic manipulation through a memory-driven Visio...
This paper investigates the adversarial robustness of discrete image tokenizers, highlighting their vulnerabilities and proposing a novel...
This article presents a high-resolution framework for soil moisture estimation using multimodal Earth observation data, highlighting the ...
CityGuard introduces a novel framework for privacy-preserving identity retrieval across urban surveillance cameras, addressing challenges...
ZACH-ViT introduces a novel Vision Transformer architecture tailored for medical imaging, enhancing performance by removing fixed spatial...
The paper presents RamanSeg, an interpretable deep learning model for analyzing Raman spectra in cancer diagnosis, achieving significant ...
The paper presents TopoGate, a model designed to enhance new-lesion prediction in longitudinal low-dose CT scans by integrating quality-a...
The paper introduces OODBench, a benchmark for evaluating large vision-language models' performance on out-of-distribution (OOD) data, hi...
DohaScript introduces a large-scale dataset for continuous handwritten Hindi text, addressing the lack of diverse and high-quality resour...
The paper introduces the Video Query Performance Prediction (VQPP) benchmark, addressing a gap in query performance prediction for video ...