[2602.09678] Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2601.13622: CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language...
Abstract page for arXiv paper 2603.26551: Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones
This article introduces the concept of multimodal prompt optimization for Multimodal Large Language Models (MLLMs), proposing a new frame...
This article presents a novel inference-time search algorithm that enhances diffusion-based image reconstruction by utilizing side inform...
The MCIF benchmark introduces a novel framework for evaluating multimodal crosslingual instruction-following capabilities in large langua...
This article presents a novel approach using an Attention-Enhanced U-Net for the automatic segmentation of COVID-19 infected lung regions...
The paper presents Cert-SSBD, a novel method for defending against backdoor attacks in deep neural networks by optimizing noise levels sp...
This paper explores the integration of Self-Organizing Maps (SOMs) with Vision Transformers (ViTs) to enhance performance on small datase...
This paper presents a knowledge distillation approach for Multi-View 3D reconstruction, utilizing a teacher-student model framework to en...
The paper presents Bongard-RWR+, a dataset designed to enhance fine-grained visual reasoning in Bongard Problems using real-world images ...
This paper explores causal explanations in image classification, demonstrating their formal properties and computability, while introduci...
The paper presents advancements in Image Copy Detection (ICD) by introducing PixTrace and CopyNCE, enhancing feature representation and i...
This article provides a comprehensive overview of optical remote sensing, highlighting advancements in computer vision and drone technolo...
The paper presents SpectralGCD, a novel approach for Generalized Category Discovery (GCD) that enhances multimodal learning by efficientl...
The paper presents ASTERIS, a self-supervised spatiotemporal denoising algorithm that enhances detection limits in astronomical imaging, ...
This paper presents a novel approach to 3D scene rendering using multimodal Gaussian splatting, integrating RF sensing for improved accur...
The paper presents FLoRG, a federated fine-tuning framework that utilizes low-rank Gram matrices and Procrustes alignment to enhance the ...
The paper presents DDiT, a novel approach for dynamic patch scheduling in diffusion transformers, enhancing efficiency in image and video...
The paper presents Xray-Visual, a novel vision model architecture designed for large-scale image and video understanding, utilizing exten...
This paper evaluates the reliability of Mamba, a state-space model, for medical imaging under various attack scenarios, highlighting vuln...
This article presents a hybrid federated learning model that combines SWIN Transformer and CNN for diagnosing lung diseases, particularly...
The paper presents a novel framework integrating formal verification with deep learning for improved image retrieval, addressing the limi...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime