[2602.09678] Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Image recognition, detection, and visual AI
Abstract page for arXiv paper 2602.09678: Administrative Law's Fourth Settlement: AI and the Capability-Accountability Trap
Abstract page for arXiv paper 2601.13622: CARPE: Context-Aware Image Representation Prioritization via Ensemble for Large Vision-Language...
Abstract page for arXiv paper 2603.26551: Beyond MACs: Hardware Efficient Architecture Design for Vision Backbones
The paper presents ADAMAB, a novel framework for efficient embedding calibration in few-shot pattern recognition, leveraging adaptive dat...
The paper presents Fore-Mamba3D, a novel approach for 3D object detection that enhances foreground encoding while addressing limitations ...
The paper presents Vid2Sid, a novel video-driven system identification pipeline that enhances the calibration of robot simulators by anal...
The paper 'MentalBlackboard' evaluates spatial visualization capabilities of Vision-Language Models (VLMs) through mathematical transform...
The paper presents a novel method for controlled face manipulation to augment data for facial expression analysis, addressing label scarc...
This paper presents a novel down-sampling strategy called Stair Pooling for U-Net architectures, aimed at enhancing precision in biomedic...
FinSight-Net introduces a physics-aware framework for underwater fish detection, improving accuracy while reducing computational overhead...
The paper presents CaReFlow, a novel approach for multimodal fusion that addresses modality gaps using cyclic adaptive rectified flow, en...
Ani3DHuman presents a novel framework for photorealistic 3D human animation, combining kinematics-based methods with video diffusion prio...
The paper presents UP-Fuse, an innovative framework for LiDAR-camera fusion that enhances 3D panoptic segmentation by addressing sensor d...
The paper presents MultiDiffSense, a diffusion-based model for generating visuo-tactile images conditioned on object shape and contact po...
The paper presents a novel method for non-invasive grading of prostate cancer using micro-ultrasound, leveraging knowledge distillation f...
The article presents RetinaVision, a deep learning framework for accurate classification of retinal diseases using optical coherence tomo...
The paper presents US-JEPA, a novel self-supervised framework for medical ultrasound imaging that enhances representation learning by pre...
The paper presents IPv2, an enhanced image purification strategy for improving lung CT denoising at ultra-low doses, addressing limitatio...
The paper presents TIACam, a novel framework for camera-robust zero-watermarking that utilizes text-anchored invariant feature learning w...
The paper presents LAVIDA, a novel zero-shot video anomaly detection framework that utilizes a Multimodal Large Language Model to enhance...
The paper presents WiCompass, a framework for improving mmWave human pose estimation by focusing on data coverage rather than brute-force...
The paper presents a novel unified pushing policy that utilizes visual prompts to enhance the efficiency and versatility of robotic pushi...
FUSAR-GPT is a novel visual language model designed for interpreting SAR imagery, enhancing performance through spatiotemporal feature em...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime