[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Alignment, bias, regulation, and responsible AI
Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Abstract page for arXiv paper 2510.18196: Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge
Abstract page for arXiv paper 2509.23435: AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models
This paper explores how physical embodiment in artificial agents can enhance their ability to generalize and provide care in uncertain en...
The paper introduces Batch-CAM, a training framework for convolutional deep learning models that enhances interpretability by aligning mo...
This article surveys hypergame theory, focusing on modeling misaligned perceptions and nested beliefs in multi-agent systems, highlightin...
This article presents a formal analysis of AI consciousness denial, revealing that self-reports of consciousness by AI systems are episte...
This article presents a novel approach to network incident response using a large language model (LLM) that autonomously learns and adapt...
The paper presents SCOPE, a framework for selective pairwise evaluation using large language models (LLMs) that improves judgment accurac...
This paper explores the emergence of 'cyborg propaganda,' where human and AI collaboration reshapes collective action, blurring lines bet...
The paper explores EXCODER, a method for explainable classification of discrete time series representations, enhancing interpretability w...
The paper introduces Diverging Flows, a method for detecting extrapolations in conditional generation models, enhancing safety in applica...
The paper presents Curriculum-DPO++, an advanced method for text-to-image generation that optimizes preference learning through a dual cu...
This study investigates the reliability of AI in detecting cognitive impairment among multilingual English speakers in the UK, revealing ...
This paper presents a strategic framework for governments to decide between buying or building large language models (LLMs) for public se...
This paper presents a unified framework for bio-inspired models that enhances interpretability in recurrent neural networks (RNNs) throug...
This paper presents a method for detecting object tracking failures using sequential hypothesis testing, enhancing safety in computer vis...
The paper introduces the Variation Calibration Error (VCE) metric, extending confidence calibration methods in machine learning to assess...
The RGAlign-Rec framework enhances proactive intent prediction in e-commerce chatbots by aligning latent query reasoning with ranking obj...
This article presents a novel ultrasound-guided method for real-time 3D visualization of spinal motion to assess spinal instability, aimi...
This paper evaluates the robustness of object detection models used in autonomous vehicles under adverse weather conditions, proposing a ...
The paper presents RADAR, a novel evaluation framework for Multi-modal Large Language Models (MLLMs) that addresses performance bottlenec...
The article explores design requirements for generative social robots in higher education, emphasizing the need for knowledge-based frame...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime