[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Alignment, bias, regulation, and responsible AI
Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Abstract page for arXiv paper 2510.18196: Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge
Abstract page for arXiv paper 2509.23435: AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models
The paper presents Chimera, a framework that integrates neuro-symbolic attention mechanisms into programmable dataplanes, enhancing traff...
The paper presents Amortized Reasoning Tree Search (ARTS), a novel approach to enhance reasoning in Large Language Models by decoupling p...
RAT-Bench introduces a comprehensive benchmark for evaluating text anonymization tools based on their effectiveness in preventing re-iden...
The paper introduces SQuTR, a new benchmark for evaluating the robustness of spoken query retrieval systems under various acoustic noise ...
MedXIAOHE is a medical vision-language foundation model that enhances medical understanding and reasoning in clinical applications, achie...
The paper introduces IndicFairFace, a balanced dataset aimed at addressing geographical bias in Vision-Language Models (VLMs) by represen...
The paper introduces TensorCommitments, a novel proof-of-inference scheme designed to enhance the security of large language model (LLM) ...
The paper presents Power Interpretable Causal ODE Networks (PICODE), a novel model for explainable anomaly detection and root cause analy...
The paper presents Favia, a forensic agent designed to identify and analyze vulnerability-fixing commits in software repositories, improv...
This article explores how attachment styles and age influence the intimacy users develop with AI companions, challenging the notion that ...
This paper examines the relationship between correctness in mathematical proofs and their epistemic value, arguing that formal correctnes...
This paper presents a novel recovery-based shielding framework for safe reinforcement learning (RL) using Gaussian process dynamics model...
This paper discusses the evolution of large language models (LLMs) into modular agents equipped with skills, emphasizing architecture, ac...
This paper explores how soft contamination in training data affects the evaluation of large language models (LLMs) on benchmarks, reveali...
This paper explores the mechanisms behind the implicit bias in gradient-based training of deep networks, focusing on the scaling and alig...
The paper presents Policy4OOD, a knowledge-guided world model designed to simulate policy interventions against the opioid overdose crisi...
The paper presents a novel scoring formula, Peak + Accumulation, for detecting multi-turn LLM attack patterns, addressing limitations in ...
This article examines how demographic-based persona assignments in large language models (LLMs) can impact agent performance, revealing v...
This paper discusses a hybrid obstacle avoidance system for unmanned aircraft that combines optimal control with fuzzy logic to improve d...
This paper introduces Constrained Assumption-Based Argumentation (CABA), extending traditional Assumption-Based Argumentation frameworks ...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime