[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Alignment, bias, regulation, and responsible AI
Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Abstract page for arXiv paper 2510.18196: Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge
Abstract page for arXiv paper 2509.23435: AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models
This paper presents an information-theoretic analysis of world models in optimal reward maximizers, quantifying the information conveyed ...
This article evaluates the robustness of large reasoning models against multi-turn adversarial attacks, revealing vulnerabilities and pro...
The article presents X-SYS, a reference architecture designed for interactive explanation systems in AI, addressing the challenges of dep...
The paper introduces SkillsBench, a benchmark assessing the effectiveness of agent skills across 86 tasks in 11 domains, revealing signif...
This paper introduces a diagnostic benchmark for evaluating the robustness of reasoning models on parameterized logical problems, specifi...
GT-HarmBench introduces a benchmark for evaluating AI safety risks in multi-agent environments, highlighting significant reliability gaps...
This paper presents a theoretical framework for adaptive utility-weighted benchmarking in AI, emphasizing the importance of stakeholder p...
The article discusses five critical issues surrounding AI at the AI Impact Summit, including job displacement, rogue AI, energy demands, ...
Anthropic has donated $20 million to Public First Action to promote AI education and policy, emphasizing the need for regulation amidst g...
The introduction of ads in AI chatbots raises privacy concerns as companies like OpenAI and Microsoft explore new revenue models amidst u...
David Greene, former NPR host, is suing Google, claiming the voice in its NotebookLM tool mimics his own. This raises concerns about AI v...
Oglethorpe students will engage in discussions on the ethics of artificial intelligence and its workplace implications, featuring expert ...
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime