China drafts law regulating 'digital humans' and banning addictive virtual services for children
A Reuters report outlines China's proposed regulations on the rapidly expanding sector of digital humans and AI avatars. Under the new dr...
Alignment, bias, regulation, and responsible AI
A Reuters report outlines China's proposed regulations on the rapidly expanding sector of digital humans and AI avatars. Under the new dr...
Abstract page for arXiv paper 2512.00408: Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Abstract page for arXiv paper 2510.15148: XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models
The paper introduces ReliabilityRAG, a framework designed to enhance the robustness of Retrieval-Augmented Generation (RAG) systems again...
This paper presents a novel algorithm for instrumental variable regression that ensures differential privacy while maintaining statistica...
The paper introduces AECBench, a benchmark for evaluating large language models (LLMs) in the Architecture, Engineering, and Construction...
The paper presents the Epsilon-Neighborhood Decision-Boundary Governed Estimation (EDGE) algorithm for efficiently estimating decision bo...
This paper presents a safety steering framework to enhance the robustness of large language models (LLMs) against multi-turn jailbreaking...
This article presents an audit of a Dutch public sector risk profiling algorithm, utilizing an unsupervised bias detection tool to identi...
This article explores the integration of Large Language Models (LLMs) with Reinforcement Learning (RL) to enhance decision-making in auto...
This paper presents a novel unsupervised prognostics framework for deep-space habitats, addressing multiple unlabeled failure modes throu...
This article presents a framework for enhancing simulation environments in Autonomous Cyber Operations (ACO) by implementing new actions ...
This paper introduces a learning-based approach for generating uncertainty-aware high-level spatial concepts in 3D Scene Graphs, enhancin...
This article presents a novel approach to simulating cyberattacks by integrating Security Chaos Engineering (SCE) into Breach Attack Simu...
This article presents a novel method called potential-energy gating for robust state estimation in bistable stochastic systems, enhancing...
The paper introduces activation-based data attribution to identify and mitigate undesirable behaviors in production language models post-...
This paper investigates the factors influencing feature importance in machine learning model explanations, emphasizing that feature salie...
This survey explores the role of foundation models in enhancing scenario generation and analysis for autonomous driving, addressing limit...
This article presents a novel framework called Geometric Pessimism for Offline Reinforcement Learning (RL), enhancing performance in robo...
This paper explores the non-identifiability of steering vectors in large language models (LLMs), revealing that these vectors cannot be u...
The paper presents SWIRL, a framework for self-improving world modeling in machine learning, focusing on latent actions to enhance predic...
The paper presents HALT, a method for finetuning large language models (LLMs) to enhance reliability by generating responses only when co...
This paper presents a novel stochastic gradient method for combinatorial optimization that requires only a single query, enhancing effici...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime