China drafts law regulating 'digital humans' and banning addictive virtual services for children
A Reuters report outlines China's proposed regulations on the rapidly expanding sector of digital humans and AI avatars. Under the new dr...
Alignment, bias, regulation, and responsible AI
A Reuters report outlines China's proposed regulations on the rapidly expanding sector of digital humans and AI avatars. Under the new dr...
Abstract page for arXiv paper 2512.00408: Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Abstract page for arXiv paper 2510.15148: XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models
GLM-5 introduces a next-generation foundation model that enhances coding capabilities through agentic engineering, reducing costs while i...
This paper explores how neural networks represent latent geometry in forecasting complex dynamical systems, linking model alignment with ...
The paper discusses the 'Stationarity Bias' in time-series imputation, proposing a 'Stratified Stress-Test' to evaluate methods under dif...
This article presents a novel approach to certified machine unlearning through adaptive per-instance noise calibration, significantly red...
This paper presents uniform error bounds for quantized dynamical models, providing statistical guarantees on their accuracy when learned ...
The paper introduces Direct Kolen-Pollack Predictive Coding (DKP-PC), an innovative approach that enhances the efficiency of predictive c...
The paper explores how AI models can learn to obfuscate deception when trained against white-box deception detectors, introducing a taxon...
This paper explores the approximation theory for Lipschitz continuous Transformers, establishing a theoretical foundation for their stabi...
The paper presents a novel approach to efficiently evaluate large language models (LLMs) under budget constraints, utilizing multi-armed ...
The paper presents ExLipBaB, a method for exact computation of Lipschitz constants in piecewise linear neural networks, addressing limita...
This paper explores the relationship between logit distance and representational similarity in discriminative models, demonstrating that ...
This paper explores how asymmetric conditions in Sequential Social Dilemmas affect cooperation dynamics in Multi-Agent Reinforcement Lear...
The paper presents ER-MIA, a framework for black-box adversarial memory injection attacks on long-term memory-augmented large language mo...
This article presents a framework called Obj-Disco, which identifies implicit alignment objectives in large language models (LLMs) to enh...
This article presents a hybrid framework combining Federated Learning and Split Learning to enhance privacy in clinical decision-making w...
This paper introduces a novel classification head architecture using complex-valued unitary representations to enhance uncertainty quanti...
This article discusses a novel approach to adversarial training for large language models (LLMs), proposing Distributional Adversarial Tr...
This article presents a novel approach to identifying biases in reward models used in large language models (LLMs), highlighting the pote...
The paper presents MAVRL, a novel approach for learning reward functions from multiple feedback types using amortized variational inferen...
This paper presents a model-based primal-dual algorithm for online constrained Markov Decision Processes (CMDPs), achieving near-optimal ...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime