[2604.03356] Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing
Abstract page for arXiv paper 2604.03356: Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing
Alignment, bias, regulation, and responsible AI
Abstract page for arXiv paper 2604.03356: Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing
Abstract page for arXiv paper 2602.01528: Making Bias Non-Predictive: Training Robust LLM Reasoning via Reinforcement Learning
Abstract page for arXiv paper 2510.27584: Image Hashing via Cross-View Code Alignment in the Age of Foundation Models
The paper presents Safe-SDL, a framework for ensuring safety in AI-driven Self-Driving Laboratories, addressing the critical 'Syntax-to-S...
This paper introduces a scenario approach for post-design certification of user-specified properties, enhancing reliability without addit...
This paper explores the effects of latent space regularization on the quality of generative test inputs for deep learning classifiers, de...
The paper introduces the Agent Communication Protocol (ACP), a framework for secure and efficient agent-to-agent orchestration, addressin...
The paper introduces CircuChain, a benchmark for evaluating large language models (LLMs) in electrical circuit analysis, focusing on thei...
GaiaFlow presents a novel framework for carbon-efficient search, employing semantic-guided diffusion tuning to balance retrieval accuracy...
This article discusses the use of large language models (LLMs) as synthetic participants in social science experiments, evaluating their ...
The paper presents GMAIL, a novel framework for aligning generated images with real images in machine learning, enhancing performance in ...
The paper presents SCENE, a novel estimator for over-the-air federated distillation that enhances aggregation without requiring pilot sig...
The article presents CARE Drive, a framework for evaluating the reason-responsiveness of vision language models in automated driving, add...
The paper presents a novel watermarking scheme for language models that ensures unforgeability and recoverability, enhancing content prov...
The paper presents RUVA, a novel architecture for personalized on-device graph reasoning that enhances user control over AI-generated con...
This paper discusses the limitations of generative AI agents that equate understanding with resolving explicit queries, highlighting the ...
This paper presents a structured capabilities model to improve the construct validity of large language model (LLM) evaluations, addressi...
This paper presents a benchmark suite for decision-making under imperfect recall in game theory, introducing regret matching algorithms t...
This article presents a novel method for detecting backdoors in LoRA adapters by analyzing their weight matrices, achieving high accuracy...
The paper presents a novel adaptive abstention system for Large Language Models (LLMs) that balances safety and utility by dynamically ad...
The paper presents WAC, a web agent that enhances task execution by integrating model collaboration, consequence simulation, and action r...
This paper presents the Layer Smoothing Attack (LSA), a novel backdoor attack exploiting layer-specific vulnerabilities in federated lear...
The paper presents X-MAP, a framework for analyzing and profiling misclassifications in spam and phishing detection, enhancing interpreta...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime