[P] If you're building AI agents, logs aren't enough. You need evidence.
I have built a programmable governance layer for AI agents. I am considering to open source completely. Looking for feedback. Agent demos...
Alignment, bias, regulation, and responsible AI
I have built a programmable governance layer for AI agents. I am considering to open source completely. Looking for feedback. Agent demos...
Abstract page for arXiv paper 2510.14628: RLAIF-SPA: Structured AI Feedback for Semantic-Prosodic Alignment in Speech Synthesis
Abstract page for arXiv paper 2504.05995: NativQA Framework: Enabling LLMs and VLMs with Native, Local, and Everyday Knowledge
This article discusses the design and deployment of a GenAI-powered training system for 9-1-1 call-takers, highlighting the challenges fa...
The paper introduces Reverse N-Wise Output-Oriented Testing, a novel approach for testing AI/ML and quantum computing systems, addressing...
This paper presents a framework for explainable failure prediction in neural networks used in radio access networks, enhancing model tran...
This paper presents the Quest Graph framework for analyzing agentic systems' capabilities, establishing a computational hierarchy and eff...
This article presents a safety-constrained reinforcement learning framework aimed at enhancing the reliability of wireless autonomy, part...
This paper discusses the need for explicit bias consideration in evaluating Large Language Models (LLMs) used in finance, identifying fiv...
This study investigates the training-induced bias towards LLM-generated content in dense retrieval systems, revealing how dataset and tra...
This paper presents two definitions of primary cause within a hybrid action-theoretic framework, addressing the complexities of causation...
This paper evaluates the effectiveness of malicious prompt classifiers under true distribution shifts, revealing significant performance ...
The paper introduces Concept Influence, a method to enhance training data attribution by leveraging interpretability, improving performan...
The paper presents StarWM, a novel world model for refining decision-making policies in StarCraft II using large language models, demonst...
The paper explores how advanced AI models exhibit complex reasoning in simulated nuclear crises, revealing insights into strategic decisi...
This paper presents a method to eliminate planner bias in goal recognition using multi-plan dataset generation, enhancing the evaluation ...
The paper presents GREAT-EER, a Graph Edge Attention Network designed to optimize emergency evacuation responses by solving the Bus Evacu...
This paper introduces Base Score Extraction Functions in gradual argumentation, enhancing decision-making and AI transparency by mapping ...
The paper presents Arbor, a framework designed to enhance the navigation of critical conversation flows in high-stakes environments like ...
This paper explores the distinction between deception and hallucination failures in large language models (LLMs), proposing a mechanism-o...
This paper explores knowledge conflicts in multimodal large language models (MLLMs) during long chain-of-thought reasoning, proposing a f...
This paper presents COOL-MC, a tool for verifying and explaining sepsis treatment policies using reinforcement learning, enhancing decisi...
The paper discusses the learnability and computability limits of machine learning, emphasizing the structured feedback of code generation...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime