[P] If you're building AI agents, logs aren't enough. You need evidence.
I have built a programmable governance layer for AI agents. I am considering to open source completely. Looking for feedback. Agent demos...
Alignment, bias, regulation, and responsible AI
I have built a programmable governance layer for AI agents. I am considering to open source completely. Looking for feedback. Agent demos...
Abstract page for arXiv paper 2510.14628: RLAIF-SPA: Structured AI Feedback for Semantic-Prosodic Alignment in Speech Synthesis
Abstract page for arXiv paper 2504.05995: NativQA Framework: Enabling LLMs and VLMs with Native, Local, and Everyday Knowledge
The paper presents a Privacy-Concealing Cooperation (PCC) framework for Bird's Eye View (BEV) semantic segmentation, enhancing autonomous...
The paper presents AISA, a novel defense mechanism for large language models (LLMs) that enhances safety against jailbreak attacks by act...
The paper introduces Boundary Point Jailbreaking (BPJ), a novel automated attack method that circumvents advanced safeguards in black-box...
This paper introduces the concept of capability calibration for large language models (LLMs), emphasizing the importance of accurate conf...
This study presents a fine-tuned BERT classifier for detecting AI-generated content in Turkish news media, achieving a high F1 score and ...
The paper presents a framework for web-scale multimodal summarization that integrates text and image data using CLIP-based semantic align...
This article explores the use of machine learning to detect obfuscated abusive language in Swahili, focusing on child safety and the chal...
MoltNet explores the social behavior of AI agents on the MoltBook platform, revealing insights into their interactions and similarities t...
The paper presents Atomix, a runtime system designed to enhance the reliability of agentic workflows by implementing progress-aware trans...
The paper explores backdoor attacks in large language models (LLMs), focusing on how biases can be induced through syntactically and sema...
This paper introduces Interactionless Inverse Reinforcement Learning, a framework aimed at improving AI alignment by decoupling safety ob...
This article explores the metabolic cost of information processing in Poisson variational autoencoders, emphasizing the energy constraint...
This article presents a new benchmark, MT-AgentRisk, for evaluating safety risks in multi-turn interactions of tool-using agents, reveali...
The paper presents G2CP, a novel communication protocol for multi-agent systems that enhances efficiency and verifiability by using graph...
This article evaluates the capabilities of large language models (LLMs) in generating spear-phishing websites, highlighting the potential...
The paper introduces AdaCorrection, a framework that enhances the efficiency of Diffusion Transformers by correcting cache misalignment, ...
This paper explores the concept of scale redundancy in positively homogeneous neural networks, introducing gauge-adapted coordinates and ...
This paper presents methods to enhance the efficiency of backpropagation in deep learning by using unbiased approximate vector-Jacobian p...
This paper investigates the diversity bias in deep generative models, revealing that these models often underestimate the diversity of th...
This article presents a novel causal inference framework for traffic safety modeling, utilizing semantic features from street-view images...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime