[2508.00500] ProbGuard: Probabilistic Runtime Monitoring for LLM Agent

[2508.00500] ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety

arXiv - AI March 30, 2026 4 min read

About this article

Abstract page for arXiv paper 2508.00500: ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety

Computer Science > Artificial Intelligence arXiv:2508.00500 (cs) [Submitted on 1 Aug 2025 (v1), last revised 27 Mar 2026 (this version, v3)] Title:ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety Authors:Haoyu Wang, Christopher M. Poskitt, Jiali Wei, Jun Sun View a PDF of the paper titled ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety, by Haoyu Wang and Christopher M. Poskitt and Jiali Wei and Jun Sun View PDF HTML (experimental) Abstract:Large Language Model (LLM) agents increasingly operate across domains such as robotics, virtual assistants, and web automation. However, their stochastic decision-making introduces safety risks that are difficult to anticipate during execution. Existing runtime monitoring frameworks, such as AgentSpec, primarily rely on reactive safety rules that detect violations only when unsafe behavior is imminent or has already occurred, limiting their ability to handle long-horizon dependencies. We present ProbGuard, a proactive runtime monitoring framework for LLM agents that anticipates safety violations through probabilistic risk prediction. ProbGuard abstracts agent executions into symbolic states and learns a Discrete-Time Markov Chain (DTMC) from execution traces to model behavioral dynamics. At runtime, the monitor estimates the probability that future executions will reach unsafe states and triggers interventions when this risk exceeds a user-defined threshold. To improve robustness, ProbGuard incorporates s...

Originally published on March 30, 2026. Curated by AI News.

Llms

De-aged casts, ChatGPT-generated programs: How AI is changing Korean TV

Artificial intelligence is transforming every corner of industry, and television is no exception. Major networks in Korea have recently a...

AI Tools & Products · 4 min · about 1 hour ago

Llms

[2603.16629] MLLM-based Textual Explanations for Face Comparison

Abstract page for arXiv paper 2603.16629: MLLM-based Textual Explanations for Face Comparison

arXiv - AI · 4 min · about 2 hours ago

Llms

[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

Abstract page for arXiv paper 2603.15159: To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

arXiv - AI · 4 min · about 2 hours ago

Llms

[2602.08316] SWE Context Bench: A Benchmark for Context Learning in Coding

Abstract page for arXiv paper 2602.08316: SWE Context Bench: A Benchmark for Context Learning in Coding

arXiv - AI · 4 min · about 2 hours ago

[2508.00500] ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety

About this article

Related Articles

De-aged casts, ChatGPT-generated programs: How AI is changing Korean TV

[2603.16629] MLLM-based Textual Explanations for Face Comparison

[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

[2602.08316] SWE Context Bench: A Benchmark for Context Learning in Coding

No comments

Stay updated with AI News