Deterministic vs. probabilistic guardrails for agentic AI — our approach and an open-source tool [D]
We've been thinking hard about whether safety guardrails for AI agents should be LLM-based (probabilistic) or rule-based (deterministic)....
GPT, Claude, Gemini, and other LLMs
A lot of AI startups exist partly because the foundation models haven't expanded into their category yet. As many jokingly acknowledge, t...
When ChatGPT or Perplexity answers a question, it runs RAG: retrieves top candidates from a crawled index, then scores them. The scoring ...
Abstract page for arXiv paper 2602.01701: Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query System...
Abstract page for arXiv paper 2602.01649: Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning
Abstract page for arXiv paper 2602.00428: When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent S...
Abstract page for arXiv paper 2601.22060: Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
Abstract page for arXiv paper 2601.21895: Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text
Abstract page for arXiv paper 2602.08324: Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
Abstract page for arXiv paper 2602.05735: CSRv2: Unlocking Ultra-Sparse Embeddings
Abstract page for arXiv paper 2602.04369: Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis
Abstract page for arXiv paper 2602.02742: Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding
Abstract page for arXiv paper 2602.02555: Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Rein...
Abstract page for arXiv paper 2512.08937: When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being
Abstract page for arXiv paper 2601.20838: Reward Models Inherit Value Biases from Pretraining
Abstract page for arXiv paper 2601.20088: Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery
Abstract page for arXiv paper 2512.03794: AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
Abstract page for arXiv paper 2512.01822: InnoGym: Benchmarking the Innovation Potential of AI Agents
Abstract page for arXiv paper 2511.21740: A cross-species neural foundation model for end-to-end speech decoding
Abstract page for arXiv paper 2511.21722: German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Populatio...
Abstract page for arXiv paper 2601.18753: HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs
Abstract page for arXiv paper 2511.10985: When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets
Abstract page for arXiv paper 2601.04786: AgentOCR: Reimagining Agent History via Optical Self-Compression