AI Agents
Autonomous agents, tool use, and agentic systems
Top This Week
[P] If you're building AI agents, logs aren't enough. You need evidence.
I have built a programmable governance layer for AI agents and am considering open-sourcing it completely. Looking for feedback. Agent demos...
[2602.00185] QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities
All Content
[2602.16671] SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation
The SPARC framework enhances automated C unit test generation by bridging the gap between program intent and syntactic constraints, impro...
[2602.16634] Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models
The paper presents Enhanced Diffusion Sampling, a novel method for efficient rare event sampling and free energy calculation in molecular...
[2602.16612] Causal and Compositional Abstraction
The paper presents a formal framework for causal and compositional abstraction, emphasizing its significance in AI and scientific practic...
[2602.16555] Learning Distributed Equilibria in Linear-Quadratic Stochastic Differential Games: An $α$-Potential Approach
This paper explores independent policy-gradient learning in N-player linear-quadratic stochastic differential games, establishing global ...
[2602.16585] DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows
DataJoint 2.0 introduces a relational workflow model designed to enhance collaboration in scientific data pipelines, ensuring data integr...
[2602.16476] Learning Preference from Observed Rankings
This paper presents a framework for learning individual preferences from partial ranking data, enhancing recommendation systems by addres...
[2602.16554] MerLean: An Agentic Framework for Autoformalization in Quantum Computation
MerLean introduces an automated framework for autoformalization in quantum computation, converting mathematical statements into verified ...
[2602.16520] Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents
The paper presents RLM-JB, a framework utilizing Recursive Language Models for detecting jailbreak prompts in large language models, enha...
[2602.16375] Variable-Length Semantic IDs for Recommender Systems
This paper introduces variable-length semantic identifiers for recommender systems, addressing challenges in item representation and impr...
[2602.16485] Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
The paper introduces 'Team-of-Thoughts', a novel multi-agent system architecture that enhances performance by leveraging heterogeneous ag...
[2602.16305] BAT: Better Audio Transformer Guided by Convex Gated Probing
The paper introduces the Better Audio Transformer (BAT), which utilizes a novel Convex Gated Probing method to enhance audio self-supervi...
[2602.16444] RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation
RoboGene introduces a framework for automating the generation of diverse, physically plausible robotic manipulation tasks, addressing the...
[2602.16183] Multi-Agent Combinatorial-Multi-Armed-Bandit framework for the Submodular Welfare Problem under Bandit Feedback
This paper presents a multi-agent combinatorial multi-armed bandit framework for the Submodular Welfare Problem, achieving improved regre...
[2602.16372] AI-Driven Structure Refinement of X-ray Diffraction
This paper presents WPEM, an AI-driven workflow for refining X-ray diffraction data, enhancing the stability and accuracy of peak intensi...
[2602.16356] Articulated 3D Scene Graphs for Open-World Mobile Manipulation
This paper presents MoMa-SG, a framework for creating semantic-kinematic 3D scene graphs to enhance mobile manipulation of articulated ob...
[2602.16161] Emotion Collider: Dual Hyperbolic Mirror Manifolds for Sentiment Recovery via Anti Emotion Reflection
The paper presents Emotion Collider (EC-Net), a novel framework for multimodal emotion and sentiment modeling using hyperbolic geometry a...
[2602.16334] Spatial Audio Question Answering and Reasoning on Dynamic Source Movements
This article presents a study on Spatial Audio Question Answering (Spatial AQA) focusing on dynamic sound source movements, introducing i...
[2602.16315] The Diversity Paradox revisited: Systemic Effects of Feedback Loops in Recommender Systems
This paper revisits the diversity paradox in recommender systems, exploring how feedback loops influence user behavior and consumption pa...
[2602.16307] Generative AI Usage of University Students: Navigating Between Education and Business
This study explores the use of generative AI by university students balancing education and work, highlighting its benefits and challenges.
[2602.16131] Empirical Cumulative Distribution Function Clustering for LLM-based Agent System Analysis
This article presents a novel evaluation framework for LLM-based agents using empirical cumulative distribution functions (ECDFs) to asse...