AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Boston's CIO wants the public — and other city governments — to use his open-source agentic AI tools
Ai Agents

Boston's CIO wants the public — and other city governments — to use his open-source agentic AI tools

AI Tools & Products · 7 min ·
Machine Learning

[P] If you're building AI agents, logs aren't enough. You need evidence.

I have built a programmable governance layer for AI agents. I am considering to open source completely. Looking for feedback. Agent demos...

Reddit - Machine Learning · 1 min ·
[2602.00185] QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities
Llms

[2602.00185] QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities

Abstract page for arXiv paper 2602.00185: QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities

arXiv - AI · 4 min ·

All Content

[2602.16671] SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation
Llms

[2602.16671] SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation

The SPARC framework enhances automated C unit test generation by bridging the gap between program intent and syntactic constraints, impro...

arXiv - AI · 4 min ·
[2602.16634] Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models
Machine Learning

[2602.16634] Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models

The paper presents Enhanced Diffusion Sampling, a novel method for efficient rare event sampling and free energy calculation in molecular...

arXiv - Machine Learning · 4 min ·
[2602.16612] Causal and Compositional Abstraction
Machine Learning

[2602.16612] Causal and Compositional Abstraction

The paper presents a formal framework for causal and compositional abstraction, emphasizing its significance in AI and scientific practic...

arXiv - AI · 4 min ·
[2602.16555] Learning Distributed Equilibria in Linear-Quadratic Stochastic Differential Games: An $α$-Potential Approach
Machine Learning

[2602.16555] Learning Distributed Equilibria in Linear-Quadratic Stochastic Differential Games: An $α$-Potential Approach

This paper explores independent policy-gradient learning in N-player linear-quadratic stochastic differential games, establishing global ...

arXiv - Machine Learning · 3 min ·
[2602.16585] DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows
Machine Learning

[2602.16585] DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows

DataJoint 2.0 introduces a relational workflow model designed to enhance collaboration in scientific data pipelines, ensuring data integr...

arXiv - AI · 3 min ·
[2602.16476] Learning Preference from Observed Rankings
Machine Learning

[2602.16476] Learning Preference from Observed Rankings

This paper presents a framework for learning individual preferences from partial ranking data, enhancing recommendation systems by addres...

arXiv - Machine Learning · 4 min ·
[2602.16554] MerLean: An Agentic Framework for Autoformalization in Quantum Computation
Ai Agents

[2602.16554] MerLean: An Agentic Framework for Autoformalization in Quantum Computation

MerLean introduces an automated framework for autoformalization in quantum computation, converting mathematical statements into verified ...

arXiv - AI · 3 min ·
[2602.16520] Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents
Llms

[2602.16520] Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents

The paper presents RLM-JB, a framework utilizing Recursive Language Models for detecting jailbreak prompts in large language models, enha...

arXiv - AI · 3 min ·
[2602.16375] Variable-Length Semantic IDs for Recommender Systems
Llms

[2602.16375] Variable-Length Semantic IDs for Recommender Systems

This paper introduces variable-length semantic identifiers for recommender systems, addressing challenges in item representation and impr...

arXiv - Machine Learning · 4 min ·
[2602.16485] Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
Machine Learning

[2602.16485] Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling

The paper introduces 'Team-of-Thoughts', a novel multi-agent system architecture that enhances performance by leveraging heterogeneous ag...

arXiv - AI · 3 min ·
[2602.16305] BAT: Better Audio Transformer Guided by Convex Gated Probing
Machine Learning

[2602.16305] BAT: Better Audio Transformer Guided by Convex Gated Probing

The paper introduces the Better Audio Transformer (BAT), which utilizes a novel Convex Gated Probing method to enhance audio self-supervi...

arXiv - Machine Learning · 3 min ·
[2602.16444] RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation
Machine Learning

[2602.16444] RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation

RoboGene introduces a framework for automating the generation of diverse, physically plausible robotic manipulation tasks, addressing the...

arXiv - AI · 4 min ·
[2602.16183] Multi-Agent Combinatorial-Multi-Armed-Bandit framework for the Submodular Welfare Problem under Bandit Feedback
Ai Agents

[2602.16183] Multi-Agent Combinatorial-Multi-Armed-Bandit framework for the Submodular Welfare Problem under Bandit Feedback

This paper presents a multi-agent combinatorial multi-armed bandit framework for the Submodular Welfare Problem, achieving improved regre...

arXiv - Machine Learning · 3 min ·
[2602.16372] AI-Driven Structure Refinement of X-ray Diffraction
Nlp

[2602.16372] AI-Driven Structure Refinement of X-ray Diffraction

This paper presents WPEM, an AI-driven workflow for refining X-ray diffraction data, enhancing the stability and accuracy of peak intensi...

arXiv - AI · 4 min ·
[2602.16356] Articulated 3D Scene Graphs for Open-World Mobile Manipulation
Robotics

[2602.16356] Articulated 3D Scene Graphs for Open-World Mobile Manipulation

This paper presents MoMa-SG, a framework for creating semantic-kinematic 3D scene graphs to enhance mobile manipulation of articulated ob...

arXiv - AI · 4 min ·
[2602.16161] Emotion Collider: Dual Hyperbolic Mirror Manifolds for Sentiment Recovery via Anti Emotion Reflection
Machine Learning

[2602.16161] Emotion Collider: Dual Hyperbolic Mirror Manifolds for Sentiment Recovery via Anti Emotion Reflection

The paper presents Emotion Collider (EC-Net), a novel framework for multimodal emotion and sentiment modeling using hyperbolic geometry a...

arXiv - Machine Learning · 3 min ·
[2602.16334] Spatial Audio Question Answering and Reasoning on Dynamic Source Movements
Machine Learning

[2602.16334] Spatial Audio Question Answering and Reasoning on Dynamic Source Movements

This article presents a study on Spatial Audio Question Answering (Spatial AQA) focusing on dynamic sound source movements, introducing i...

arXiv - AI · 4 min ·
[2602.16315] The Diversity Paradox revisited: Systemic Effects of Feedback Loops in Recommender Systems
Machine Learning

[2602.16315] The Diversity Paradox revisited: Systemic Effects of Feedback Loops in Recommender Systems

This paper revisits the diversity paradox in recommender systems, exploring how feedback loops influence user behavior and consumption pa...

arXiv - AI · 3 min ·
[2602.16307] Generative AI Usage of University Students: Navigating Between Education and Business
Generative Ai

[2602.16307] Generative AI Usage of University Students: Navigating Between Education and Business

This study explores the use of generative AI by university students balancing education and work, highlighting its benefits and challenges.

arXiv - AI · 3 min ·
[2602.16131] Empirical Cumulative Distribution Function Clustering for LLM-based Agent System Analysis
Llms

[2602.16131] Empirical Cumulative Distribution Function Clustering for LLM-based Agent System Analysis

This article presents a novel evaluation framework for LLM-based agents using empirical cumulative distribution functions (ECDFs) to asse...

arXiv - Machine Learning · 3 min ·
Previous Page 112 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime