AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

[2602.00185] QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities
LLMs

arXiv - AI · 4 min ·
[2506.22653] URSA: The Universal Research and Scientific Agent
LLMs

arXiv - AI · 3 min ·
[2505.00472] UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces
AI Agents

arXiv - AI · 3 min ·

All Content

[2510.06200] StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars
LLMs

The paper introduces StarEmbed, a benchmark for evaluating time series foundation models on astronomical observations of variable stars, ...

arXiv - AI · 4 min ·
[2510.04694] Multilingual Routing in Mixture-of-Experts
LLMs

This paper explores multilingual routing in Mixture-of-Experts (MoE) architectures, revealing how these models handle multilingual data a...

arXiv - Machine Learning · 4 min ·
[2602.05139] Adaptive Exploration for Latent-State Bandits
Machine Learning

The paper presents adaptive exploration strategies for latent-state bandits, addressing challenges in reward estimation and action select...

arXiv - Machine Learning · 3 min ·
[2509.05249] COGITAO: A Visual Reasoning Framework To Study Compositionality & Generalization
Machine Learning

COGITAO introduces a novel framework for studying compositionality and generalization in visual reasoning, offering extensive task genera...

arXiv - AI · 4 min ·
[2508.08177] MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision
LLMs

The paper introduces MedReasoner, a framework that utilizes reinforcement learning for precise medical reasoning and pixel-level groundin...

arXiv - AI · 4 min ·
[2601.11616] Mixture-of-Experts as Soft Clustering: A Dual Jacobian-PCA Spectral Geometry Perspective
Machine Learning

This paper explores Mixture-of-Experts (MoE) architectures through a geometric lens, analyzing their impact on function representation an...

arXiv - Machine Learning · 4 min ·
[2508.01067] Expressive Power of Graph Transformers via Logic
LLMs

This paper explores the expressive power of graph transformers, comparing their capabilities under different logical frameworks, particul...

arXiv - AI · 3 min ·
[2506.08822] FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency
Machine Learning

The paper presents FreqPolicy, a novel flow-based visuomotor policy that enhances efficiency in robotic manipulation by imposing frequenc...

arXiv - AI · 4 min ·
[2511.22581] High entropy leads to symmetry equivariant policies in Dec-POMDPs
AI Startups

This paper explores how high entropy regularization in Dec-POMDPs leads to symmetry equivariant policies, ensuring convergence to a consi...

arXiv - Machine Learning · 4 min ·
[2505.03795] Modeling Human Behavior in a Strategic Network Game with Complex Group Dynamics
Machine Learning

This article explores modeling human behavior in strategic network games, focusing on the Junior High Game (JHG) and comparing various be...

arXiv - AI · 4 min ·
[2511.03710] Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards
Machine Learning

This article presents a novel approach to reducing variance in reinforcement learning through shrinkage baselines, enhancing training sta...

arXiv - Machine Learning · 3 min ·
[2504.08603] FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment
Robotics

The paper presents FindAnything, a framework for open-vocabulary and object-centric mapping that enhances robot exploration in unknown en...

arXiv - AI · 4 min ·
[2510.24318] Transformers can do Bayesian Clustering
Machine Learning

The paper presents Cluster-PFN, a Transformer-based model for unsupervised Bayesian clustering, demonstrating improved accuracy and speed...

arXiv - Machine Learning · 3 min ·
[2510.19753] Transformers Provably Learn Algorithmic Solutions for Graph Connectivity, But Only with the Right Data
Machine Learning

The paper explores how Transformers can learn algorithmic solutions for graph connectivity, demonstrating that success depends on the tra...

arXiv - Machine Learning · 3 min ·
[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes
LLMs

This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...

arXiv - AI · 4 min ·
[2502.17863] A Survey: Spatiotemporal Consistency in Video Generation
Generative AI

This survey reviews advancements in spatiotemporal consistency in video generation, addressing challenges and methodologies in creating c...

arXiv - AI · 4 min ·
[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers
Machine Learning

The paper introduces Pinet, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...

arXiv - Machine Learning · 3 min ·
[2412.10999] Cocoa: Co-Planning and Co-Execution with AI Agents
NLP

The paper presents Cocoa, a system designed to enhance human-agent collaboration in AI tasks by allowing flexible co-planning and co-exec...

arXiv - AI · 4 min ·
[2405.05523] Prompt When the Animal is: Temporal Animal Behavior Grounding with Positional Recovery Training
Machine Learning

This paper introduces a novel Positional Recovery Training (Port) framework for improving temporal grounding in animal behavior analysis,...

arXiv - AI · 3 min ·
[2401.04536] Evaluating Language Model Agency through Negotiations
LLMs

This paper introduces a novel method for evaluating language model agency through negotiation games, addressing limitations of existing b...

arXiv - Machine Learning · 3 min ·