AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2602.00185] QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities

Abstract page for arXiv paper 2602.00185: QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities

arXiv - AI · 4 min · 13 minutes ago

Llms

[2506.22653] URSA: The Universal Research and Scientific Agent

Abstract page for arXiv paper 2506.22653: URSA: The Universal Research and Scientific Agent

arXiv - AI · 3 min · 13 minutes ago

Ai Agents

[2505.00472] UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces

Abstract page for arXiv paper 2505.00472: UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces

arXiv - AI · 3 min · 13 minutes ago

All Content

Llms

[2510.06200] StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars

The paper introduces StarEmbed, a benchmark for evaluating time series foundation models on astronomical observations of variable stars, ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.04694] Multilingual Routing in Mixture-of-Experts

This paper explores multilingual routing in Mixture-of-Experts (MoE) architectures, revealing how these models handle multilingual data a...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.05139] Adaptive Exploration for Latent-State Bandits

The paper presents adaptive exploration strategies for latent-state bandits, addressing challenges in reward estimation and action select...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2509.05249] COGITAO: A Visual Reasoning Framework To Study Compositionality & Generalization

COGITAO introduces a novel framework for studying compositionality and generalization in visual reasoning, offering extensive task genera...

arXiv - AI · 4 min · about 2 months ago

Llms

[2508.08177] MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

The paper introduces MedReasoner, a framework that utilizes reinforcement learning for precise medical reasoning and pixel-level groundin...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2601.11616] Mixture-of-Experts as Soft Clustering: A Dual Jacobian-PCA Spectral Geometry Perspective

This paper explores Mixture-of-Experts (MoE) architectures through a geometric lens, analyzing their impact on function representation an...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2508.01067] Expressive Power of Graph Transformers via Logic

This paper explores the expressive power of graph transformers, comparing their capabilities under different logical frameworks, particul...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2506.08822] FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency

The paper presents FreqPolicy, a novel flow-based visuomotor policy that enhances efficiency in robotic manipulation by imposing frequenc...

arXiv - AI · 4 min · about 2 months ago

Ai Startups

[2511.22581] High entropy leads to symmetry equivariant policies in Dec-POMDPs

This paper explores how high entropy regularization in Dec-POMDPs leads to symmetry equivariant policies, ensuring convergence to a consi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2505.03795] Modeling Human Behavior in a Strategic Network Game with Complex Group Dynamics

This article explores modeling human behavior in strategic network games, focusing on the Junior High Game (JHG) and comparing various be...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2511.03710] Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards

This article presents a novel approach to reducing variance in reinforcement learning through shrinkage baselines, enhancing training sta...

arXiv - Machine Learning · 3 min · about 2 months ago

Robotics

[2504.08603] FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment

The paper presents FindAnything, a framework for open-vocabulary and object-centric mapping that enhances robot exploration in unknown en...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2510.24318] Transformers can do Bayesian Clustering

The paper presents Cluster-PFN, a Transformer-based model for unsupervised Bayesian clustering, demonstrating improved accuracy and speed...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2510.19753] Transformers Provably Learn Algorithmic Solutions for Graph Connectivity, But Only with the Right Data

The paper explores how Transformers can learn algorithmic solutions for graph connectivity, demonstrating that success depends on the tra...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...

arXiv - AI · 4 min · about 2 months ago

Generative Ai

[2502.17863] A Survey: Spatiotemporal Consistency in Video Generation

This survey reviews advancements in spatiotemporal consistency in video generation, addressing challenges and methodologies in creating c...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

The paper introduces $ ext{Pinet}$, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2412.10999] Cocoa: Co-Planning and Co-Execution with AI Agents

The paper presents Cocoa, a system designed to enhance human-agent collaboration in AI tasks by allowing flexible co-planning and co-exec...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2405.05523] Prompt When the Animal is: Temporal Animal Behavior Grounding with Positional Recovery Training

This paper introduces a novel Positional Recovery Training (Port) framework for improving temporal grounding in animal behavior analysis,...

arXiv - AI · 3 min · about 2 months ago

Llms

[2401.04536] Evaluating Language Model Agency through Negotiations

This paper introduces a novel method for evaluating language model agency through negotiation games, addressing limitations of existing b...

arXiv - Machine Learning · 3 min · about 2 months ago

Previous Page 110 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

[2602.00185] QUASAR: A Universal Autonomous System for Atomistic Simulation and a Benchmark of Its Capabilities

[2506.22653] URSA: The Universal Research and Scientific Agent

[2505.00472] UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces

All Content

[2510.06200] StarEmbed: Benchmarking Time Series Foundation Models on Astronomical Observations of Variable Stars

[2510.04694] Multilingual Routing in Mixture-of-Experts

[2602.05139] Adaptive Exploration for Latent-State Bandits

[2509.05249] COGITAO: A Visual Reasoning Framework To Study Compositionality & Generalization

[2508.08177] MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

[2601.11616] Mixture-of-Experts as Soft Clustering: A Dual Jacobian-PCA Spectral Geometry Perspective

[2508.01067] Expressive Power of Graph Transformers via Logic

[2506.08822] FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency

[2511.22581] High entropy leads to symmetry equivariant policies in Dec-POMDPs

[2505.03795] Modeling Human Behavior in a Strategic Network Game with Complex Group Dynamics

[2511.03710] Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards

[2504.08603] FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment

[2510.24318] Transformers can do Bayesian Clustering

[2510.19753] Transformers Provably Learn Algorithmic Solutions for Graph Connectivity, But Only with the Right Data

[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

[2502.17863] A Survey: Spatiotemporal Consistency in Video Generation

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

[2412.10999] Cocoa: Co-Planning and Co-Execution with AI Agents

[2405.05523] Prompt When the Animal is: Temporal Animal Behavior Grounding with Positional Recovery Training

[2401.04536] Evaluating Language Model Agency through Negotiations

Related Topics

Stay updated with AI News