AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

Considering NeurIPS submission [D]

Wondering if it worth submitting paper I’m working on to NeurIPS. I have formal mathematical proof for convergence of a novel agentic sys...

Reddit - Machine Learning · 1 min ·
Ai Agents

Agent frameworks waste ~350,000+ tokens per session resending static files. 95% reduction benchmarked.

Measured the actual token waste on a local Qwen 3.5 122B setup. The numbers are unreal. Found a compile-time approach that cuts query con...

Reddit - Artificial Intelligence · 1 min ·
OpenClaw gives users yet another reason to be freaked out about security - Ars Technica
Ai Agents

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

The viral AI agentic tool let attackers silently gain admin unauthenticated access.

Ars Technica - AI · 5 min ·

All Content

[2509.01924] Non-Linear Model-Based Sequential Decision-Making in Agriculture
Machine Learning

[2509.01924] Non-Linear Model-Based Sequential Decision-Making in Agriculture

This paper presents a novel approach to sequential decision-making in agriculture using nonlinear model-based algorithms, enhancing resou...

arXiv - Machine Learning · 4 min ·
[2510.11103] A Primer on SO(3) Action Representations in Deep Reinforcement Learning
Robotics

[2510.11103] A Primer on SO(3) Action Representations in Deep Reinforcement Learning

This paper explores SO(3) action representations in deep reinforcement learning, focusing on their implications for robotic control tasks...

arXiv - AI · 4 min ·
[2510.09469] Towards Information-Optimized Multi-Agent Path Finding: A Hybrid Framework with Reduced Inter-Agent Information Sharing
Robotics

[2510.09469] Towards Information-Optimized Multi-Agent Path Finding: A Hybrid Framework with Reduced Inter-Agent Information Sharing

This paper presents a hybrid framework for Multi-Agent Path Finding (MAPF) that minimizes inter-agent information sharing while maintaini...

arXiv - AI · 4 min ·
[2510.05598] AgentDR: Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents
Llms

[2510.05598] AgentDR: Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents

The paper presents AgentDR, a novel framework that enhances recommendation systems by leveraging LLMs to understand implicit item relatio...

arXiv - AI · 4 min ·
[2510.02240] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
Llms

[2510.02240] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

The paper presents RewardMap, a multi-stage reinforcement learning framework aimed at improving fine-grained visual reasoning in multimod...

arXiv - AI · 4 min ·
[2505.22842] Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation
Llms

[2505.22842] Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation

The paper introduces the Bayesian Attention Mechanism (BAM), a novel framework for positional encoding in transformer models that enhance...

arXiv - Machine Learning · 3 min ·
[2509.24526] CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models
Machine Learning

[2509.24526] CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models

The paper introduces Consistency Mid-Training (CMT), a novel method for enhancing the efficiency of training flow map models, achieving s...

arXiv - Machine Learning · 4 min ·
[2509.24243] SafeFlowMatcher: Safe and Fast Planning using Flow Matching with Control Barrier Functions
Robotics

[2509.24243] SafeFlowMatcher: Safe and Fast Planning using Flow Matching with Control Barrier Functions

The paper presents SafeFlowMatcher, a new planning framework that integrates flow matching with control barrier functions to ensure safe ...

arXiv - AI · 4 min ·
[2509.23040] Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents
Llms

[2509.23040] Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents

The paper presents ReMemR1, a novel approach for enhancing long-context reasoning in large language models by integrating revisitable mem...

arXiv - AI · 4 min ·
[2503.07853] Hier-COS: Making Deep Features Hierarchy-aware via Composition of Orthogonal Subspaces
Machine Learning

[2503.07853] Hier-COS: Making Deep Features Hierarchy-aware via Composition of Orthogonal Subspaces

The paper presents Hier-COS, a new framework for improving hierarchical classification in deep learning by addressing limitations in exis...

arXiv - Machine Learning · 4 min ·
[2509.21628] Comparing and Integrating Different Notions of Representational Correspondence in Neural Systems
Machine Learning

[2509.21628] Comparing and Integrating Different Notions of Representational Correspondence in Neural Systems

This article explores the integration of various representational similarity metrics in neural systems, assessing their effectiveness in ...

arXiv - AI · 4 min ·
[2508.11915] CORE: Measuring Multi-Agent LLM Interaction Quality under Game-Theoretic Pressures
Llms

[2508.11915] CORE: Measuring Multi-Agent LLM Interaction Quality under Game-Theoretic Pressures

The paper introduces CORE, a metric for evaluating language quality in multi-agent LLM interactions under game-theoretic conditions, reve...

arXiv - Machine Learning · 4 min ·
[2508.07087] SQL-Exchange: Transforming SQL Queries Across Domains
Ai Infrastructure

[2508.07087] SQL-Exchange: Transforming SQL Queries Across Domains

SQL-Exchange introduces a framework for transforming SQL queries across different database schemas while maintaining structural integrity...

arXiv - AI · 3 min ·
[2402.10758] Stochastic Localization via Iterative Posterior Sampling
Machine Learning

[2402.10758] Stochastic Localization via Iterative Posterior Sampling

This article presents a novel methodology called Stochastic Localization via Iterative Posterior Sampling (SLIPS) for sampling from unnor...

arXiv - Machine Learning · 4 min ·
[2508.00017] Generative Logic: A New Computer Architecture for Deterministic Reasoning and Knowledge Generation
Machine Learning

[2508.00017] Generative Logic: A New Computer Architecture for Deterministic Reasoning and Knowledge Generation

The paper introduces Generative Logic (GL), a new computer architecture designed for deterministic reasoning and knowledge generation, ut...

arXiv - AI · 4 min ·
[2507.20174] LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks
Llms

[2507.20174] LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks

The paper introduces LRR-Bench, a benchmark for evaluating Vision-Language Models (VLMs) on spatial understanding tasks, revealing signif...

arXiv - AI · 4 min ·
[2507.16874] Budget Allocation Policies for Real-Time Multi-Agent Path Finding
Robotics

[2507.16874] Budget Allocation Policies for Real-Time Multi-Agent Path Finding

This article presents a study on budget allocation policies for real-time multi-agent path finding (RT-MAPF), focusing on improving effic...

arXiv - AI · 4 min ·
[2602.08885] Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression
Machine Learning

[2602.08885] Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression

This article presents a novel approach to symbolic regression through the introduction of SimpliPy, a simplification engine that signific...

arXiv - AI · 4 min ·
[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights
Llms

[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

This study evaluates the performance of generalist Vision Language Models (VLMs) compared to specialist medical VLMs, revealing that gene...

arXiv - AI · 3 min ·
[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization
Llms

[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

The paper presents EBPO, a novel framework that enhances Group Relative Policy Optimization (GRPO) by employing Empirical Bayes shrinkage...

arXiv - AI · 4 min ·
Previous Page 67 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime