AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Agents

Considering NeurIPS submission [D]

Wondering if it worth submitting paper I’m working on to NeurIPS. I have formal mathematical proof for convergence of a novel agentic sys...

Reddit - Machine Learning · 1 min · about 5 hours ago

Ai Agents

Agent frameworks waste ~350,000+ tokens per session resending static files. 95% reduction benchmarked.

Measured the actual token waste on a local Qwen 3.5 122B setup. The numbers are unreal. Found a compile-time approach that cuts query con...

Reddit - Artificial Intelligence · 1 min · about 10 hours ago

Ai Agents

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

The viral AI agentic tool let attackers silently gain admin unauthenticated access.

Ars Technica - AI · 5 min · about 13 hours ago

All Content

Machine Learning

[2509.01924] Non-Linear Model-Based Sequential Decision-Making in Agriculture

This paper presents a novel approach to sequential decision-making in agriculture using nonlinear model-based algorithms, enhancing resou...

arXiv - Machine Learning · 4 min · about 1 month ago

Robotics

[2510.11103] A Primer on SO(3) Action Representations in Deep Reinforcement Learning

This paper explores SO(3) action representations in deep reinforcement learning, focusing on their implications for robotic control tasks...

arXiv - AI · 4 min · about 1 month ago

Robotics

[2510.09469] Towards Information-Optimized Multi-Agent Path Finding: A Hybrid Framework with Reduced Inter-Agent Information Sharing

This paper presents a hybrid framework for Multi-Agent Path Finding (MAPF) that minimizes inter-agent information sharing while maintaini...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.05598] AgentDR: Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents

The paper presents AgentDR, a novel framework that enhances recommendation systems by leveraging LLMs to understand implicit item relatio...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.02240] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

The paper presents RewardMap, a multi-stage reinforcement learning framework aimed at improving fine-grained visual reasoning in multimod...

arXiv - AI · 4 min · about 1 month ago

Llms

[2505.22842] Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation

The paper introduces the Bayesian Attention Mechanism (BAM), a novel framework for positional encoding in transformer models that enhance...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2509.24526] CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models

The paper introduces Consistency Mid-Training (CMT), a novel method for enhancing the efficiency of training flow map models, achieving s...

arXiv - Machine Learning · 4 min · about 1 month ago

Robotics

[2509.24243] SafeFlowMatcher: Safe and Fast Planning using Flow Matching with Control Barrier Functions

The paper presents SafeFlowMatcher, a new planning framework that integrates flow matching with control barrier functions to ensure safe ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.23040] Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents

The paper presents ReMemR1, a novel approach for enhancing long-context reasoning in large language models by integrating revisitable mem...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2503.07853] Hier-COS: Making Deep Features Hierarchy-aware via Composition of Orthogonal Subspaces

The paper presents Hier-COS, a new framework for improving hierarchical classification in deep learning by addressing limitations in exis...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2509.21628] Comparing and Integrating Different Notions of Representational Correspondence in Neural Systems

This article explores the integration of various representational similarity metrics in neural systems, assessing their effectiveness in ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2508.11915] CORE: Measuring Multi-Agent LLM Interaction Quality under Game-Theoretic Pressures

The paper introduces CORE, a metric for evaluating language quality in multi-agent LLM interactions under game-theoretic conditions, reve...

arXiv - Machine Learning · 4 min · about 1 month ago

Ai Infrastructure

[2508.07087] SQL-Exchange: Transforming SQL Queries Across Domains

SQL-Exchange introduces a framework for transforming SQL queries across different database schemas while maintaining structural integrity...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2402.10758] Stochastic Localization via Iterative Posterior Sampling

This article presents a novel methodology called Stochastic Localization via Iterative Posterior Sampling (SLIPS) for sampling from unnor...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2508.00017] Generative Logic: A New Computer Architecture for Deterministic Reasoning and Knowledge Generation

The paper introduces Generative Logic (GL), a new computer architecture designed for deterministic reasoning and knowledge generation, ut...

arXiv - AI · 4 min · about 1 month ago

Llms

[2507.20174] LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks

The paper introduces LRR-Bench, a benchmark for evaluating Vision-Language Models (VLMs) on spatial understanding tasks, revealing signif...

arXiv - AI · 4 min · about 1 month ago

Robotics

[2507.16874] Budget Allocation Policies for Real-Time Multi-Agent Path Finding

This article presents a study on budget allocation policies for real-time multi-agent path finding (RT-MAPF), focusing on improving effic...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.08885] Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression

This article presents a novel approach to symbolic regression through the introduction of SimpliPy, a simplification engine that signific...

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

This study evaluates the performance of generalist Vision Language Models (VLMs) compared to specialist medical VLMs, revealing that gene...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

The paper presents EBPO, a novel framework that enhances Group Relative Policy Optimization (GRPO) by employing Empirical Bayes shrinkage...

arXiv - AI · 4 min · about 1 month ago

Previous Page 67 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

Considering NeurIPS submission [D]

Agent frameworks waste ~350,000+ tokens per session resending static files. 95% reduction benchmarked.

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

All Content

[2509.01924] Non-Linear Model-Based Sequential Decision-Making in Agriculture

[2510.11103] A Primer on SO(3) Action Representations in Deep Reinforcement Learning

[2510.09469] Towards Information-Optimized Multi-Agent Path Finding: A Hybrid Framework with Reduced Inter-Agent Information Sharing

[2510.05598] AgentDR: Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents

[2510.02240] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

[2505.22842] Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation

[2509.24526] CMT: Mid-Training for Efficient Learning of Consistency, Mean Flow, and Flow Map Models

[2509.24243] SafeFlowMatcher: Safe and Fast Planning using Flow Matching with Control Barrier Functions

[2509.23040] Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents

[2503.07853] Hier-COS: Making Deep Features Hierarchy-aware via Composition of Orthogonal Subspaces

[2509.21628] Comparing and Integrating Different Notions of Representational Correspondence in Neural Systems

[2508.11915] CORE: Measuring Multi-Agent LLM Interaction Quality under Game-Theoretic Pressures

[2508.07087] SQL-Exchange: Transforming SQL Queries Across Domains

[2402.10758] Stochastic Localization via Iterative Posterior Sampling

[2508.00017] Generative Logic: A New Computer Architecture for Deterministic Reasoning and Knowledge Generation

[2507.20174] LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks

[2507.16874] Budget Allocation Policies for Real-Time Multi-Agent Path Finding

[2602.08885] Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression

[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

Related Topics

Stay updated with AI News