AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Agents

You can now give an AI agent its own email, phone number, wallet, computer, and voice. This is what the stack looks like

I’ve been tracking the companies building primitives specifically for agents rather than humans. The pattern is becoming obvious: every c...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Ai Agents

AI agents have been blindly guessing your UI this whole time. Here's the file that fixes it.

Every time you ask an AI coding agent to build UI, it invents everything from scratch. Colors. Fonts. Spacing. Button styles. All of it -...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

Llms

OpenClaw security checklist: practical safeguards for AI agents

Here is one of the better quality guides on the ensuring safety when deploying OpenClaw: https://chatgptguide.ai/openclaw-security-checkl...

Reddit - Artificial Intelligence · 1 min · about 10 hours ago

All Content

Machine Learning

[2602.19244] Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts

This paper presents a Soft Mixture-of-Experts framework for Directed Controller Synthesis, enhancing exploration policies in reinforcemen...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19225] Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

The paper presents Proximity-Based Multi-Turn Optimization (ProxMO), a framework designed to improve credit assignment in LLM agent train...

arXiv - AI · 4 min · about 1 month ago

Ai Agents

[2602.19223] Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

This paper presents a comprehensive benchmark for Multi-Agent Reinforcement Learning (MARL) applied to urban energy management using the ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.18694] In-Context Planning with Latent Temporal Abstractions

The paper presents I-TAP, a novel offline reinforcement learning framework that enhances planning in continuous control by utilizing late...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.19160] Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

This paper evaluates the reasoning capabilities of Large Language Models (LLMs) through General Game Playing tasks, revealing performance...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18679] Transformers for dynamical systems learn transfer operators in-context

This article explores how transformers can learn transfer operators for dynamical systems through in-context learning, enabling zero-shot...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19158] DoAtlas-1: A Causal Compilation Paradigm for Clinical AI

The paper presents DoAtlas-1, a novel causal compilation paradigm for clinical AI that transforms medical evidence into executable code, ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19109] Post-Routing Arithmetic in Llama-3: Last-Token Result Writing and Rotation-Structured Digit Directions

The paper examines three-digit addition in Meta-Llama-3-8B, focusing on how arithmetic results are determined post-routing, emphasizing t...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19071] Defining Explainable AI for Requirements Analysis

This paper defines the requirements for Explainable AI (XAI) in the context of requirements analysis, focusing on the dimensions of Sourc...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18649] Global Low-Rank, Local Full-Rank: The Holographic Encoding of Learned Algorithms

This paper explores the holographic encoding principle in neural networks, demonstrating that learned algorithms exhibit global low-rank ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.19065] Agentic Problem Frames: A Systematic Approach to Engineering Reliable Domain Agents

The paper introduces Agentic Problem Frames (APF), a framework for developing reliable domain agents by focusing on structured interactio...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18639] Learning Invariant Visual Representations for Planning with Joint-Embedding Predictive World Models

This paper presents a novel approach to improving the robustness of latent predictive world models in machine learning by addressing the ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.19006] Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

This article evaluates 15 large language models on quantum mechanics problem-solving across diverse tasks, revealing performance stratifi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18637] Online decoding of rat self-paced locomotion speed from EEG using recurrent neural networks

This study explores the online decoding of self-paced locomotion speed in rats using non-invasive EEG and recurrent neural networks, achi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.19000] MagicAgent: Towards Generalized Agent Planning

The paper presents MagicAgent, a series of foundation models aimed at improving generalized agent planning in AI, addressing challenges i...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18628] Non-Interfering Weight Fields: Treating Model Parameters as a Continuously Extensible Function

The paper introduces Non-Interfering Weight Fields (NIWF), a novel framework that allows neural networks to extend capabilities without f...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18998] Benchmark Test-Time Scaling of General LLM Agents

This paper introduces General AgentBench, a benchmark for evaluating general LLM agents across various domains, revealing performance cha...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.18600] MapTab: Can MLLMs Master Constrained Route Planning?

The paper introduces MapTab, a benchmark for evaluating Multimodal Large Language Models (MLLMs) on constrained route planning tasks, hig...

arXiv - Machine Learning · 3 min · about 1 month ago

Robotics

[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

InfEngine is an innovative autonomous engine designed to enhance infrared radiation computing by automating workflows, achieving a 92.7% ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18584] GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry

The paper presents GIST, a method for targeted data selection in instruction tuning, improving efficiency by aligning training gradients ...

arXiv - AI · 4 min · about 1 month ago

Previous Page 82 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

You can now give an AI agent its own email, phone number, wallet, computer, and voice. This is what the stack looks like

AI agents have been blindly guessing your UI this whole time. Here's the file that fixes it.

OpenClaw security checklist: practical safeguards for AI agents

All Content

[2602.19244] Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts

[2602.19225] Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

[2602.19223] Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

[2602.18694] In-Context Planning with Latent Temporal Abstractions

[2602.19160] Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

[2602.18679] Transformers for dynamical systems learn transfer operators in-context

[2602.19158] DoAtlas-1: A Causal Compilation Paradigm for Clinical AI

[2602.19109] Post-Routing Arithmetic in Llama-3: Last-Token Result Writing and Rotation-Structured Digit Directions

[2602.19071] Defining Explainable AI for Requirements Analysis

[2602.18649] Global Low-Rank, Local Full-Rank: The Holographic Encoding of Learned Algorithms

[2602.19065] Agentic Problem Frames: A Systematic Approach to Engineering Reliable Domain Agents

[2602.18639] Learning Invariant Visual Representations for Planning with Joint-Embedding Predictive World Models

[2602.19006] Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

[2602.18637] Online decoding of rat self-paced locomotion speed from EEG using recurrent neural networks

[2602.19000] MagicAgent: Towards Generalized Agent Planning

[2602.18628] Non-Interfering Weight Fields: Treating Model Parameters as a Continuously Extensible Function

[2602.18998] Benchmark Test-Time Scaling of General LLM Agents

[2602.18600] MapTab: Can MLLMs Master Constrained Route Planning?

[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

[2602.18584] GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry

Related Topics

Stay updated with AI News