AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

You can now give an AI agent its own email, phone number, wallet, computer, and voice. This is what the stack looks like

I’ve been tracking the companies building primitives specifically for agents rather than humans. The pattern is becoming obvious: every c...

Reddit - Artificial Intelligence · 1 min ·
Ai Agents

AI agents have been blindly guessing your UI this whole time. Here's the file that fixes it.

Every time you ask an AI coding agent to build UI, it invents everything from scratch. Colors. Fonts. Spacing. Button styles. All of it -...

Reddit - Artificial Intelligence · 1 min ·
Llms

OpenClaw security checklist: practical safeguards for AI agents

Here is one of the better quality guides on the ensuring safety when deploying OpenClaw: https://chatgptguide.ai/openclaw-security-checkl...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.19244] Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts
Machine Learning

[2602.19244] Robust Exploration in Directed Controller Synthesis via Reinforcement Learning with Soft Mixture-of-Experts

This paper presents a Soft Mixture-of-Experts framework for Directed Controller Synthesis, enhancing exploration policies in reinforcemen...

arXiv - Machine Learning · 3 min ·
[2602.19225] Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training
Llms

[2602.19225] Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training

The paper presents Proximity-Based Multi-Turn Optimization (ProxMO), a framework designed to improve credit assignment in LLM agent train...

arXiv - AI · 4 min ·
[2602.19223] Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment
Ai Agents

[2602.19223] Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

This paper presents a comprehensive benchmark for Multi-Agent Reinforcement Learning (MARL) applied to urban energy management using the ...

arXiv - Machine Learning · 4 min ·
[2602.18694] In-Context Planning with Latent Temporal Abstractions
Machine Learning

[2602.18694] In-Context Planning with Latent Temporal Abstractions

The paper presents I-TAP, a novel offline reinforcement learning framework that enhances planning in continuous control by utilizing late...

arXiv - AI · 4 min ·
[2602.19160] Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing
Llms

[2602.19160] Reasoning Capabilities of Large Language Models. Lessons Learned from General Game Playing

This paper evaluates the reasoning capabilities of Large Language Models (LLMs) through General Game Playing tasks, revealing performance...

arXiv - AI · 4 min ·
[2602.18679] Transformers for dynamical systems learn transfer operators in-context
Llms

[2602.18679] Transformers for dynamical systems learn transfer operators in-context

This article explores how transformers can learn transfer operators for dynamical systems through in-context learning, enabling zero-shot...

arXiv - Machine Learning · 3 min ·
[2602.19158] DoAtlas-1: A Causal Compilation Paradigm for Clinical AI
Llms

[2602.19158] DoAtlas-1: A Causal Compilation Paradigm for Clinical AI

The paper presents DoAtlas-1, a novel causal compilation paradigm for clinical AI that transforms medical evidence into executable code, ...

arXiv - AI · 3 min ·
[2602.19109] Post-Routing Arithmetic in Llama-3: Last-Token Result Writing and Rotation-Structured Digit Directions
Llms

[2602.19109] Post-Routing Arithmetic in Llama-3: Last-Token Result Writing and Rotation-Structured Digit Directions

The paper examines three-digit addition in Meta-Llama-3-8B, focusing on how arithmetic results are determined post-routing, emphasizing t...

arXiv - AI · 3 min ·
[2602.19071] Defining Explainable AI for Requirements Analysis
Machine Learning

[2602.19071] Defining Explainable AI for Requirements Analysis

This paper defines the requirements for Explainable AI (XAI) in the context of requirements analysis, focusing on the dimensions of Sourc...

arXiv - AI · 3 min ·
[2602.18649] Global Low-Rank, Local Full-Rank: The Holographic Encoding of Learned Algorithms
Machine Learning

[2602.18649] Global Low-Rank, Local Full-Rank: The Holographic Encoding of Learned Algorithms

This paper explores the holographic encoding principle in neural networks, demonstrating that learned algorithms exhibit global low-rank ...

arXiv - AI · 4 min ·
[2602.19065] Agentic Problem Frames: A Systematic Approach to Engineering Reliable Domain Agents
Llms

[2602.19065] Agentic Problem Frames: A Systematic Approach to Engineering Reliable Domain Agents

The paper introduces Agentic Problem Frames (APF), a framework for developing reliable domain agents by focusing on structured interactio...

arXiv - AI · 4 min ·
[2602.18639] Learning Invariant Visual Representations for Planning with Joint-Embedding Predictive World Models
Machine Learning

[2602.18639] Learning Invariant Visual Representations for Planning with Joint-Embedding Predictive World Models

This paper presents a novel approach to improving the robustness of latent predictive world models in machine learning by addressing the ...

arXiv - Machine Learning · 4 min ·
[2602.19006] Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks
Llms

[2602.19006] Evaluating Large Language Models on Quantum Mechanics: A Comparative Study Across Diverse Models and Tasks

This article evaluates 15 large language models on quantum mechanics problem-solving across diverse tasks, revealing performance stratifi...

arXiv - AI · 4 min ·
[2602.18637] Online decoding of rat self-paced locomotion speed from EEG using recurrent neural networks
Machine Learning

[2602.18637] Online decoding of rat self-paced locomotion speed from EEG using recurrent neural networks

This study explores the online decoding of self-paced locomotion speed in rats using non-invasive EEG and recurrent neural networks, achi...

arXiv - Machine Learning · 4 min ·
[2602.19000] MagicAgent: Towards Generalized Agent Planning
Llms

[2602.19000] MagicAgent: Towards Generalized Agent Planning

The paper presents MagicAgent, a series of foundation models aimed at improving generalized agent planning in AI, addressing challenges i...

arXiv - AI · 4 min ·
[2602.18628] Non-Interfering Weight Fields: Treating Model Parameters as a Continuously Extensible Function
Llms

[2602.18628] Non-Interfering Weight Fields: Treating Model Parameters as a Continuously Extensible Function

The paper introduces Non-Interfering Weight Fields (NIWF), a novel framework that allows neural networks to extend capabilities without f...

arXiv - AI · 4 min ·
[2602.18998] Benchmark Test-Time Scaling of General LLM Agents
Llms

[2602.18998] Benchmark Test-Time Scaling of General LLM Agents

This paper introduces General AgentBench, a benchmark for evaluating general LLM agents across various domains, revealing performance cha...

arXiv - AI · 3 min ·
[2602.18600] MapTab: Can MLLMs Master Constrained Route Planning?
Llms

[2602.18600] MapTab: Can MLLMs Master Constrained Route Planning?

The paper introduces MapTab, a benchmark for evaluating Multimodal Large Language Models (MLLMs) on constrained route planning tasks, hig...

arXiv - Machine Learning · 3 min ·
[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing
Robotics

[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

InfEngine is an innovative autonomous engine designed to enhance infrared radiation computing by automating workflows, achieving a 92.7% ...

arXiv - AI · 3 min ·
[2602.18584] GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry
Machine Learning

[2602.18584] GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry

The paper presents GIST, a method for targeted data selection in instruction tuning, improving efficiency by aligning training gradients ...

arXiv - AI · 4 min ·
Previous Page 82 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime