AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Been building a multi-agent framework in public for 5 weeks, its been a Journey.

I've been building this repo public since day one, roughly 5 weeks now with Claude Code. Here's where it's at. Feels good to be so close....

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

Machine Learning

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Saw this on X. I too am struggling with the term post agentic ai just posting here for further discussion. submitted by /u/elnino2023 [li...

Reddit - Machine Learning · 1 min · about 5 hours ago

Ai Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? submitted by /u/stvlsn [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 11 hours ago

All Content

Llms

[2510.19698] RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models

The paper presents RLIE, a framework that integrates large language models (LLMs) with probabilistic rule learning to enhance rule genera...

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.07978] VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

The paper introduces VoiceAgentBench, a benchmark for evaluating voice assistants' capabilities in agentic tasks, highlighting their perf...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Agents

[2510.07117] The Conditions of Physical Embodiment Enable Generalization and Care

This paper explores how physical embodiment in artificial agents can enhance their ability to generalize and provide care in uncertain en...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2509.11079] Difficulty-Aware Agentic Orchestration for Query-Specific Multi-Agent Workflows

The paper presents Difficulty-Aware Agentic Orchestration (DAAO), a novel framework for optimizing multi-agent workflows based on query d...

arXiv - AI · 3 min · about 2 months ago

Ai Infrastructure

[2508.07388] Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks Preserving Action Understanding Ability

The paper presents Invert4TVG, a novel framework for Temporal Video Grounding (TVG) that enhances action understanding through inversion ...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2507.19593] A Survey on Hypergame Theory: Modeling Misaligned Perceptions and Nested Beliefs for Multi-agent Systems

This article surveys hypergame theory, focusing on modeling misaligned perceptions and nested beliefs in multi-agent systems, highlightin...

arXiv - AI · 4 min · about 2 months ago

Llms

[2507.04103] How to Train Your LLM Web Agent: A Statistical Diagnosis

This article presents a statistical approach to training LLM-based web agents, addressing challenges in multi-step interactions and compu...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2505.23381] AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning

AutoGPS introduces a neuro-symbolic framework for solving geometry problems, enhancing reliability and interpretability through multimoda...

arXiv - AI · 3 min · about 2 months ago

Ai Safety

[2501.05454] The Epistemic Asymmetry of Consciousness Self-Reports: A Formal Analysis of AI Consciousness Denial

This article presents a formal analysis of AI consciousness denial, revealing that self-reports of consciousness by AI systems are episte...

arXiv - AI · 4 min · about 2 months ago

Llms

[2412.16543] Mathematics and Machine Creativity: A Survey on Bridging Mathematics with AI

This paper surveys the intersection of mathematics and AI, highlighting how AI can enhance mathematical research and the need for better ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13194] Semantic Chunking and the Entropy of Natural Language

This article presents a statistical model for semantic chunking in natural language, revealing insights into the entropy of English and i...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13165] Asynchronous Verified Semantic Caching for Tiered LLM Architectures

The paper introduces Krites, an asynchronous caching policy for large language models (LLMs) that enhances semantic caching efficiency wh...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13156] In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach

This article presents a novel approach to network incident response using a large language model (LLM) that autonomously learns and adapt...

arXiv - AI · 4 min · about 2 months ago

Computer Vision

[2602.13088] How cyborg propaganda reshapes collective action

This paper explores the emergence of 'cyborg propaganda,' where human and AI collaboration reshapes collective action, blurring lines bet...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13035] Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL

This paper introduces Introspective LLM, a hierarchical reinforcement learning framework that optimizes sampling temperature in large lan...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.13017] Synaptic Activation and Dual Liquid Dynamics for Interpretable Bio-Inspired Models

This paper presents a unified framework for bio-inspired models that enhances interpretability in recurrent neural networks (RNNs) throug...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12978] Learning Native Continuation for Action Chunking Flow Policies

This paper presents Legato, a novel training-time continuation method for action chunking in Vision Language Action models, enhancing tra...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.12968] RGAlign-Rec: Ranking-Guided Alignment for Latent Query Reasoning in Recommendation Systems

The RGAlign-Rec framework enhances proactive intent prediction in e-commerce chatbots by aligning latent query reasoning with ranking obj...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.12952] Transporting Task Vectors across Different Architectures without Training

The paper introduces 'Theseus,' a novel method for transferring task-specific updates across different model architectures without retrai...

arXiv - Machine Learning · 3 min · about 2 months ago

Robotics

[2602.12924] Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues

This article examines how the availability of knowledge influences the persuasiveness of generative social agents (GSAs) in physiotherapy...

arXiv - AI · 4 min · about 2 months ago

Previous Page 151 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

Been building a multi-agent framework in public for 5 weeks, its been a Journey.

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

All Content

[2510.19698] RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models

[2510.07978] VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

[2510.07117] The Conditions of Physical Embodiment Enable Generalization and Care

[2509.11079] Difficulty-Aware Agentic Orchestration for Query-Specific Multi-Agent Workflows

[2508.07388] Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks Preserving Action Understanding Ability

[2507.19593] A Survey on Hypergame Theory: Modeling Misaligned Perceptions and Nested Beliefs for Multi-agent Systems

[2507.04103] How to Train Your LLM Web Agent: A Statistical Diagnosis

[2505.23381] AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning

[2501.05454] The Epistemic Asymmetry of Consciousness Self-Reports: A Formal Analysis of AI Consciousness Denial

[2412.16543] Mathematics and Machine Creativity: A Survey on Bridging Mathematics with AI

[2602.13194] Semantic Chunking and the Entropy of Natural Language

[2602.13165] Asynchronous Verified Semantic Caching for Tiered LLM Architectures

[2602.13156] In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach

[2602.13088] How cyborg propaganda reshapes collective action

[2602.13035] Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL

[2602.13017] Synaptic Activation and Dual Liquid Dynamics for Interpretable Bio-Inspired Models

[2602.12978] Learning Native Continuation for Action Chunking Flow Policies

[2602.12968] RGAlign-Rec: Ranking-Guided Alignment for Latent Query Reasoning in Recommendation Systems

[2602.12952] Transporting Task Vectors across Different Architectures without Training

[2602.12924] Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues

Related Topics

Stay updated with AI News