AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

LLMs

Been building a multi-agent framework in public for 5 weeks, it's been a journey.

I've been building this repo in public since day one, roughly 5 weeks now, with Claude Code. Here's where it's at. Feels good to be so close....

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Saw this on X. I too am struggling with the term "post-agentic AI"; just posting here for further discussion. Submitted by /u/elnino2023 [li...

Reddit - Machine Learning · 1 min ·
AI Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? Submitted by /u/stvlsn.

Reddit - Artificial Intelligence · 1 min ·

All Content

[2510.19698] RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models
LLMs

The paper presents RLIE, a framework that integrates large language models (LLMs) with probabilistic rule learning to enhance rule genera...

arXiv - AI · 4 min ·
[2510.07978] VoiceAgentBench: Are Voice Assistants ready for agentic tasks?
LLMs

The paper introduces VoiceAgentBench, a benchmark for evaluating voice assistants' capabilities in agentic tasks, highlighting their perf...

arXiv - Machine Learning · 4 min ·
[2510.07117] The Conditions of Physical Embodiment Enable Generalization and Care
AI Agents

This paper explores how physical embodiment in artificial agents can enhance their ability to generalize and provide care in uncertain en...

arXiv - Machine Learning · 4 min ·
[2509.11079] Difficulty-Aware Agentic Orchestration for Query-Specific Multi-Agent Workflows
LLMs

The paper presents Difficulty-Aware Agentic Orchestration (DAAO), a novel framework for optimizing multi-agent workflows based on query d...

arXiv - AI · 3 min ·
[2508.07388] Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks Preserving Action Understanding Ability
AI Infrastructure

The paper presents Invert4TVG, a novel framework for Temporal Video Grounding (TVG) that enhances action understanding through inversion ...

arXiv - AI · 4 min ·
[2507.19593] A Survey on Hypergame Theory: Modeling Misaligned Perceptions and Nested Beliefs for Multi-agent Systems
Machine Learning

This article surveys hypergame theory, focusing on modeling misaligned perceptions and nested beliefs in multi-agent systems, highlightin...

arXiv - AI · 4 min ·
[2507.04103] How to Train Your LLM Web Agent: A Statistical Diagnosis
LLMs

This article presents a statistical approach to training LLM-based web agents, addressing challenges in multi-step interactions and compu...

arXiv - Machine Learning · 4 min ·
[2505.23381] AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning
Machine Learning

AutoGPS introduces a neuro-symbolic framework for solving geometry problems, enhancing reliability and interpretability through multimoda...

arXiv - AI · 3 min ·
[2501.05454] The Epistemic Asymmetry of Consciousness Self-Reports: A Formal Analysis of AI Consciousness Denial
AI Safety

This article presents a formal analysis of AI consciousness denial, revealing that self-reports of consciousness by AI systems are episte...

arXiv - AI · 4 min ·
[2412.16543] Mathematics and Machine Creativity: A Survey on Bridging Mathematics with AI
LLMs

This paper surveys the intersection of mathematics and AI, highlighting how AI can enhance mathematical research and the need for better ...

arXiv - AI · 4 min ·
[2602.13194] Semantic Chunking and the Entropy of Natural Language
LLMs

This article presents a statistical model for semantic chunking in natural language, revealing insights into the entropy of English and i...

arXiv - AI · 4 min ·
[2602.13165] Asynchronous Verified Semantic Caching for Tiered LLM Architectures
LLMs

The paper introduces Krites, an asynchronous caching policy for large language models (LLMs) that enhances semantic caching efficiency wh...

arXiv - AI · 4 min ·
[2602.13156] In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach
LLMs

This article presents a novel approach to network incident response using a large language model (LLM) that autonomously learns and adapt...

arXiv - AI · 4 min ·
[2602.13088] How cyborg propaganda reshapes collective action
Computer Vision

This paper explores the emergence of 'cyborg propaganda,' where human and AI collaboration reshapes collective action, blurring lines bet...

arXiv - AI · 4 min ·
[2602.13035] Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL
LLMs

This paper introduces Introspective LLM, a hierarchical reinforcement learning framework that optimizes sampling temperature in large lan...

arXiv - Machine Learning · 3 min ·
[2602.13017] Synaptic Activation and Dual Liquid Dynamics for Interpretable Bio-Inspired Models
Machine Learning

This paper presents a unified framework for bio-inspired models that enhances interpretability in recurrent neural networks (RNNs) throug...

arXiv - Machine Learning · 3 min ·
[2602.12978] Learning Native Continuation for Action Chunking Flow Policies
Machine Learning

This paper presents Legato, a novel training-time continuation method for action chunking in Vision Language Action models, enhancing tra...

arXiv - AI · 3 min ·
[2602.12968] RGAlign-Rec: Ranking-Guided Alignment for Latent Query Reasoning in Recommendation Systems
LLMs

The RGAlign-Rec framework enhances proactive intent prediction in e-commerce chatbots by aligning latent query reasoning with ranking obj...

arXiv - AI · 4 min ·
[2602.12952] Transporting Task Vectors across Different Architectures without Training
Machine Learning

The paper introduces 'Theseus,' a novel method for transferring task-specific updates across different model architectures without retrai...

arXiv - Machine Learning · 3 min ·
[2602.12924] Never say never: Exploring the effects of available knowledge on agent persuasiveness in controlled physiotherapy motivation dialogues
Robotics

This article examines how the availability of knowledge influences the persuasiveness of generative social agents (GSAs) in physiotherapy...

arXiv - AI · 4 min ·