AI Agents

Autonomous agents, tool use, and agentic systems


Top This Week

NLP

Enabling agent-first process redesign

Unlike static, rules-based systems, AI agents can learn, adapt, and optimize processes dynamically. As they interact with data, systems, ...

MIT Technology Review - AI · 4 min · about 2 hours ago

LLMs

Stop Overcomplicating AI Workflows. This Is the Simple Framework

I’ve been working on building an agentic AI workflow system for business use cases and one thing became very clear very quickly. This is ...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

AI Agents

The "Jarvis on day one" trap: why trying to build one AI agent that does everything costs you months

Something I've been thinking about after spending a few months actually trying to build my own AI agent: the biggest trap in this space i...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

All Content

Machine Learning

[2602.17098] Deep Reinforcement Learning for Optimal Portfolio Allocation: A Comparative Study with Mean-Variance Optimization

This article presents a comparative study of Deep Reinforcement Learning (DRL) and Mean-Variance Optimization (MVO) for optimal portfolio...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.17071] AdvSynGNN: Structure-Adaptive Graph Neural Nets via Adversarial Synthesis and Self-Corrective Propagation

The paper presents AdvSynGNN, a novel architecture for graph neural networks that enhances resilience against structural noise and non-ho...

arXiv - AI · 3 min · about 2 months ago

NLP

[2602.17054] ALPS: A Diagnostic Challenge Set for Arabic Linguistic & Pragmatic Reasoning

The paper introduces ALPS, a diagnostic challenge set designed to evaluate Arabic linguistic and pragmatic reasoning, highlighting the li...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.17037] Wink: Recovering from Misbehaviors in Coding Agents

The paper presents 'Wink', a system designed to recover coding agents from misbehaviors, enhancing their reliability in software developm...

arXiv - AI · 4 min · about 2 months ago

Data Science

[2602.17027] Transforming Behavioral Neuroscience Discovery with In-Context Learning and AI-Enhanced Tensor Methods

This article discusses the integration of AI and In-Context Learning to enhance behavioral neuroscience research, particularly in underst...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.17022] ReIn: Conversational Error Recovery with Reasoning Inception

The paper presents Reasoning Inception (ReIn), a method for improving conversational agents' error recovery without altering their parame...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.17003] Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History

The paper introduces Persona2Web, a benchmark for evaluating personalized web agents that utilize user history to resolve ambiguous queri...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.16997] Exploring LLMs for User Story Extraction from Mockups

This article explores the use of large language models (LLMs) for extracting user stories from high-fidelity mockups, enhancing requireme...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16967] Early-Warning Signals of Grokking via Loss-Landscape Geometry

The paper explores early-warning signals of 'grokking' in machine learning, focusing on the commutator defect as a precursor to generaliz...

arXiv - AI · 4 min · about 2 months ago

AI Agents

[2602.16966] A Unified Framework for Locality in Scalable MARL

This paper presents a unified framework addressing locality in scalable Multi-Agent Reinforcement Learning (MARL), proposing a novel poli...

arXiv - AI · 4 min · about 2 months ago

Data Science

[2602.16959] Eigenmood Space: Uncertainty-Aware Spectral Graph Analysis of Psychological Patterns in Classical Persian Poetry

This article presents a framework for analyzing psychological patterns in Classical Persian poetry using uncertainty-aware spectral graph...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.16957] When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English

This study explores the challenges of cross-lingual euphemism transfer between Turkish and English, highlighting the limitations of seman...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16947] Beyond Message Passing: A Symbolic Alternative for Expressive and Interpretable Graph Learning

The paper presents SymGraph, a novel symbolic framework that enhances graph learning by overcoming limitations of traditional message-pas...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.16932] RankEvolve: Automating the Discovery of Retrieval Algorithms via LLM-Driven Evolution

The paper presents RankEvolve, a novel approach utilizing large language models (LLMs) to automate the discovery of retrieval algorithms,...

arXiv - AI · 3 min · about 2 months ago

Generative AI

[2602.16930] Say It My Way: Exploring Control in Conversational Visual Question Answering with Blind Users

The paper explores how blind users can customize interactions with conversational visual question answering systems, highlighting the nee...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.16928] Discovering Multiagent Learning Algorithms with Large Language Models

This paper explores the use of large language models to automatically discover new multiagent learning algorithms, enhancing the efficien...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.16898] MALLVI: a multi agent framework for integrated generalized robotics manipulation

The paper presents MALLVI, a multi-agent framework for robotic manipulation that utilizes closed-loop feedback to enhance task planning a...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.16873] AdaptOrch: Task-Adaptive Multi-Agent Orchestration in the Era of LLM Performance Convergence

The paper presents AdaptOrch, a framework for task-adaptive multi-agent orchestration that enhances performance by optimizing orchestrati...

arXiv - AI · 4 min · about 2 months ago

Robotics

[2602.16863] SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation

SimToolReal presents a novel approach to zero-shot dexterous tool manipulation using an object-centric policy, enhancing robotic capabili...

arXiv - AI · 4 min · about 2 months ago

AI Agents

[2602.16844] Overseeing Agents Without Constant Oversight: Challenges and Opportunities

This article explores the challenges and opportunities in overseeing AI agents without constant human oversight, focusing on user studies...

arXiv - AI · 3 min · about 2 months ago
