AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Enabling agent-first process redesign | MIT Technology Review
Nlp

Enabling agent-first process redesign | MIT Technology Review

Unlike static, rules-based systems, AI agents can learn, adapt, and optimize processes dynamically. As they interact with data, systems, ...

MIT Technology Review - AI · 4 min ·
Llms

Stop Overcomplicating AI Workflows. This Is the Simple Framework

I’ve been working on building an agentic AI workflow system for business use cases and one thing became very clear very quickly. This is ...

Reddit - Artificial Intelligence · 1 min ·
Ai Agents

The "Jarvis on day one" trap: why trying to build one AI agent that does everything costs you months

Something I've been thinking about after spending a few months actually trying to build my own AI agent: the biggest trap in this space i...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.17098] Deep Reinforcement Learning for Optimal Portfolio Allocation: A Comparative Study with Mean-Variance Optimization
Machine Learning

[2602.17098] Deep Reinforcement Learning for Optimal Portfolio Allocation: A Comparative Study with Mean-Variance Optimization

This article presents a comparative study of Deep Reinforcement Learning (DRL) and Mean-Variance Optimization (MVO) for optimal portfolio...

arXiv - Machine Learning · 4 min ·
[2602.17071] AdvSynGNN: Structure-Adaptive Graph Neural Nets via Adversarial Synthesis and Self-Corrective Propagation
Machine Learning

[2602.17071] AdvSynGNN: Structure-Adaptive Graph Neural Nets via Adversarial Synthesis and Self-Corrective Propagation

The paper presents AdvSynGNN, a novel architecture for graph neural networks that enhances resilience against structural noise and non-ho...

arXiv - AI · 3 min ·
[2602.17054] ALPS: A Diagnostic Challenge Set for Arabic Linguistic & Pragmatic Reasoning
Nlp

[2602.17054] ALPS: A Diagnostic Challenge Set for Arabic Linguistic & Pragmatic Reasoning

The paper introduces ALPS, a diagnostic challenge set designed to evaluate Arabic linguistic and pragmatic reasoning, highlighting the li...

arXiv - AI · 4 min ·
[2602.17037] Wink: Recovering from Misbehaviors in Coding Agents
Llms

[2602.17037] Wink: Recovering from Misbehaviors in Coding Agents

The paper presents 'Wink', a system designed to recover coding agents from misbehaviors, enhancing their reliability in software developm...

arXiv - AI · 4 min ·
[2602.17027] Transforming Behavioral Neuroscience Discovery with In-Context Learning and AI-Enhanced Tensor Methods
Data Science

[2602.17027] Transforming Behavioral Neuroscience Discovery with In-Context Learning and AI-Enhanced Tensor Methods

This article discusses the integration of AI and In-Context Learning to enhance behavioral neuroscience research, particularly in underst...

arXiv - AI · 4 min ·
[2602.17022] ReIn: Conversational Error Recovery with Reasoning Inception
Llms

[2602.17022] ReIn: Conversational Error Recovery with Reasoning Inception

The paper presents Reasoning Inception (ReIn), a method for improving conversational agents' error recovery without altering their parame...

arXiv - AI · 4 min ·
[2602.17003] Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History
Llms

[2602.17003] Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History

The paper introduces Persona2Web, a benchmark for evaluating personalized web agents that utilize user history to resolve ambiguous queri...

arXiv - AI · 3 min ·
[2602.16997] Exploring LLMs for User Story Extraction from Mockups
Llms

[2602.16997] Exploring LLMs for User Story Extraction from Mockups

This article explores the use of large language models (LLMs) for extracting user stories from high-fidelity mockups, enhancing requireme...

arXiv - AI · 3 min ·
[2602.16967] Early-Warning Signals of Grokking via Loss-Landscape Geometry
Machine Learning

[2602.16967] Early-Warning Signals of Grokking via Loss-Landscape Geometry

The paper explores early-warning signals of 'grokking' in machine learning, focusing on the commutator defect as a precursor to generaliz...

arXiv - AI · 4 min ·
[2602.16966] A Unified Framework for Locality in Scalable MARL
Ai Agents

[2602.16966] A Unified Framework for Locality in Scalable MARL

This paper presents a unified framework addressing locality in scalable Multi-Agent Reinforcement Learning (MARL), proposing a novel poli...

arXiv - AI · 4 min ·
[2602.16959] Eigenmood Space: Uncertainty-Aware Spectral Graph Analysis of Psychological Patterns in Classical Persian Poetry
Data Science

[2602.16959] Eigenmood Space: Uncertainty-Aware Spectral Graph Analysis of Psychological Patterns in Classical Persian Poetry

This article presents a framework for analyzing psychological patterns in Classical Persian poetry using uncertainty-aware spectral graph...

arXiv - AI · 4 min ·
[2602.16957] When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English
Machine Learning

[2602.16957] When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English

This study explores the challenges of cross-lingual euphemism transfer between Turkish and English, highlighting the limitations of seman...

arXiv - AI · 3 min ·
[2602.16947] Beyond Message Passing: A Symbolic Alternative for Expressive and Interpretable Graph Learning
Machine Learning

[2602.16947] Beyond Message Passing: A Symbolic Alternative for Expressive and Interpretable Graph Learning

The paper presents SymGraph, a novel symbolic framework that enhances graph learning by overcoming limitations of traditional message-pas...

arXiv - Machine Learning · 3 min ·
[2602.16932] RankEvolve: Automating the Discovery of Retrieval Algorithms via LLM-Driven Evolution
Llms

[2602.16932] RankEvolve: Automating the Discovery of Retrieval Algorithms via LLM-Driven Evolution

The paper presents RankEvolve, a novel approach utilizing large language models (LLMs) to automate the discovery of retrieval algorithms,...

arXiv - AI · 3 min ·
[2602.16930] Say It My Way: Exploring Control in Conversational Visual Question Answering with Blind Users
Generative Ai

[2602.16930] Say It My Way: Exploring Control in Conversational Visual Question Answering with Blind Users

The paper explores how blind users can customize interactions with conversational visual question answering systems, highlighting the nee...

arXiv - AI · 4 min ·
[2602.16928] Discovering Multiagent Learning Algorithms with Large Language Models
Llms

[2602.16928] Discovering Multiagent Learning Algorithms with Large Language Models

This paper explores the use of large language models to automatically discover new multiagent learning algorithms, enhancing the efficien...

arXiv - AI · 4 min ·
[2602.16898] MALLVI: a multi agent framework for integrated generalized robotics manipulation
Llms

[2602.16898] MALLVI: a multi agent framework for integrated generalized robotics manipulation

The paper presents MALLVI, a multi-agent framework for robotic manipulation that utilizes closed-loop feedback to enhance task planning a...

arXiv - Machine Learning · 4 min ·
[2602.16873] AdaptOrch: Task-Adaptive Multi-Agent Orchestration in the Era of LLM Performance Convergence
Llms

[2602.16873] AdaptOrch: Task-Adaptive Multi-Agent Orchestration in the Era of LLM Performance Convergence

The paper presents AdaptOrch, a framework for task-adaptive multi-agent orchestration that enhances performance by optimizing orchestrati...

arXiv - AI · 4 min ·
[2602.16863] SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation
Robotics

[2602.16863] SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation

SimToolReal presents a novel approach to zero-shot dexterous tool manipulation using an object-centric policy, enhancing robotic capabili...

arXiv - AI · 4 min ·
[2602.16844] Overseeing Agents Without Constant Oversight: Challenges and Opportunities
Ai Agents

[2602.16844] Overseeing Agents Without Constant Oversight: Challenges and Opportunities

This article explores the challenges and opportunities in overseeing AI agents without constant human oversight, focusing on user studies...

arXiv - AI · 3 min ·
Previous Page 103 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime