AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

I got tired of 3 AM PagerDuty alerts, so I built an AI agent to fix cloud outages while I sleep. (Built with GLM-5.1)

If you've ever been on-call, you know the nightmare. It’s 3:15 AM. You get pinged because heavily-loaded database nodes in us-east-1 are ...

Reddit - Artificial Intelligence · 1 min · 11 minutes ago

Ai Agents

NeuBird AI Raises $19.3 Million To Scale Agentic AI

AI News - General · 4 min · 27 minutes ago

Ai Agents

CodeGraphContext - An MCP server that converts your codebase into a graph database

CodeGraphContext- the go to solution for graph-code indexing 🎉🎉... It's an MCP server that understands a codebase as a graph, not chunks ...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

All Content

Generative Ai

[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students

This study audits the collaboration between online graduate CS students and AI, exploring preferences for automation in academic tasks an...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

This paper presents Contrastive Object-centric Diffusion Alignment (CODA), an enhancement to object-centric learning that reduces slot en...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2512.23482] Theory of Mind for Explainable Human-Robot Interaction

This article explores the integration of Theory of Mind (ToM) in human-robot interaction (HRI) to enhance robot interpretability and user...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2512.22213] On the Existence and Behavior of Secondary Attention Sinks

This paper explores the concept of secondary attention sinks in machine learning models, highlighting their distinct properties and behav...

arXiv - AI · 4 min · about 2 months ago

Llms

[2511.00794] Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration

This paper presents PREPO, a novel approach to enhance data efficiency in reinforcement learning for large language models by leveraging ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2511.00040] Semi-Supervised Preference Optimization with Limited Feedback

This paper discusses Semi-Supervised Preference Optimization (SSPO), which reduces the need for extensive labeled feedback in preference ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2510.15297] VERA-MH Concept Paper

The VERA-MH Concept Paper outlines an innovative framework for evaluating AI chatbots in mental health contexts, focusing on suicide risk...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

The paper presents pi-Flow, a novel approach to few-step generation in machine learning that utilizes imitation distillation to enhance m...

arXiv - AI · 4 min · about 2 months ago

Generative Ai

[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration

CareerPooler introduces an AI-driven metaphorical simulation for career exploration, enhancing user engagement and decision-making throug...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2506.21039] Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning

The paper presents Strict Subgoal Execution (SSE), a novel framework for hierarchical reinforcement learning that enhances long-horizon p...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2506.20555] DeepQuark: A Deep-Neural-Network Approach to Multiquark Bound States

The paper presents DeepQuark, a novel deep-neural-network approach for analyzing multiquark bound states, demonstrating superior performa...

arXiv - AI · 4 min · about 2 months ago

Llms

[2506.11798] Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models

This paper explores the use of Large Language Models (LLMs) to simulate voting behavior in the European Parliament through persona-driven...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2505.17508] On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

This paper presents a unified framework for KL-regularized policy gradient algorithms aimed at enhancing reasoning in large language mode...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2503.04121] Simple Self Organizing Map with Vision Transformers

This paper explores the integration of Self-Organizing Maps (SOMs) with Vision Transformers (ViTs) to enhance performance on small datase...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2502.03752] Self-Improving Skill Learning for Robust Skill-based Meta-Reinforcement Learning

This paper presents Self-Improving Skill Learning (SISL), a novel approach to enhance skill-based meta-reinforcement learning by refining...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2412.18362] Point-DeepONet: Predicting Nonlinear Fields on Non-Parametric Geometries under Variable Load Conditions

Point-DeepONet introduces a novel approach for predicting nonlinear fields in engineering, leveraging deep learning to enhance efficiency...

arXiv - AI · 4 min · about 2 months ago

Llms

[2412.02039] Multi-View 3D Reconstruction using Knowledge Distillation

This paper presents a knowledge distillation approach for Multi-View 3D reconstruction, utilizing a teacher-student model framework to en...

arXiv - Machine Learning · 4 min · about 2 months ago

Robotics

[2602.00307] Autonomous Data Processing using Meta-Agents

The paper presents a novel framework, Autonomous Data Processing using Meta-Agents (ADP-MA), which enhances data processing pipelines thr...

arXiv - AI · 3 min · about 2 months ago

Llms

[2601.15599] Autonomous Business System via Neuro-symbolic AI

The paper presents AUTOBUS, an Autonomous Business System that integrates LLM-based AI agents with predicate-logic programming to enhance...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2601.07463] Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning

This paper presents a novel Local-to-Global (LOGO) world model for offline multi-agent reinforcement learning (MARL), improving policy ge...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 97 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

I got tired of 3 AM PagerDuty alerts, so I built an AI agent to fix cloud outages while I sleep. (Built with GLM-5.1)

NeuBird AI Raises $19.3 Million To Scale Agentic AI

CodeGraphContext - An MCP server that converts your codebase into a graph database

All Content

[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students

[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

[2512.23482] Theory of Mind for Explainable Human-Robot Interaction

[2512.22213] On the Existence and Behavior of Secondary Attention Sinks

[2511.00794] Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration

[2511.00040] Semi-Supervised Preference Optimization with Limited Feedback

[2510.15297] VERA-MH Concept Paper

[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration

[2506.21039] Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning

[2506.20555] DeepQuark: A Deep-Neural-Network Approach to Multiquark Bound States

[2506.11798] Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models

[2505.17508] On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

[2503.04121] Simple Self Organizing Map with Vision Transformers

[2502.03752] Self-Improving Skill Learning for Robust Skill-based Meta-Reinforcement Learning

[2412.18362] Point-DeepONet: Predicting Nonlinear Fields on Non-Parametric Geometries under Variable Load Conditions

[2412.02039] Multi-View 3D Reconstruction using Knowledge Distillation

[2602.00307] Autonomous Data Processing using Meta-Agents

[2601.15599] Autonomous Business System via Neuro-symbolic AI

[2601.07463] Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning

Related Topics

Stay updated with AI News