AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Machine Learning

I got tired of 3 AM PagerDuty alerts, so I built an AI agent to fix cloud outages while I sleep. (Built with GLM-5.1)

If you've ever been on-call, you know the nightmare. It’s 3:15 AM. You get pinged because heavily-loaded database nodes in us-east-1 are ...

Reddit - Artificial Intelligence · 1 min ·
NeuBird AI Raises $19.3 Million To Scale Agentic AI
Ai Agents

NeuBird AI Raises $19.3 Million To Scale Agentic AI

AI News - General · 4 min ·
Ai Agents

CodeGraphContext - An MCP server that converts your codebase into a graph database

CodeGraphContext- the go to solution for graph-code indexing 🎉🎉... It's an MCP server that understands a codebase as a graph, not chunks ...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students
Generative Ai

[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students

This study audits the collaboration between online graduate CS students and AI, exploring preferences for automation in academic tasks an...

arXiv - AI · 3 min ·
[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment
Machine Learning

[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

This paper presents Contrastive Object-centric Diffusion Alignment (CODA), an enhancement to object-centric learning that reduces slot en...

arXiv - AI · 4 min ·
[2512.23482] Theory of Mind for Explainable Human-Robot Interaction
Machine Learning

[2512.23482] Theory of Mind for Explainable Human-Robot Interaction

This article explores the integration of Theory of Mind (ToM) in human-robot interaction (HRI) to enhance robot interpretability and user...

arXiv - AI · 4 min ·
[2512.22213] On the Existence and Behavior of Secondary Attention Sinks
Machine Learning

[2512.22213] On the Existence and Behavior of Secondary Attention Sinks

This paper explores the concept of secondary attention sinks in machine learning models, highlighting their distinct properties and behav...

arXiv - AI · 4 min ·
[2511.00794] Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration
Llms

[2511.00794] Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration

This paper presents PREPO, a novel approach to enhance data efficiency in reinforcement learning for large language models by leveraging ...

arXiv - AI · 4 min ·
[2511.00040] Semi-Supervised Preference Optimization with Limited Feedback
Llms

[2511.00040] Semi-Supervised Preference Optimization with Limited Feedback

This paper discusses Semi-Supervised Preference Optimization (SSPO), which reduces the need for extensive labeled feedback in preference ...

arXiv - AI · 3 min ·
[2510.15297] VERA-MH Concept Paper
Machine Learning

[2510.15297] VERA-MH Concept Paper

The VERA-MH Concept Paper outlines an innovative framework for evaluating AI chatbots in mental health contexts, focusing on suicide risk...

arXiv - AI · 4 min ·
[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
Machine Learning

[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

The paper presents pi-Flow, a novel approach to few-step generation in machine learning that utilizes imitation distillation to enhance m...

arXiv - AI · 4 min ·
[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration
Generative Ai

[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration

CareerPooler introduces an AI-driven metaphorical simulation for career exploration, enhancing user engagement and decision-making throug...

arXiv - AI · 3 min ·
[2506.21039] Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning
Machine Learning

[2506.21039] Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning

The paper presents Strict Subgoal Execution (SSE), a novel framework for hierarchical reinforcement learning that enhances long-horizon p...

arXiv - AI · 4 min ·
[2506.20555] DeepQuark: A Deep-Neural-Network Approach to Multiquark Bound States
Machine Learning

[2506.20555] DeepQuark: A Deep-Neural-Network Approach to Multiquark Bound States

The paper presents DeepQuark, a novel deep-neural-network approach for analyzing multiquark bound states, demonstrating superior performa...

arXiv - AI · 4 min ·
[2506.11798] Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models
Llms

[2506.11798] Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models

This paper explores the use of Large Language Models (LLMs) to simulate voting behavior in the European Parliament through persona-driven...

arXiv - Machine Learning · 4 min ·
[2505.17508] On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning
Llms

[2505.17508] On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

This paper presents a unified framework for KL-regularized policy gradient algorithms aimed at enhancing reasoning in large language mode...

arXiv - AI · 4 min ·
[2503.04121] Simple Self Organizing Map with Vision Transformers
Machine Learning

[2503.04121] Simple Self Organizing Map with Vision Transformers

This paper explores the integration of Self-Organizing Maps (SOMs) with Vision Transformers (ViTs) to enhance performance on small datase...

arXiv - Machine Learning · 4 min ·
[2502.03752] Self-Improving Skill Learning for Robust Skill-based Meta-Reinforcement Learning
Machine Learning

[2502.03752] Self-Improving Skill Learning for Robust Skill-based Meta-Reinforcement Learning

This paper presents Self-Improving Skill Learning (SISL), a novel approach to enhance skill-based meta-reinforcement learning by refining...

arXiv - AI · 3 min ·
[2412.18362] Point-DeepONet: Predicting Nonlinear Fields on Non-Parametric Geometries under Variable Load Conditions
Machine Learning

[2412.18362] Point-DeepONet: Predicting Nonlinear Fields on Non-Parametric Geometries under Variable Load Conditions

Point-DeepONet introduces a novel approach for predicting nonlinear fields in engineering, leveraging deep learning to enhance efficiency...

arXiv - AI · 4 min ·
[2412.02039] Multi-View 3D Reconstruction using Knowledge Distillation
Llms

[2412.02039] Multi-View 3D Reconstruction using Knowledge Distillation

This paper presents a knowledge distillation approach for Multi-View 3D reconstruction, utilizing a teacher-student model framework to en...

arXiv - Machine Learning · 4 min ·
[2602.00307] Autonomous Data Processing using Meta-Agents
Robotics

[2602.00307] Autonomous Data Processing using Meta-Agents

The paper presents a novel framework, Autonomous Data Processing using Meta-Agents (ADP-MA), which enhances data processing pipelines thr...

arXiv - AI · 3 min ·
[2601.15599] Autonomous Business System via Neuro-symbolic AI
Llms

[2601.15599] Autonomous Business System via Neuro-symbolic AI

The paper presents AUTOBUS, an Autonomous Business System that integrates LLM-based AI agents with predicate-logic programming to enhance...

arXiv - AI · 4 min ·
[2601.07463] Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning
Machine Learning

[2601.07463] Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning

This paper presents a novel Local-to-Global (LOGO) world model for offline multi-agent reinforcement learning (MARL), improving policy ge...

arXiv - Machine Learning · 4 min ·
Previous Page 97 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime