AI Agents

Autonomous agents, tool use, and agentic systems


Top This Week

LLMs

Been building a multi-agent framework in public for 5 weeks, its been a Journey.

I've been building this repo in public since day one, roughly 5 weeks now, with Claude Code. Here's where it's at. Feels good to be so close....

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

Machine Learning

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Saw this on X. I too am struggling with the term "post-agentic AI"; just posting here for further discussion. Submitted by /u/elnino2023 [li...

Reddit - Machine Learning · 1 min · about 7 hours ago

AI Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? Submitted by /u/stvlsn

Reddit - Artificial Intelligence · 1 min · about 13 hours ago

All Content

Data Science

[2602.12919] EPRBench: A High-Quality Benchmark Dataset for Event Stream Based Visual Place Recognition

EPRBench introduces a benchmark dataset for event stream-based visual place recognition, addressing challenges in low-light and high-spee...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.12892] RADAR: Revealing Asymmetric Development of Abilities in MLLM Pre-training

The paper presents RADAR, a novel evaluation framework for Multi-modal Large Language Models (MLLMs) that addresses performance bottlenec...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.12873] Knowledge-Based Design Requirements for Generative Social Robots in Higher Education

The article explores design requirements for generative social robots in higher education, emphasizing the need for knowledge-based frame...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.12833] TRACE: Temporal Reasoning via Agentic Context Evolution for Streaming Electronic Health Records (EHRs)

TRACE introduces a novel framework for temporal reasoning in electronic health records, enhancing prediction accuracy and clinical safety...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12829] FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching

The paper presents FLAC, a novel framework for Maximum Entropy Reinforcement Learning that utilizes kinetic energy regularization to opti...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.12763] "Not Human, Funnier": How Machine Identity Shapes Humor Perception in Online AI Stand-up Comedy

This article explores how AI's machine identity influences humor perception in online stand-up comedy, revealing that AI can be perceived...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.12691] ALOE: Action-Level Off-Policy Evaluation for Vision-Language-Action Model Post-Training

The paper presents ALOE, an action-level off-policy evaluation framework aimed at enhancing vision-language-action models through reinfor...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.12687] Trust the uncertain teacher: distilling dark knowledge via calibrated uncertainty

This paper introduces Calibrated Uncertainty Distillation (CUD), a novel approach to knowledge distillation that enhances the transfer of...

arXiv - Machine Learning · 4 min · about 2 months ago

Robotics

[2602.12656] PMG: Parameterized Motion Generator for Human-like Locomotion Control

The PMG paper presents a novel Parameterized Motion Generator for humanoid locomotion, addressing challenges in adapting motion tracking ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.12643] Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

The paper presents Unified Latent Dynamics (ULD), a novel reinforcement learning algorithm that combines the efficiency of model-free met...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.12642] Beyond Normalization: Rethinking the Partition Function as a Difficulty Scheduler for RLVR

This article presents a novel approach to reinforcement learning by reinterpreting the partition function as a difficulty scheduler, enha...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.12641] Artic: AI-oriented Real-time Communication for MLLM Video Assistant

The paper presents Artic, an AI-oriented real-time communication framework designed for Multimodal Large Language Model (MLLM) video assi...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.12612] Self-EvolveRec: Self-Evolving Recommender Systems with LLM-based Directional Feedback

The paper presents Self-EvolveRec, a framework for self-evolving recommender systems that utilizes LLM-based directional feedback to enha...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.12601] HyperMLP: An Integrated Perspective for Sequence Modeling

The paper presents HyperMLP, a novel approach to sequence modeling that reinterprets autoregressive attention as a dynamic two-layer MLP,...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12593] RQ-GMM: Residual Quantized Gaussian Mixture Model for Multimodal Semantic Discretization in CTR Prediction

The paper introduces RQ-GMM, a novel model for improving click-through rate (CTR) prediction by effectively discretizing multimodal embed...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.12579] VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

The paper introduces VI-CuRL, a framework for stabilizing verifier-independent reinforcement learning (RL) by utilizing confidence-guided...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.12574] Monte Carlo Tree Search with Reasoning Path Refinement for Small Language Models in Conversational Text-to-NoSQL

This paper presents a novel framework, Stage-MCTS, which enhances small language models' ability to generate NoSQL queries through conver...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.12547] A consequence of failed sequential learning: A computational account of developmental amnesia

This article presents a computational model addressing developmental amnesia, characterized by impaired episodic memory and intact semant...

arXiv - AI · 4 min · about 2 months ago

NLP

[2602.12517] Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

The paper presents Bench-MFG, a benchmark suite designed to standardize evaluations in learning for stationary Mean Field Games, addressi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12508] Monocular Reconstruction of Neural Tactile Fields

This paper presents a novel approach to robotic navigation using neural tactile fields, enabling robots to predict tactile responses from...

arXiv - AI · 3 min · about 2 months ago

