AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

AI Agents

"They operate like slot machines": AI agents are scrambling power users' brains

AI Tools & Products · about 2 hours ago

AI Agents

Considering NeurIPS submission [D]

Wondering if it is worth submitting the paper I'm working on to NeurIPS. I have a formal mathematical proof of convergence for a novel agentic sys...

Reddit - Machine Learning · 1 min · about 7 hours ago

LLMs

Anthropic cuts off the ability to use Claude subscriptions with OpenClaw and third-party AI agents

AI Tools & Products · about 8 hours ago

All Content

LLMs

[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

This study evaluates the performance of generalist Vision Language Models (VLMs) compared to specialist medical VLMs, revealing that gene...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

The paper presents EBPO, a novel framework that enhances Group Relative Policy Optimization (GRPO) by employing Empirical Bayes shrinkage...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.03098] TextME: Bridging Unseen Modalities Through Text Descriptions

The paper introduces TextME, a framework that enables zero-shot cross-modal transfer using only text descriptions, addressing the limitat...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.02853] Recurrent Equivariant Constraint Modulation: Learning Per-Layer Symmetry Relaxation from Data

The article presents Recurrent Equivariant Constraint Modulation (RECM), a novel approach for learning layer-wise symmetry relaxation in ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2505.16547] Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation

This paper presents a zero-shot reinforcement learning framework for occlusion-aware plant manipulation, achieving high success rates in ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2505.06595] Feature Representation Transferring to Lightweight Models via Perception Coherence

This paper introduces a novel method for transferring feature representations from larger teacher models to lightweight student models us...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2504.04717] Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models

This article surveys advancements in multi-turn interactions with large language models (LLMs), focusing on evaluation methods, challenge...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2503.23377] JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

The paper presents JavisDiT, a novel Joint Audio-Video Diffusion Transformer that enhances synchronized audio-video generation through a ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2503.21258] Learn by Reasoning: Analogical Weight Generation for Few-Shot Class-Incremental Learning

This paper presents a novel approach to Few-Shot Class-Incremental Learning (FSCIL) using an analogical generative method, enhancing mode...

arXiv - AI · 4 min · about 1 month ago

NLP

[2601.03612] Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias

This article presents a novel approach to polyphonic music generation using structural inductive bias, focusing on Beethoven's piano sona...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2503.14637] KINESIS: Motion Imitation for Human Musculoskeletal Locomotion

KINESIS presents a model-free framework for motion imitation in human musculoskeletal locomotion, achieving robust performance in various...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2601.01678] HeurekaBench: A Benchmarking Framework for AI Co-scientist

HeurekaBench introduces a benchmarking framework for AI co-scientists, enabling rigorous evaluation of LLM-based systems through realisti...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2503.13444] VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning

VideoMind introduces a novel approach for temporal-grounded video reasoning using a Chain-of-LoRA agent, enhancing multi-modal reasoning ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2601.00728] Precision Autotuning for Linear Solvers via Reinforcement Learning

This paper presents a reinforcement learning framework for adaptive precision tuning of linear solvers, enhancing computational efficienc...

arXiv - Machine Learning · 4 min · about 1 month ago

AI Agents

[2503.04940] VQEL: Enabling Self-Play in Emergent Language Games via Agent-Internal Vector Quantization

The paper presents VQEL, a novel architecture that enhances self-play in emergent language games through agent-internal vector quantizati...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context

This article evaluates the divergent thinking capabilities of Large Language Models (LLMs) for scientific idea generation using minimal c...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2512.00672] ML-Tool-Bench: Tool-Augmented Planning for ML Tasks

The paper presents ML-Tool-Bench, a benchmark for evaluating tool-augmented planning in machine learning tasks, addressing the limitation...

arXiv - AI · 4 min · about 1 month ago

AI Agents

[2512.00403] SelfAI: A self-directed framework for long-horizon scientific discovery

The paper introduces SelfAI, a self-directed framework designed for long-horizon scientific discovery, emphasizing efficient exploration ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2412.04272] PoTable: Towards Systematic Thinking via Plan-then-Execute Stage Reasoning on Tables

The paper presents PoTable, a novel approach to table reasoning that integrates systematic thinking through a plan-then-execute mechanism...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems

The paper presents E2E-GRec, a novel end-to-end framework that integrates Graph Neural Networks (GNNs) with recommender systems, addressi...

arXiv - Machine Learning · 4 min · about 1 month ago
