AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

"They operate like slot machines": AI agents are scrambling power users' brains

AI Tools & Products ·
Ai Agents

Considering NeurIPS submission [D]

Wondering if it worth submitting paper I’m working on to NeurIPS. I have formal mathematical proof for convergence of a novel agentic sys...

Reddit - Machine Learning · 1 min ·
Llms

Anthropic cuts off the ability to use Claude subscriptions with OpenClaw and third-party AI agents

AI Tools & Products ·

All Content

[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights
Llms

[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights

This study evaluates the performance of generalist Vision Language Models (VLMs) compared to specialist medical VLMs, revealing that gene...

arXiv - AI · 3 min ·
[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization
Llms

[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

The paper presents EBPO, a novel framework that enhances Group Relative Policy Optimization (GRPO) by employing Empirical Bayes shrinkage...

arXiv - AI · 4 min ·
[2602.03098] TextME: Bridging Unseen Modalities Through Text Descriptions
Llms

[2602.03098] TextME: Bridging Unseen Modalities Through Text Descriptions

The paper introduces TextME, a framework that enables zero-shot cross-modal transfer using only text descriptions, addressing the limitat...

arXiv - AI · 3 min ·
[2602.02853] Recurrent Equivariant Constraint Modulation: Learning Per-Layer Symmetry Relaxation from Data
Machine Learning

[2602.02853] Recurrent Equivariant Constraint Modulation: Learning Per-Layer Symmetry Relaxation from Data

The article presents Recurrent Equivariant Constraint Modulation (RECM), a novel approach for learning layer-wise symmetry relaxation in ...

arXiv - Machine Learning · 4 min ·
[2505.16547] Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation
Machine Learning

[2505.16547] Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation

This paper presents a zero-shot reinforcement learning framework for occlusion-aware plant manipulation, achieving high success rates in ...

arXiv - AI · 3 min ·
[2505.06595] Feature Representation Transferring to Lightweight Models via Perception Coherence
Machine Learning

[2505.06595] Feature Representation Transferring to Lightweight Models via Perception Coherence

This paper introduces a novel method for transferring feature representations from larger teacher models to lightweight student models us...

arXiv - Machine Learning · 4 min ·
[2504.04717] Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models
Llms

[2504.04717] Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models

This article surveys advancements in multi-turn interactions with large language models (LLMs), focusing on evaluation methods, challenge...

arXiv - AI · 4 min ·
[2503.23377] JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization
Machine Learning

[2503.23377] JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

The paper presents JavisDiT, a novel Joint Audio-Video Diffusion Transformer that enhances synchronized audio-video generation through a ...

arXiv - AI · 4 min ·
[2503.21258] Learn by Reasoning: Analogical Weight Generation for Few-Shot Class-Incremental Learning
Machine Learning

[2503.21258] Learn by Reasoning: Analogical Weight Generation for Few-Shot Class-Incremental Learning

This paper presents a novel approach to Few-Shot Class-Incremental Learning (FSCIL) using an analogical generative method, enhancing mode...

arXiv - AI · 4 min ·
[2601.03612] Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias
Nlp

[2601.03612] Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias

This article presents a novel approach to polyphonic music generation using structural inductive bias, focusing on Beethoven's piano sona...

arXiv - Machine Learning · 3 min ·
[2503.14637] KINESIS: Motion Imitation for Human Musculoskeletal Locomotion
Machine Learning

[2503.14637] KINESIS: Motion Imitation for Human Musculoskeletal Locomotion

KINESIS presents a model-free framework for motion imitation in human musculoskeletal locomotion, achieving robust performance in various...

arXiv - Machine Learning · 4 min ·
[2601.01678] HeurekaBench: A Benchmarking Framework for AI Co-scientist
Llms

[2601.01678] HeurekaBench: A Benchmarking Framework for AI Co-scientist

HeurekaBench introduces a benchmarking framework for AI co-scientists, enabling rigorous evaluation of LLM-based systems through realisti...

arXiv - Machine Learning · 4 min ·
[2503.13444] VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning
Llms

[2503.13444] VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning

VideoMind introduces a novel approach for temporal-grounded video reasoning using a Chain-of-LoRA agent, enhancing multi-modal reasoning ...

arXiv - AI · 4 min ·
[2601.00728] Precision Autotuning for Linear Solvers via Reinforcement Learning
Machine Learning

[2601.00728] Precision Autotuning for Linear Solvers via Reinforcement Learning

This paper presents a reinforcement learning framework for adaptive precision tuning of linear solvers, enhancing computational efficienc...

arXiv - Machine Learning · 4 min ·
[2503.04940] VQEL: Enabling Self-Play in Emergent Language Games via Agent-Internal Vector Quantization
Ai Agents

[2503.04940] VQEL: Enabling Self-Play in Emergent Language Games via Agent-Internal Vector Quantization

The paper presents VQEL, a novel architecture that enhances self-play in emergent language games through agent-internal vector quantizati...

arXiv - AI · 4 min ·
[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context
Llms

[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context

This article evaluates the divergent thinking capabilities of Large Language Models (LLMs) for scientific idea generation using minimal c...

arXiv - AI · 4 min ·
[2512.00672] ML-Tool-Bench: Tool-Augmented Planning for ML Tasks
Llms

[2512.00672] ML-Tool-Bench: Tool-Augmented Planning for ML Tasks

The paper presents ML-Tool-Bench, a benchmark for evaluating tool-augmented planning in machine learning tasks, addressing the limitation...

arXiv - AI · 4 min ·
[2512.00403] SelfAI: A self-directed framework for long-horizon scientific discovery
Ai Agents

[2512.00403] SelfAI: A self-directed framework for long-horizon scientific discovery

The paper introduces SelfAI, a self-directed framework designed for long-horizon scientific discovery, emphasizing efficient exploration ...

arXiv - AI · 4 min ·
[2412.04272] PoTable: Towards Systematic Thinking via Plan-then-Execute Stage Reasoning on Tables
Llms

[2412.04272] PoTable: Towards Systematic Thinking via Plan-then-Execute Stage Reasoning on Tables

The paper presents PoTable, a novel approach to table reasoning that integrates systematic thinking through a plan-then-execute mechanism...

arXiv - AI · 4 min ·
[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems
Machine Learning

[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems

The paper presents E2E-GRec, a novel end-to-end framework that integrates Graph Neural Networks (GNNs) with recommender systems, addressi...

arXiv - Machine Learning · 4 min ·
Previous Page 68 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime