AI Agents
Autonomous agents, tool use, and agentic systems
Top This Week
Considering NeurIPS submission [D]
Wondering if it is worth submitting the paper I’m working on to NeurIPS. I have a formal mathematical proof of convergence for a novel agentic sys...
Anthropic cuts off the ability to use Claude subscriptions with OpenClaw and third-party AI agents
All Content
[2506.17337] Can Generalist Vision Language Models (VLMs) Rival Specialist Medical VLMs? Benchmarking and Strategic Insights
This study evaluates the performance of generalist Vision Language Models (VLMs) compared to specialist medical VLMs, revealing that gene...
[2602.05165] EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization
The paper presents EBPO, a novel framework that enhances Group Relative Policy Optimization (GRPO) by employing Empirical Bayes shrinkage...
[2602.03098] TextME: Bridging Unseen Modalities Through Text Descriptions
The paper introduces TextME, a framework that enables zero-shot cross-modal transfer using only text descriptions, addressing the limitat...
[2602.02853] Recurrent Equivariant Constraint Modulation: Learning Per-Layer Symmetry Relaxation from Data
The article presents Recurrent Equivariant Constraint Modulation (RECM), a novel approach for learning layer-wise symmetry relaxation in ...
[2505.16547] Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation
This paper presents a zero-shot reinforcement learning framework for occlusion-aware plant manipulation, achieving high success rates in ...
[2505.06595] Feature Representation Transferring to Lightweight Models via Perception Coherence
This paper introduces a novel method for transferring feature representations from larger teacher models to lightweight student models us...
[2504.04717] Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models
This article surveys advancements in multi-turn interactions with large language models (LLMs), focusing on evaluation methods, challenge...
[2503.23377] JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization
The paper presents JavisDiT, a novel Joint Audio-Video Diffusion Transformer that enhances synchronized audio-video generation through a ...
[2503.21258] Learn by Reasoning: Analogical Weight Generation for Few-Shot Class-Incremental Learning
This paper presents a novel approach to Few-Shot Class-Incremental Learning (FSCIL) using an analogical generative method, enhancing mode...
[2601.03612] Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias
This article presents a novel approach to polyphonic music generation using structural inductive bias, focusing on Beethoven's piano sona...
[2503.14637] KINESIS: Motion Imitation for Human Musculoskeletal Locomotion
KINESIS presents a model-free framework for motion imitation in human musculoskeletal locomotion, achieving robust performance in various...
[2601.01678] HeurekaBench: A Benchmarking Framework for AI Co-scientist
HeurekaBench introduces a benchmarking framework for AI co-scientists, enabling rigorous evaluation of LLM-based systems through realisti...
[2503.13444] VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning
VideoMind introduces a novel approach for temporal-grounded video reasoning using a Chain-of-LoRA agent, enhancing multi-modal reasoning ...
[2601.00728] Precision Autotuning for Linear Solvers via Reinforcement Learning
This paper presents a reinforcement learning framework for adaptive precision tuning of linear solvers, enhancing computational efficienc...
[2503.04940] VQEL: Enabling Self-Play in Emergent Language Games via Agent-Internal Vector Quantization
The paper presents VQEL, a novel architecture that enhances self-play in emergent language games through agent-internal vector quantizati...
[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context
This article evaluates the divergent thinking capabilities of Large Language Models (LLMs) for scientific idea generation using minimal c...
[2512.00672] ML-Tool-Bench: Tool-Augmented Planning for ML Tasks
The paper presents ML-Tool-Bench, a benchmark for evaluating tool-augmented planning in machine learning tasks, addressing the limitation...
[2512.00403] SelfAI: A self-directed framework for long-horizon scientific discovery
The paper introduces SelfAI, a self-directed framework designed for long-horizon scientific discovery, emphasizing efficient exploration ...
[2412.04272] PoTable: Towards Systematic Thinking via Plan-then-Execute Stage Reasoning on Tables
The paper presents PoTable, a novel approach to table reasoning that integrates systematic thinking through a plan-then-execute mechanism...
[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems
The paper presents E2E-GRec, a novel end-to-end framework that integrates Graph Neural Networks (GNNs) with recommender systems, addressi...