AI Agents
Autonomous agents, tool use, and agentic systems
Top This Week
Started a video series on building an orchestration layer for LLM post-training [P]
Hi everyone! Context, motivation, a lot of yapping, feel free to skip to TL;DR. A while back I posted here asking [D] What framework do y...
All Content
[2602.15082] S-PRESSO: Ultra Low Bitrate Sound Effect Compression With Diffusion Autoencoders And Offline Quantization
The paper presents S-PRESSO, a novel sound effect compression model that achieves ultra-low bitrate audio compression using diffusion aut...
[2602.15074] Structure-Aware Piano Accompaniment via Style Planning and Dataset-Aligned Pattern Retrieval
This paper presents a structure-aware method for generating piano accompaniments using a transformer model for style planning and dataset...
[2602.15738] Beyond Labels: Information-Efficient Human-in-the-Loop Learning using Ranking and Selection Queries
This article presents a novel human-in-the-loop framework for machine learning that enhances information efficiency by utilizing ranking ...
[2602.15070] An effective Genetic Programming Hyper-Heuristic for Uncertain Agile Satellite Scheduling
This paper presents a Genetic Programming Hyper-Heuristic (GPHH) designed for the Uncertain Agile Earth Observation Satellite Scheduling ...
[2602.15064] Structural Divergence Between AI-Agent and Human Social Networks in Moltbook
This article explores the structural differences between AI-agent and human social networks on the Moltbook platform, revealing unique in...
[2602.15707] Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU
This article presents a novel real-time conversational assistant that utilizes audio and IMU data to guide users through procedural tasks...
[2602.15061] Safe-SDL:Establishing Safety Boundaries and Control Mechanisms for AI-Driven Self-Driving Laboratories
The paper presents Safe-SDL, a framework for ensuring safety in AI-driven Self-Driving Laboratories, addressing the critical 'Syntax-to-S...
[2602.15640] Latency-aware Human-in-the-Loop Reinforcement Learning for Semantic Communications
The paper presents a framework for latency-aware human-in-the-loop reinforcement learning in semantic communications, addressing the bala...
[2602.15060] CLOT: Closed-Loop Global Motion Tracking for Whole-Body Humanoid Teleoperation
The paper presents CLOT, a closed-loop system for humanoid teleoperation that addresses global pose drift, enabling stable and precise lo...
[2602.15055] Beyond Context Sharing: A Unified Agent Communication Protocol (ACP) for Secure, Federated, and Autonomous Agent-to-Agent (A2A) Orchestration
The paper introduces the Agent Communication Protocol (ACP), a framework for secure and efficient agent-to-agent orchestration, addressin...
[2602.15042] Combining scEEG and PPG for reliable sleep staging using lightweight wearables
This article explores the fusion of single-channel EEG (scEEG) and photoplethysmography (PPG) for improved sleep staging in lightweight w...
[2602.15038] Indic-TunedLens: Interpreting Multilingual Models in Indian Languages
The paper introduces Indic-TunedLens, an interpretability framework designed for multilingual models in Indian languages, enhancing cross...
[2602.15039] GRACE: an Agentic AI for Particle Physics Experiment Design and Simulation
The paper presents GRACE, an AI agent designed for autonomous experimental design in particle physics, utilizing simulations to optimize ...
[2602.15034] EduResearchBench: A Hierarchical Atomic Task Decomposition Benchmark for Full-Lifecycle Educational Research
EduResearchBench introduces a novel benchmark for evaluating educational research workflows using a Hierarchical Atomic Task Decompositio...
[2602.13209] LemonadeBench: Evaluating the Economic Intuition of Large Language Models in Simple Markets
The paper presents LemonadeBench, a benchmark for assessing the economic intuition of large language models (LLMs) through a simulated le...
[2602.15816] Developing AI Agents with Simulated Data: Why, what, and how?
This article discusses the significance of synthetic data generation through simulation for training AI agents, addressing challenges and...
[2602.15791] Enhancing Building Semantics Preservation in AI Model Training with Large Language Model Encodings
This article presents a novel approach to enhance building semantics preservation in AI model training using large language model encodin...
[2602.15382] The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems
The paper introduces the Vision Wormhole, a framework for enabling efficient latent-space communication in heterogeneous multi-agent syst...
[2602.15776] GlobeDiff: State Diffusion Process for Partial Observability in Multi-Agent Systems
The paper presents GlobeDiff, a novel algorithm addressing partial observability in multi-agent systems by utilizing a state diffusion pr...
[2602.15669] PERSONA: Dynamic and Compositional Inference-Time Personality Control via Activation Vector Algebra
The paper introduces PERSONA, a novel framework for dynamic personality control in Large Language Models (LLMs) using activation vector a...
Related Topics
Stay updated with AI News
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime