Microsoft's newest open-source project: Runtime security for AI agents
submitted by /u/Fcking_Chuck [link] [comments]
Autonomous agents, tool use, and agentic systems
submitted by /u/Fcking_Chuck [link] [comments]
Abstract page for arXiv paper 2510.16609: Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods
Abstract page for arXiv paper 2604.02131: Intelligent Cloud Orchestration: A Hybrid Predictive and Heuristic Framework for Cost Optimization
The Wasserstein Barycenter Soft Actor-Critic (WBSAC) algorithm enhances sample efficiency in reinforcement learning by combining pessimis...
This survey explores the challenges of building trustworthy GUI agents, highlighting the execution gap and proposing a taxonomy for under...
This paper explores the regularity and stability properties of selective state-space models (SSMs) with discontinuous gating, focusing on...
The paper discusses a novel approach to incentive-compatible exploration in bandit settings, addressing the misalignment between principa...
This paper presents gap-dependent performance guarantees for nearly minimax-optimal reinforcement learning algorithms using linear functi...
The paper presents SMaRT, an innovative algorithm for online resource allocation in the Kenyan judiciary, focusing on mediator assignment...
The paper presents SOM-VQ, a novel tokenization method that enhances interactive generative models by integrating vector quantization wit...
The paper presents SELAUR, a reinforcement learning framework that enhances large language models (LLMs) by integrating uncertainty into ...
This paper explores the challenges of multi-agent imitation learning (MA-IL), particularly the exploitability of learned policies in mult...
This article presents a novel approach to EEG-to-text decoding, exploring how hierarchical abstraction levels affect classification perfo...
The paper presents the Semantic-guided Adaptive Expert Forest (SAEF), a novel approach for Class-Incremental Learning (CIL) that enhances...
This paper introduces transcoder adapters, a method for analyzing the internal changes in reasoning models post fine-tuning, demonstratin...
This paper examines the effectiveness of benchmarks in cooperative multi-agent reinforcement learning (MARL) by analyzing Dec-POMDP reaso...
This article investigates how the magnitude of parameter updates affects forgetting and generalization in continual learning, proposing a...
This paper explores the impact of rehearsal scale on continual learning, revealing counterintuitive effects on adaptability and memory re...
The paper presents ECO, a new learning paradigm for Neural Combinatorial Optimization that enhances efficiency through offline self-play,...
The paper presents Fuz-RL, a fuzzy-guided framework for safe reinforcement learning that addresses uncertainties in real-world applicatio...
The paper presents QEDBench, a benchmark for evaluating the alignment of automated systems in assessing university-level mathematical pro...
The paper presents GATES, a self-distillation method for document-grounded question answering, enhancing model performance by leveraging ...
The paper presents a novel framework, Memory-guided Prototypical Co-occurrence Learning (MPCL), aimed at improving mixed emotion recognit...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime