Agentic AI capabilities to be integrated into defense platforms by BAE Systems, Scale AI
FALLS CHURCH, Virginia. BAE Systems and Scale AI have signed a strategic relationship agreement to speed the development and fielding of ...
Autonomous agents, tool use, and agentic systems
AI agents are running on infrastructure built for humans. Every state check runs 9 shell commands. Every cold start re-discovers context ...
submitted by /u/Fcking_Chuck
Abstract page for arXiv paper 2603.21972: Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
Abstract page for arXiv paper 2603.21331: AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
Abstract page for arXiv paper 2603.20405: Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP
Abstract page for arXiv paper 2603.19220: Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Abstract page for arXiv paper 2603.07496: From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents
Abstract page for arXiv paper 2602.07150: On Randomness in Agentic Evals
Abstract page for arXiv paper 2601.07148: Measuring Iterative Temporal Reasoning with Time Puzzles
Abstract page for arXiv paper 2511.11828: Conformal Constrained Policy Optimization for Cost-Effective LLM Agents
Abstract page for arXiv paper 2509.08157: Risk-Bounded Multi-Agent Visual Navigation via Iterative Risk Allocation
Abstract page for arXiv paper 2603.11721: When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows
Abstract page for arXiv paper 2603.11679: LLMs can construct powerful representations and streamline sample-efficient supervised learning
Abstract page for arXiv paper 2603.11382: Detecting Intrinsic and Instrumental Self-Preservation in Autonomous Agents: The Unified Contin...
Abstract page for arXiv paper 2603.08388: A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Gen...
Abstract page for arXiv paper 2602.01297: RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis
Abstract page for arXiv paper 2511.12876: Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decisi...
Abstract page for arXiv paper 2511.06626: Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives
Abstract page for arXiv paper 2506.13113: Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning
Abstract page for arXiv paper 2603.21613: AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents
Abstract page for arXiv paper 2603.21594: Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limi...
Abstract page for arXiv paper 2603.21564: Toward a Theory of Hierarchical Memory for Language Agents