AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

Agent frameworks waste ~350,000+ tokens per session resending static files. 95% reduction benchmarked.

Measured the actual token waste on a local Qwen 3.5 122B setup. The numbers are unreal. Found a compile-time approach that cuts query con...

Reddit - Artificial Intelligence · 1 min ·
OpenClaw gives users yet another reason to be freaked out about security - Ars Technica
Ai Agents

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

The viral AI agentic tool let attackers silently gain admin unauthenticated access.

Ars Technica - AI · 5 min ·
Robotics

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Ran an experiment — gave AI agents full control over writing, character creation, and performing a sitcom. Left it running nonstop for ov...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.20980] CrystaL: Spontaneous Emergence of Visual Latents in MLLMs
Llms

[2602.20980] CrystaL: Spontaneous Emergence of Visual Latents in MLLMs

The paper presents CrystaL, a novel framework for Multimodal Large Language Models (MLLMs) that enhances visual understanding by crystall...

arXiv - AI · 3 min ·
[2602.20979] Toward an Agentic Infused Software Ecosystem
Nlp

[2602.20979] Toward an Agentic Infused Software Ecosystem

This article discusses the concept of an Agentic Infused Software Ecosystem (AISE), emphasizing the need for a holistic approach to integ...

arXiv - AI · 3 min ·
[2602.20967] Training-Free Intelligibility-Guided Observation Addition for Noisy ASR
Machine Learning

[2602.20967] Training-Free Intelligibility-Guided Observation Addition for Noisy ASR

This paper presents a novel training-free method for improving automatic speech recognition (ASR) in noisy environments by using intellig...

arXiv - AI · 3 min ·
[2602.20951] See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis
Machine Learning

[2602.20951] See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

This paper presents ArtiAgent, a novel approach to automate the creation of artifact-annotated datasets for training visual language mode...

arXiv - AI · 4 min ·
[2602.20946] Some Simple Economics of AGI
Ai Agents

[2602.20946] Some Simple Economics of AGI

This article explores the economic implications of Artificial General Intelligence (AGI), focusing on the transition from human cognition...

arXiv - Machine Learning · 4 min ·
[2602.20945] The Art of Efficient Reasoning: Data, Reward, and Optimization
Llms

[2602.20945] The Art of Efficient Reasoning: Data, Reward, and Optimization

This article explores efficient reasoning in Large Language Models (LLMs), focusing on optimizing computational resources through reward ...

arXiv - AI · 4 min ·
[2602.20877] E-MMKGR: A Unified Multimodal Knowledge Graph Framework for E-commerce Applications
Nlp

[2602.20877] E-MMKGR: A Unified Multimodal Knowledge Graph Framework for E-commerce Applications

The paper presents E-MMKGR, a unified framework for multimodal knowledge graphs tailored for e-commerce, enhancing recommendation systems...

arXiv - AI · 3 min ·
[2602.20924] Airavat: An Agentic Framework for Internet Measurement
Computer Vision

[2602.20924] Airavat: An Agentic Framework for Internet Measurement

Airavat introduces an innovative framework for automating Internet measurement workflows, ensuring both generation and verification again...

arXiv - AI · 3 min ·
[2602.20809] Regret-Guided Search Control for Efficient Learning in AlphaZero
Machine Learning

[2602.20809] Regret-Guided Search Control for Efficient Learning in AlphaZero

This article presents Regret-Guided Search Control (RGSC), a novel approach to enhance the learning efficiency of AlphaZero by prioritizi...

arXiv - Machine Learning · 4 min ·
[2602.20867] SoK: Agentic Skills -- Beyond Tool Use in LLM Agents
Llms

[2602.20867] SoK: Agentic Skills -- Beyond Tool Use in LLM Agents

This paper explores agentic skills in LLM agents, focusing on reusable procedural capabilities that enhance long-horizon workflows. It pr...

arXiv - AI · 4 min ·
[2602.20735] RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition
Llms

[2602.20735] RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition

The paper presents RMIT-ADM+S, an award-winning system for the Text-to-Text track at the NeurIPS 2025 Competition, featuring a novel retr...

arXiv - AI · 3 min ·
[2602.20751] SibylSense: Adaptive Rubric Learning via Memory Tuning and Adversarial Probing
Machine Learning

[2602.20751] SibylSense: Adaptive Rubric Learning via Memory Tuning and Adversarial Probing

The paper presents SibylSense, a novel approach to adaptive rubric learning that enhances reward mechanisms in reinforcement learning thr...

arXiv - Machine Learning · 3 min ·
[2602.20731] Communication-Inspired Tokenization for Structured Image Representations
Machine Learning

[2602.20731] Communication-Inspired Tokenization for Structured Image Representations

The paper presents COMmunication inspired Tokenization (COMiT), a novel framework for structured image representations that enhances obje...

arXiv - Machine Learning · 4 min ·
[2602.20720] AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs
Llms

[2602.20720] AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs

The paper presents AdapTools, a novel framework for adaptive indirect prompt injection attacks on agentic large language models (LLMs), h...

arXiv - AI · 4 min ·
[2602.20684] Agile V: A Compliance-Ready Framework for AI-Augmented Engineering -- From Concept to Audit-Ready Delivery
Machine Learning

[2602.20684] Agile V: A Compliance-Ready Framework for AI-Augmented Engineering -- From Concept to Audit-Ready Delivery

The paper presents Agile V, a framework integrating AI in engineering workflows to ensure compliance and verification at machine-speed de...

arXiv - AI · 4 min ·
[2602.20670] CAMEL: Confidence-Gated Reflection for Reward Modeling
Llms

[2602.20670] CAMEL: Confidence-Gated Reflection for Reward Modeling

The paper introduces CAMEL, a confidence-gated reflection framework for reward modeling in AI, achieving state-of-the-art performance wit...

arXiv - AI · 3 min ·
[2602.20643] TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer
Llms

[2602.20643] TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer

The paper presents TrajGPT-R, a framework for generating urban mobility trajectories using a reinforcement learning-enhanced generative t...

arXiv - Machine Learning · 4 min ·
[2602.20547] What Drives Students' Use of AI Chatbots? Technology Acceptance in Conversational AI
Machine Learning

[2602.20547] What Drives Students' Use of AI Chatbots? Technology Acceptance in Conversational AI

This article explores the factors influencing students' adoption of AI chatbots for learning, utilizing the Technology Acceptance Model t...

arXiv - AI · 4 min ·
[2602.20532] Actor-Curator: Co-adaptive Curriculum Learning via Policy-Improvement Bandits for RL Post-Training
Llms

[2602.20532] Actor-Curator: Co-adaptive Curriculum Learning via Policy-Improvement Bandits for RL Post-Training

The paper presents ACTOR-CURATOR, a novel framework for curriculum learning in reinforcement learning, enhancing post-training for large ...

arXiv - Machine Learning · 4 min ·
[2602.20527] A Generalized Apprenticeship Learning Framework for Capturing Evolving Student Pedagogical Strategies
Machine Learning

[2602.20527] A Generalized Apprenticeship Learning Framework for Capturing Evolving Student Pedagogical Strategies

This article presents a generalized apprenticeship learning framework, THEMES, designed to enhance pedagogical strategies in e-learning b...

arXiv - Machine Learning · 4 min ·
Previous Page 61 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime