AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Nlp

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[P] A control plane for post-training workflows

We have been exploring a project around post-training infrastructure, a minimalist tool that does one thing really well: Make post-traini...

Reddit - Machine Learning · 1 min ·
Enabling agent-first process redesign | MIT Technology Review
Nlp

Enabling agent-first process redesign | MIT Technology Review

Unlike static, rules-based systems, AI agents can learn, adapt, and optimize processes dynamically. As they interact with data, systems, ...

MIT Technology Review - AI · 4 min ·

All Content

[2602.17221] From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan's Humanities and Social Sciences
Llms

[2602.17221] From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan's Humanities and Social Sciences

This article explores a methodological experiment using AI agents to enhance research in Taiwan's humanities and social sciences, proposi...

arXiv - AI · 4 min ·
[2602.17217] Continual learning and refinement of causal models through dynamic predicate invention
Machine Learning

[2602.17217] Continual learning and refinement of causal models through dynamic predicate invention

This paper presents a framework for online construction of symbolic causal world models, enhancing agents' decision-making through contin...

arXiv - AI · 3 min ·
[2602.17189] Texo: Formula Recognition within 20M Parameters
Machine Learning

[2602.17189] Texo: Formula Recognition within 20M Parameters

The paper presents Texo, a compact formula recognition model with 20 million parameters, achieving high performance comparable to larger ...

arXiv - AI · 3 min ·
[2602.17162] JEPA-DNA: Grounding Genomic Foundation Models through Joint-Embedding Predictive Architectures
Llms

[2602.17162] JEPA-DNA: Grounding Genomic Foundation Models through Joint-Embedding Predictive Architectures

The paper introduces JEPA-DNA, a novel framework for genomic foundation models that enhances predictive capabilities by integrating joint...

arXiv - AI · 4 min ·
[2602.17130] Efficient Parallel Algorithm for Decomposing Hard CircuitSAT Instances
Ai Agents

[2602.17130] Efficient Parallel Algorithm for Decomposing Hard CircuitSAT Instances

This article presents a new parallel algorithm designed to decompose complex CircuitSAT instances, enhancing efficiency in solving SAT pr...

arXiv - AI · 3 min ·
[2602.17111] Instructor-Aligned Knowledge Graphs for Personalized Learning
Machine Learning

[2602.17111] Instructor-Aligned Knowledge Graphs for Personalized Learning

This article presents InstructKG, a framework for creating instructor-aligned knowledge graphs that enhance personalized learning by mapp...

arXiv - AI · 4 min ·
[2602.17096] Agentic Wireless Communication for 6G: Intent-Aware and Continuously Evolving Physical-Layer Intelligence
Robotics

[2602.17096] Agentic Wireless Communication for 6G: Intent-Aware and Continuously Evolving Physical-Layer Intelligence

The paper explores the evolution of 6G wireless communication, emphasizing the shift towards intent-aware, autonomous systems that adapt ...

arXiv - AI · 4 min ·
[2602.17084] How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses
Llms

[2602.17084] How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses

This study analyzes how AI coding agents create pull request descriptions and how human reviewers respond, revealing distinct styles and ...

arXiv - AI · 3 min ·
[2602.17062] Retaining Suboptimal Actions to Follow Shifting Optima in Multi-Agent Reinforcement Learning
Machine Learning

[2602.17062] Retaining Suboptimal Actions to Follow Shifting Optima in Multi-Agent Reinforcement Learning

This paper presents Successive Sub-value Q-learning (S2Q), a novel approach in multi-agent reinforcement learning (MARL) that retains sub...

arXiv - AI · 3 min ·
[2602.17049] IntentCUA: Learning Intent-level Representations for Skill Abstraction and Multi-Agent Planning in Computer-Use Agents
Nlp

[2602.17049] IntentCUA: Learning Intent-level Representations for Skill Abstraction and Multi-Agent Planning in Computer-Use Agents

The paper presents IntentCUA, a framework for multi-agent planning in computer-use agents, focusing on intent-level representations to en...

arXiv - AI · 4 min ·
[2602.17046] Dynamic System Instructions and Tool Exposure for Efficient Agentic LLMs
Llms

[2602.17046] Dynamic System Instructions and Tool Exposure for Efficient Agentic LLMs

The paper presents Instruction-Tool Retrieval (ITR), a method that optimizes the operation of Large Language Model (LLM) agents by dynami...

arXiv - AI · 3 min ·
[2602.17038] Phase-Aware Mixture of Experts for Agentic Reinforcement Learning
Llms

[2602.17038] Phase-Aware Mixture of Experts for Agentic Reinforcement Learning

The paper presents a novel Phase-Aware Mixture of Experts (PA-MoE) architecture for reinforcement learning, addressing the limitations of...

arXiv - AI · 4 min ·
[2602.17016] M2F: Automated Formalization of Mathematical Literature at Scale
Ai Agents

[2602.17016] M2F: Automated Formalization of Mathematical Literature at Scale

The paper presents M2F, an innovative framework for the automated formalization of mathematical literature, enabling project-scale conver...

arXiv - AI · 4 min ·
[2602.17017] Sales Research Agent and Sales Research Bench
Machine Learning

[2602.17017] Sales Research Agent and Sales Research Bench

The paper presents the Sales Research Agent, an AI tool in Microsoft Dynamics 365 Sales, designed to provide insights from live CRM data....

arXiv - AI · 3 min ·
[2602.17015] Cinder: A fast and fair matchmaking system
Nlp

[2602.17015] Cinder: A fast and fair matchmaking system

The paper introduces Cinder, a two-stage matchmaking system designed to enhance fairness and speed in multiplayer online games by utilizi...

arXiv - AI · 3 min ·
[2602.17001] Sonar-TS: Search-Then-Verify Natural Language Querying for Time Series Databases
Machine Learning

[2602.17001] Sonar-TS: Search-Then-Verify Natural Language Querying for Time Series Databases

Sonar-TS introduces a neuro-symbolic framework for natural language querying of time series databases, addressing limitations of existing...

arXiv - AI · 3 min ·
[2602.16990] Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation
Machine Learning

[2602.16990] Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation

The paper introduces Conv-FinRe, a benchmark for evaluating financial recommendation systems that emphasizes utility-grounded decision-ma...

arXiv - AI · 4 min ·
[2602.16958] Automating Agent Hijacking via Structural Template Injection
Llms

[2602.16958] Automating Agent Hijacking via Structural Template Injection

This paper presents Phantom, an automated framework for agent hijacking via Structural Template Injection, enhancing attack success rates...

arXiv - Machine Learning · 4 min ·
[2602.16984] Fundamental Limits of Black-Box Safety Evaluation: Information-Theoretic and Computational Barriers from Latent Context Conditioning
Machine Learning

[2602.16984] Fundamental Limits of Black-Box Safety Evaluation: Information-Theoretic and Computational Barriers from Latent Context Conditioning

This paper explores the limitations of black-box safety evaluations in AI systems, highlighting the challenges posed by latent context co...

arXiv - AI · 4 min ·
[2602.16953] LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation
Llms

[2602.16953] LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation

The paper presents LLM4Cov, an offline learning framework for high-coverage testbench generation, addressing challenges in hardware verif...

arXiv - Machine Learning · 3 min ·
Previous Page 105 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime