AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

Abstract page for arXiv paper 2601.00809: A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

arXiv - AI · 4 min · about 1 hour ago

Machine Learning

[2511.11483] ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

Abstract page for arXiv paper 2511.11483: ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

arXiv - AI · 4 min · about 1 hour ago

Nlp

[2410.09134] Multi-Agent Actor-Critics in Autonomous Cyber Defense

Abstract page for arXiv paper 2410.09134: Multi-Agent Actor-Critics in Autonomous Cyber Defense

arXiv - AI · 3 min · about 1 hour ago

All Content

Llms

[2602.22828] TCM-DiffRAG: Personalized Syndrome Differentiation Reasoning Method for Traditional Chinese Medicine based on Knowledge Graph and Chain of Thought

The article presents TCM-DiffRAG, a novel reasoning framework for Traditional Chinese Medicine (TCM) that enhances diagnosis through know...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.23136] Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs

This article explores the concept of modality collapse in multimodal large language models (LLMs), highlighting the limitations of decode...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23132] From Agnostic to Specific: Latent Preference Diffusion for Multi-Behavior Sequential Recommendation

This paper presents FatsMB, a novel framework for Multi-Behavior Sequential Recommendation (MBSR) that enhances user preference modeling ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22752] Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction

This article presents a study on the operational validity of using Large Language Models (LLMs) to simulate social media user behavior th...

arXiv - AI · 4 min · about 1 month ago

Nlp

[2602.23061] MoDora: Tree-Based Semi-Structured Document Analysis System

MoDora is a novel LLM-powered system designed for analyzing semi-structured documents, addressing challenges in information retrieval and...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.22740] AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation

The paper presents AMLRIS, a novel training strategy for Referring Image Segmentation (RIS) that enhances object segmentation through ali...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.22735] Simulation-based Optimization for Augmented Reading

This article presents a novel approach to augmented reading systems, proposing a simulation-based optimization framework that enhances te...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.22724] AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification

AgentSentry introduces a novel framework to mitigate indirect prompt injection (IPI) in LLM agents, enhancing their security while mainta...

arXiv - AI · 4 min · about 1 month ago

Ai Safety

[2602.22710] Same Words, Different Judgments: Modality Effects on Preference Alignment

This study explores how modality affects preference alignment in AI systems, comparing human and synthetic evaluations of audio and text ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.22698] Tokenization, Fusion and Decoupling: Bridging the Granularity Mismatch Between Large Language Models and Knowledge Graphs

This paper presents KGT, a novel framework addressing the granularity mismatch between large language models (LLMs) and knowledge graphs ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.22938] pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation

The paper presents pMoE, a novel Mixture-of-Experts prompt tuning method that enhances visual adaptation by integrating diverse domain kn...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.22925] Beyond NNGP: Large Deviations and Feature Learning in Bayesian Neural Networks

This paper explores the behavior of wide Bayesian neural networks, focusing on rare fluctuations that influence posterior concentration b...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.22683] SUPERGLASSES: Benchmarking Vision Language Models as Intelligent Agents for AI Smart Glasses

The paper introduces SUPERGLASSES, a benchmark for evaluating Vision Language Models (VLMs) in AI smart glasses, addressing the limitatio...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.22697] Reinforcing Real-world Service Agents: Balancing Utility and Cost in Task-oriented Dialogue

The paper presents InteractCS-RL, a novel framework for enhancing task-oriented dialogue systems by balancing empathetic communication an...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.22903] PSQE: A Theoretical-Practical Approach to Pseudo Seed Quality Enhancement for Unsupervised MMEA

The paper presents PSQE, a method for enhancing pseudo seed quality in unsupervised multimodal entity alignment, addressing challenges in...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22624] Instruction-based Image Editing with Planning, Reasoning, and Generation

This paper presents a novel approach to instruction-based image editing by integrating planning, reasoning, and generation through a mult...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.22884] Unsupervised Continual Learning for Amortized Bayesian Inference

This article presents a novel framework for Unsupervised Continual Learning in Amortized Bayesian Inference, addressing performance issue...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.22801] Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving

This article explores the application of diffusion models in end-to-end autonomous driving, demonstrating their effectiveness through ext...

arXiv - Machine Learning · 4 min · about 1 month ago

Generative Ai

[2602.22606] CoLyricist: Enhancing Lyric Writing with AI through Workflow-Aligned Support

CoLyricist is an AI-assisted tool designed to enhance the lyric writing process by aligning with the common workflows of lyricists, impro...

arXiv - AI · 3 min · about 1 month ago

Ai Agents

[2602.22786] QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning

The paper introduces QSIM, a novel framework that addresses the issue of Q-value overestimation in multi-agent reinforcement learning (MA...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 35 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

[2511.11483] ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

[2410.09134] Multi-Agent Actor-Critics in Autonomous Cyber Defense

All Content

[2602.22828] TCM-DiffRAG: Personalized Syndrome Differentiation Reasoning Method for Traditional Chinese Medicine based on Knowledge Graph and Chain of Thought

[2602.23136] Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs

[2602.23132] From Agnostic to Specific: Latent Preference Diffusion for Multi-Behavior Sequential Recommendation

[2602.22752] Towards Simulating Social Media Users with LLMs: Evaluating the Operational Validity of Conditioned Comment Prediction

[2602.23061] MoDora: Tree-Based Semi-Structured Document Analysis System

[2602.22740] AMLRIS: Alignment-aware Masked Learning for Referring Image Segmentation

[2602.22735] Simulation-based Optimization for Augmented Reading

[2602.22724] AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification

[2602.22710] Same Words, Different Judgments: Modality Effects on Preference Alignment

[2602.22698] Tokenization, Fusion and Decoupling: Bridging the Granularity Mismatch Between Large Language Models and Knowledge Graphs

[2602.22938] pMoE: Prompting Diverse Experts Together Wins More in Visual Adaptation

[2602.22925] Beyond NNGP: Large Deviations and Feature Learning in Bayesian Neural Networks

[2602.22683] SUPERGLASSES: Benchmarking Vision Language Models as Intelligent Agents for AI Smart Glasses

[2602.22697] Reinforcing Real-world Service Agents: Balancing Utility and Cost in Task-oriented Dialogue

[2602.22903] PSQE: A Theoretical-Practical Approach to Pseudo Seed Quality Enhancement for Unsupervised MMEA

[2602.22624] Instruction-based Image Editing with Planning, Reasoning, and Generation

[2602.22884] Unsupervised Continual Learning for Amortized Bayesian Inference

[2602.22801] Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving

[2602.22606] CoLyricist: Enhancing Lyric Writing with AI through Workflow-Aligned Support

[2602.22786] QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning

Related Topics

Stay updated with AI News