AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

AI Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? Submitted by /u/stvlsn.

Reddit - Artificial Intelligence · 1 min ·
AI Agents

Spent today at MIT's Open Agentic Web conference. Six things worth thinking about.

We're in the DNS era of agent infrastructure. Before agents can find and trust each other at scale, you need identity, attestation, reput...

Reddit - Artificial Intelligence · 1 min ·
AI Agents

AMD's GAIA Now Allows Building Custom AI Agents Via Chat, Becomes "True Desktop App"

In addition to their efforts around the Lemonade SDK itself, AMD software engineers working on their AI initiatives continue to be invest...

AI Tools & Products · 4 min ·

All Content

LLMs

[2602.13255] DPBench: Large Language Models Struggle with Simultaneous Coordination

The paper introduces DPBench, a benchmark assessing how well large language models (LLMs) coordinate in multi-agent systems, revealing si...

arXiv - AI · 3 min ·
LLMs

[2602.13235] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

The paper introduces Lang2Act, a novel framework for enhancing visual reasoning in Vision-Language Models (VLMs) through self-emergent li...

arXiv - AI · 4 min ·
LLMs

[2602.13237] NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

NL2LOGIC presents a novel framework for translating natural language into first-order logic using large language models, enhancing accura...

arXiv - AI · 4 min ·
LLMs

[2602.13234] Stay in Character, Stay Safe: Dual-Cycle Adversarial Self-Evolution for Safety Role-Playing Agents

The paper presents a novel framework, Dual-Cycle Adversarial Self-Evolution, aimed at enhancing the safety and fidelity of role-playing a...

arXiv - AI · 4 min ·
Machine Learning

[2602.13230] Intelligence as Trajectory-Dominant Pareto Optimization

The paper presents a novel framework for understanding intelligence through the lens of trajectory-dominant Pareto optimization, addressi...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.13218] Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning

The paper presents SSLogic, a novel framework for scaling logical reasoning in reinforcement learning, enhancing the synthesis of verifia...

arXiv - AI · 4 min ·
LLMs

[2602.13214] BotzoneBench: Scalable LLM Evaluation via Graded AI Anchors

The paper presents BotzoneBench, a scalable framework for evaluating Large Language Models (LLMs) using graded AI anchors, addressing the...

arXiv - AI · 4 min ·
AI Agents

[2602.13213] Agentic AI for Commercial Insurance Underwriting with Adversarial Self-Critique

This paper presents an agentic AI system for commercial insurance underwriting that incorporates adversarial self-critique to enhance dec...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.13215] When to Think Fast and Slow? AMOR: Entropy-Based Metacognitive Gate for Dynamic SSM-Attention Switching

The paper presents AMOR, an entropy-based metacognitive gate that enhances attention switching in state space models, improving efficienc...

arXiv - AI · 3 min ·
Machine Learning

[R] We spent a decade scaling models. Now, by just shifting towards memory and continual learning, we can get to a human like AI or "A-GEE-I"

The article discusses the shift from scaling AI models to enhancing memory and continual learning as key factors for achieving human-like...

Reddit - Machine Learning · 1 min ·
AI Agents

Google-Ipsos Report Finds South Africans Rapidly Adopting AI for Learning, Work

A recent Google-Ipsos report reveals that South Africans are increasingly adopting AI tools for learning, work, and major life decisions,...

AI News - General · 5 min ·
Machine Learning

How AI is Transforming Document Processing and PDF Workflows

The article discusses how AI is revolutionizing document processing and PDF workflows, highlighting advancements in automation, accuracy,...

AI News - General · 10 min ·
Machine Learning

[R] Learning State-Tracking from Code Using Linear RNNs

This article discusses the use of linear RNNs for state-tracking tasks, particularly focusing on permutation composition and its implicat...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Self-Reference Circuits in Transformers: Do Induction Heads Create De Se Beliefs?

This article explores how transformers process indexical language, focusing on self-reference circuits and their implications for underst...

Reddit - Machine Learning · 1 min ·
AI Agents

AI Agents Handle 30% of Airbnb Customer Support Tickets as Company Expands Automation

Airbnb's AI agents now handle 30% of customer support tickets in North America, enhancing service efficiency and customer satisfaction as...

AI Tools & Products · 6 min ·
LLMs

Anthropic tries to hide Claude's AI actions. Devs hate it

Anthropic's Claude Code update conceals file access details, prompting backlash from developers who rely on this information for effectiv...

AI Tools & Products · 7 min ·
Machine Learning

AI set to make medical scan reports twice as easy to understand for patients

AI advancements are set to simplify medical scan reports, making them more comprehensible for patients, enhancing their understanding of ...

AI Tools & Products · 1 min ·
Generative AI

Reddit's human content wins amid the AI flood

Reddit emphasizes the value of human-generated content as users seek authentic interactions amid a surge of AI-generated material, highli...

AI Tools & Products · 6 min ·
AI Safety

Fraudulent AI Assistants Target User Information

A wave of malicious browser extensions masquerading as AI assistants has emerged on Google's Chrome Web Store, stealing users' personal i...

AI Tools & Products · 4 min ·
AI Agents

Manus launches personal AI agents in Telegram, with more messaging apps to come

Manus has launched personal AI agents in Telegram, enabling users to create customized agents for complex tasks. More integrations with o...

AI Tools & Products · 5 min ·