AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

AI Agents

Microsoft's newest open-source project: Runtime security for AI agents

submitted by /u/Fcking_Chuck

Reddit - Artificial Intelligence · 1 min ·
LLMs

[2510.16609] Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

arXiv - Machine Learning · 4 min ·
Machine Learning

[2604.02131] Intelligent Cloud Orchestration: A Hybrid Predictive and Heuristic Framework for Cost Optimization

arXiv - Machine Learning · 3 min ·

All Content

Machine Learning

[2602.21556] Power and Limitations of Aggregation in Compound AI Systems

The paper explores the effectiveness of aggregating outputs from multiple AI models in compound AI systems, examining its potential to en...

arXiv - AI · 4 min ·
Machine Learning

[2602.21534] ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

The paper presents ARLArena, a framework designed to enhance stability in agentic reinforcement learning (ARL) by providing a systematic ...

arXiv - AI · 4 min ·
LLMs

[2602.21496] Beyond Refusal: Probing the Limits of Agentic Self-Correction for Semantic Sensitive Information

The paper explores the limitations of self-correction in Large Language Models (LLMs) regarding semantic sensitive information, introduci...

arXiv - AI · 3 min ·
LLMs

[2602.21351] A Hierarchical Multi-Agent System for Autonomous Discovery in Geoscientific Data Archives

The paper presents PANGAEA-GPT, a hierarchical multi-agent system designed to enhance autonomous data discovery in geoscientific archives...

arXiv - AI · 3 min ·
LLMs

Showed to some friends, they said post on reddit. I said hmk.

An AI enthusiast shares a project overview on Reddit, seeking feedback on a front-end tool for memory that integrates with various AI mod...

Reddit - Artificial Intelligence · 1 min ·
AI Agents

had a voice conversation with my physical ai system today

The author shares their experience of having a voice conversation with a physical AI system, highlighting its contextual understanding an...

Reddit - Artificial Intelligence · 1 min ·
AI Agents

Salesforce CEO Marc Benioff: This isn't our first SaaSpocalypse | TechCrunch

Salesforce CEO Marc Benioff reassures investors during the earnings call, emphasizing the company's resilience amid fears of an AI-driven...

TechCrunch - AI · 6 min ·
AI Agents

Anthropic acquires computer-use AI startup Vercept after Meta poached one of its founders | TechCrunch

Anthropic has acquired Vercept, an AI startup known for developing advanced agentic tools, following the poaching of one of its founders ...

TechCrunch - AI · 6 min ·
AI Agents

How Quickly Will A.I. Agents Rip Through the Economy?

The article features an in-depth interview with an Anthropic co-founder discussing the potential impact of AI agents on the economy, explori...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

We built a cryptographic authorization gateway for AI agents and planning to run limited red-team sessions

Sentinel Gateway addresses the challenge of instruction provenance in AI agents by ensuring only user-signed prompts are treated as execu...

Reddit - Artificial Intelligence · 1 min ·
LLMs

[R] Made my own engine for Social-Simulations

The article discusses the creation of a custom engine for social simulations using LLMs, where agents interact in a controlled environmen...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Is advantage learning dead or unexplored?

The discussion centers on the current status of advantage learning in Q-learning optimization, questioning whether it is a dead end or st...

Reddit - Machine Learning · 1 min ·
LLMs

Google and Samsung just launched the AI features Apple couldn’t with Siri | The Verge

Google's Gemini introduces new AI features for multi-step tasks on phones, challenging Apple's delayed Siri enhancements, showcasing adva...

The Verge - AI · 7 min ·
AI Agents

OpenClaw Users Are Allegedly Bypassing Anti-Bot Systems | WIRED

The article discusses how users of the OpenClaw AI tool are leveraging an open-source project called Scrapling to bypass anti-bot systems...

Wired - AI · 6 min ·
LLMs

Google's Aletheia AI Agent Autonomously Solves 6/10 Novel FirstProof Math Problems

Google's Aletheia AI agent successfully solved 6 out of 10 novel math problems in the FirstProof challenge, showcasing advancements in AI...

Reddit - Artificial Intelligence · 1 min ·
LLMs

Gemini Can Now Book You an Uber or Order a DoorDash Meal on Your Phone. Here’s How It Works | WIRED

Google's Gemini voice assistant can now automate tasks in apps like Uber and DoorDash, starting with the Samsung Galaxy S26, enhancing us...

Wired - AI · 9 min ·
LLMs

Google Gemini can book an Uber or order food for you with new agentic AI features | The Verge

Google's Gemini AI now features task automation, allowing users to order rides and groceries with minimal input, enhancing user convenien...

The Verge - AI · 6 min ·
LLMs

[D] What exactly do companies mean by "AI Agents" right now? (NLP Grad Student)

The article discusses the ambiguity surrounding the term 'AI Agents' in job descriptions, particularly for roles in machine learning and ...

Reddit - Machine Learning · 1 min ·
LLMs

[D] Is it possible to create a benchmark that can measure human-like intelligence?

The article discusses the limitations of current benchmarks for measuring human-like intelligence in AI, highlighting Francois Chollet's ...

Reddit - Machine Learning · 1 min ·
LLMs

Gemini can now automate some multi-step tasks on Android | TechCrunch

Google's Gemini AI on Android now automates multi-step tasks like ordering rides or food delivery, enhancing user convenience while maint...

TechCrunch - AI · 6 min ·