AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

hey everyone. been lurking here for a while and wanted to share something we been building. the problem: ai coding agents are only as goo...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Ai Agents

Okta CEO: The next frontier of security is AI agent identity | The Verge

Todd McKinnon on why AI agents need an identity, security in an OpenClaw era, and being “paranoid” in preparing for the SaaSpocalypse.

The Verge - AI · 61 min · about 6 hours ago

Llms

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

Abstract page for arXiv paper 2506.20964: Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

arXiv - AI · 4 min · about 16 hours ago

All Content

Ai Agents

A new wearable AI system watches your hands through smart glasses, guiding experiments and stopping mistakes before they happen

A new AI wearable system utilizes smart glasses to monitor hand movements, enhancing experimental accuracy and preventing errors in real-...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Ai Agents

The Age of the Human Employee

The article discusses the potential decline of human employees in favor of digital alternatives, particularly in sectors like finance and...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Robotics

Anthropic vs. the Pentagon: What’s actually at stake? | TechCrunch

The article discusses the conflict between Anthropic and the Pentagon over the use of AI in military applications, focusing on ethical co...

TechCrunch - AI · 8 min · about 1 month ago

Llms

[D] On-device Game AI: would you try AI characters, and what should we build next? Discussion

The discussion focuses on developing on-device Game AI capable of real-time conversations and context-aware interactions, exploring poten...

Reddit - Machine Learning · 1 min · about 1 month ago

Machine Learning

AI vs. the Pentagon: killer robots, mass surveillance, and red lines | The Verge

The article discusses the ongoing conflict between AI companies, particularly Anthropic, and the Pentagon over military contract terms th...

The Verge - AI · 6 min · about 1 month ago

Robotics

We don’t have to have unsupervised killer robots | The Verge

The article discusses the Pentagon's ultimatum to Anthropic regarding military access to AI technology, raising ethical concerns among te...

The Verge - AI · 11 min · about 1 month ago

Ai Agents

Samsung’s Galaxy S26 AI camera features are a photography nightmare | The Verge

The Vergecast discusses Samsung's Galaxy S26 AI camera features, arguing they redefine photography and raise concerns about the essence o...

The Verge - AI · 5 min · about 1 month ago

Generative Ai

[R] Community Members, Kindly share your opinion on my article. Am I clear in my thoughts? Anything I miss here?

The article seeks feedback on a research piece about AI's role in creative writing, inviting community insights to enhance future work.

Reddit - Machine Learning · 1 min · about 1 month ago

Ai Safety

Anthropic Rejects the Pentagon’s Demand That It Remove AI Safeguards

Anthropic has rejected the Pentagon's demand to remove AI safeguards for its model Claude, aiming to prevent its use in mass surveillance...

AI Tools & Products · 5 min · about 1 month ago

Machine Learning

Fed on Reams of Cell Data, AI Maps New Neighborhoods in the Brain

Researchers are enhancing brain mapping by utilizing AI to analyze cellular data, leading to more detailed and functional brain maps that...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Llms

[2512.17053] Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL

This article presents a novel Knowledge Distillation framework, Struct-SQL, which enhances Small Language Models for Text-to-SQL tasks by...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.11836] ULTRA:Urdu Language Transformer-based Recommendation Architecture

The paper presents ULTRA, a transformer-based recommendation architecture tailored for the Urdu language, addressing challenges in semant...

arXiv - AI · 4 min · about 1 month ago

Ai Agents

[2510.25726] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

The Tool Decathlon introduces a benchmark for evaluating language agents on diverse, realistic, and complex tasks, highlighting significa...

arXiv - AI · 4 min · about 1 month ago

Nlp

[2510.20505] RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA

The paper presents RELOOP, a novel framework for recursive retrieval in heterogeneous question answering (QA) that enhances efficiency an...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.03495] AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents

AgentHub proposes a registry for AI agents that enhances discoverability, verifiability, and reproducibility, addressing gaps in current ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.02306] Spark: Modular Spiking Neural Networks

The paper presents Spark, a modular framework for spiking neural networks aimed at improving data and energy efficiency in AI applications.

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2512.24796] LeanCat: A Benchmark Suite for Formal Category Theory in Lean (Part I: 1-Categories)

LeanCat introduces a benchmark suite for formal category theory in Lean, highlighting the challenges in reasoning with high-level abstrac...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2512.14990] Imitation Game: Reproducing Deep Learning Bugs Leveraging an Intelligent Agent

The paper presents RepGen, an intelligent agent designed to automate the reproduction of deep learning bugs, achieving an 80.19% success ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.16801] Decoding Translation-Related Functional Sequences in 5'UTRs Using Interpretable Deep Learning Models

This paper presents UTR-STCNet, a novel deep learning model designed to analyze 5' untranslated regions (5'UTRs) for improved prediction ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.07885] Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

The paper presents a metric called Intelligence per Watt (IPW) to evaluate the efficiency of local AI models compared to centralized clou...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 28 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

Okta CEO: The next frontier of security is AI agent identity | The Verge

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

All Content

A new wearable AI system watches your hands through smart glasses, guiding experiments and stopping mistakes before they happen

The Age of the Human Employee

Anthropic vs. the Pentagon: What’s actually at stake? | TechCrunch

[D] On-device Game AI: would you try AI characters, and what should we build next? Discussion

AI vs. the Pentagon: killer robots, mass surveillance, and red lines | The Verge

We don’t have to have unsupervised killer robots | The Verge

Samsung’s Galaxy S26 AI camera features are a photography nightmare | The Verge

[R] Community Members, Kindly share your opinion on my article. Am I clear in my thoughts? Anything I miss here?

Anthropic Rejects the Pentagon’s Demand That It Remove AI Safeguards

Fed on Reams of Cell Data, AI Maps New Neighborhoods in the Brain

[2512.17053] Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL

[2602.11836] ULTRA:Urdu Language Transformer-based Recommendation Architecture

[2510.25726] The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

[2510.20505] RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA

[2510.03495] AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents

[2602.02306] Spark: Modular Spiking Neural Networks

[2512.24796] LeanCat: A Benchmark Suite for Formal Category Theory in Lean (Part I: 1-Categories)

[2512.14990] Imitation Game: Reproducing Deep Learning Bugs Leveraging an Intelligent Agent

[2507.16801] Decoding Translation-Related Functional Sequences in 5'UTRs Using Interpretable Deep Learning Models

[2511.07885] Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

Related Topics

Stay updated with AI News