[2506.05634] AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization

arXiv - AI March 05, 2026 4 min read

Computer Science > Machine Learning

arXiv:2506.05634 (cs)

[Submitted on 5 Jun 2025 (v1), last revised 4 Mar 2026 (this version, v2)]

Title: AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization

Authors: Saeed Hedayatian, Stefanos Nikolaidis

Abstract: Quality-Diversity (QD) algorithms have shown remarkable success in discovering diverse, high-performing solutions, but rely heavily on hand-crafted behavioral descriptors that constrain exploration to predefined notions of diversity. Leveraging the equivalence between policies and occupancy measures, we present a theoretically grounded approach to automatically generate behavioral descriptors by embedding the occupancy measures of policies in Markov Decision Processes. Our method, AutoQD, leverages random Fourier features to approximate the Maximum Mean Discrepancy (MMD) between policy occupancy measures, creating embeddings whose distances reflect meaningful behavioral differences. A low-dimensional projection of these embeddings that captures the most behaviorally significant dimensions can then be used as behavioral descriptors for CMA-MAE, a state of the art blackbox QD method, to discover diverse policies. We prove that our embeddings converge to true MMD distances between occupancy measures as the number of sam...
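The abstract's central idea, approximating the MMD between two policies' occupancy measures with random Fourier features, can be illustrated with a short NumPy sketch. This is not the authors' implementation. It assumes a Gaussian kernel, and it stands in for rollout data with synthetic Gaussian samples. The function names (`sample_rff_params`, `embed_policy`, `exact_mmd_squared`), the bandwidth, and the feature count are all hypothetical choices made for the illustration. The optional `weights` argument is one way a discounted occupancy measure might be represented (for example, weights proportional to a discount factor raised to the time step), which is also an assumption.

```python
import numpy as np


def sample_rff_params(input_dim, num_features, bandwidth, rng):
    """Draw random Fourier feature parameters for a Gaussian kernel
    k(x, y) = exp(-||x - y||^2 / (2 * bandwidth^2))."""
    W = rng.normal(0.0, 1.0 / bandwidth, size=(input_dim, num_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=num_features)
    return W, b


def embed_policy(samples, W, b, weights=None):
    """Embed a set of visited (state, action) vectors as the (weighted)
    mean of their random Fourier features."""
    features = np.sqrt(2.0 / W.shape[1]) * np.cos(samples @ W + b)
    if weights is None:
        return features.mean(axis=0)
    weights = np.asarray(weights, dtype=float)
    return (weights / weights.sum()) @ features


def exact_mmd_squared(x, y, bandwidth):
    """Biased (V-statistic) estimate of squared MMD with the same kernel,
    used here only as a reference value for the approximation."""
    def mean_kernel(a, c):
        sq_dists = ((a[:, None, :] - c[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-sq_dists / (2.0 * bandwidth ** 2)).mean()
    return mean_kernel(x, x) + mean_kernel(y, y) - 2.0 * mean_kernel(x, y)


# Synthetic stand-ins for the state-action samples of two policies
# (assumption: real data would come from environment rollouts).
rng = np.random.default_rng(0)
visits_a = rng.normal(0.0, 1.0, size=(500, 3))
visits_b = rng.normal(1.0, 1.0, size=(500, 3))

W, b = sample_rff_params(input_dim=3, num_features=4000, bandwidth=1.0, rng=rng)
embedding_a = embed_policy(visits_a, W, b)
embedding_b = embed_policy(visits_b, W, b)

# Squared Euclidean distance between embeddings approximates squared MMD.
approx = float(np.sum((embedding_a - embedding_b) ** 2))
exact = float(exact_mmd_squared(visits_a, visits_b, bandwidth=1.0))
print(f"RFF approximation: {approx:.4f}, exact biased estimate: {exact:.4f}")
```

In a setup like the one the abstract describes, such embeddings would then be projected to a few dimensions and passed to the QD algorithm as behavioral descriptors. The projection step is not shown here because the excerpt does not specify it.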

Originally published on March 05, 2026. Curated by AI News.
