[2506.05634] AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization

arXiv:2506.05634 (cs), Computer Science > Machine Learning
Submitted on 5 Jun 2025 (v1), last revised 4 Mar 2026 (this version, v2)

Authors: Saeed Hedayatian, Stefanos Nikolaidis

Abstract: Quality-Diversity (QD) algorithms have shown remarkable success in discovering diverse, high-performing solutions, but rely heavily on hand-crafted behavioral descriptors that constrain exploration to predefined notions of diversity. Leveraging the equivalence between policies and occupancy measures, we present a theoretically grounded approach to automatically generate behavioral descriptors by embedding the occupancy measures of policies in Markov Decision Processes. Our method, AutoQD, leverages random Fourier features to approximate the Maximum Mean Discrepancy (MMD) between policy occupancy measures, creating embeddings whose distances reflect meaningful behavioral differences. A low-dimensional projection of these embeddings that captures the most behaviorally significant dimensions can then be used as behavioral descriptors for CMA-MAE, a state of the art blackbox QD method, to discover diverse policies. We prove that our embeddings converge to true MMD distances between occupancy measures as the number of sam...
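The embedding idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a Gaussian kernel, assumes that (state, action) vectors have already been sampled from each policy's occupancy measure, and uses a plain unweighted mean that ignores discounting. The helper names (`make_rff`, `embed_policy`, `distance`) and the toy Gaussian "policies" are hypothetical, and the low-dimensional projection and the CMA-MAE step are left out.

```python
import math
import random


def make_rff(input_dim, num_features, bandwidth, seed=0):
    """Sample a random Fourier feature map for a Gaussian kernel (assumed kernel choice)."""
    rng = random.Random(seed)
    # Frequencies w ~ N(0, bandwidth^-2 I) and phases b ~ U[0, 2*pi].
    weights = [[rng.gauss(0.0, 1.0 / bandwidth) for _ in range(input_dim)]
               for _ in range(num_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
    scale = math.sqrt(2.0 / num_features)

    def phi(x):
        # phi(x) . phi(y) approximates exp(-||x - y||^2 / (2 * bandwidth^2)).
        return [scale * math.cos(sum(wi * xi for wi, xi in zip(w, x)) + b)
                for w, b in zip(weights, phases)]

    return phi


def embed_policy(samples, phi):
    """Average the features of sampled (state, action) vectors into one embedding."""
    feats = [phi(x) for x in samples]
    return [sum(col) / len(feats) for col in zip(*feats)]


def distance(u, v):
    """Euclidean distance between embeddings, an estimate of the MMD."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def toy_samples(center, n, seed):
    """Hypothetical stand-in for rollouts: 2-D points around a center."""
    rng = random.Random(seed)
    return [[rng.gauss(c, 1.0) for c in center] for _ in range(n)]


phi = make_rff(input_dim=2, num_features=256, bandwidth=1.0, seed=0)

# Two "policies" with the same visitation pattern and one that is shifted.
emb_a = embed_policy(toy_samples([0.0, 0.0], 200, seed=1), phi)
emb_a2 = embed_policy(toy_samples([0.0, 0.0], 200, seed=2), phi)
emb_b = embed_policy(toy_samples([3.0, 3.0], 200, seed=3), phi)

d_same = distance(emb_a, emb_a2)
d_diff = distance(emb_a, emb_b)
print(f"same behavior: {d_same:.3f}, different behavior: {d_diff:.3f}")
```

In this toy setup the distance between the two policies drawn from the same distribution should come out well below the distance to the shifted one, which is the property the abstract describes: embedding distances that track behavioral differences. All policies must share the same feature map `phi`, since embeddings built from different random frequencies are not comparable.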

Originally published on March 05, 2026. Curated by AI News.
