[2506.05334] Search Arena: Analyzing Search-Augmented LLMs

arXiv - Machine Learning March 04, 2026 4 min read

About this article

Abstract page for arXiv paper 2506.05334: Search Arena: Analyzing Search-Augmented LLMs

Computer Science > Computation and Language arXiv:2506.05334 (cs) [Submitted on 5 Jun 2025 (v1), last revised 2 Mar 2026 (this version, v2)] Title:Search Arena: Analyzing Search-Augmented LLMs Authors:Mihran Miroyan, Tsung-Han Wu, Logan King, Tianle Li, Jiayi Pan, Xinyan Hu, Wei-Lin Chiang, Anastasios N. Angelopoulos, Trevor Darrell, Narges Norouzi, Joseph E. Gonzalez View a PDF of the paper titled Search Arena: Analyzing Search-Augmented LLMs, by Mihran Miroyan and 10 other authors View PDF HTML (experimental) Abstract:Search-augmented language models combine web search with Large Language Models (LLMs) to improve response groundedness and freshness. However, analyzing these systems remains challenging: existing datasets are limited in scale and narrow in scope, often constrained to static, single-turn, fact-checking questions. In this work, we introduce Search Arena, a crowd-sourced, large-scale, human-preference dataset of over 24,000 paired multi-turn user interactions with search-augmented LLMs. The dataset spans diverse intents and languages, and contains full system traces with around 12,000 human preference votes. Our analysis reveals that user preferences are influenced by the number of citations, even when the cited content does not directly support the attributed claims, uncovering a gap between perceived and actual credibility. Furthermore, user preferences vary across cited sources, revealing that community-driven platforms are generally preferred and static e...

Originally published on March 04, 2026. Curated by AI News.

Llms

Apple to open Siri to rival AI services beyond ChatGPT

Apple plans to open its Siri voice assistant to rival artificial intelligence (AI) services, moving beyond its partnership with OpenAI, a...

AI Tools & Products · 4 min · 28 minutes ago

Llms

Claude's scheduled tasks finally fixed what ChatGPT, Gemini, and every other AI tool got wrong

The boring stuff finally does itself.

AI Tools & Products · 9 min · 28 minutes ago

Llms

ChatGPT Just Got 33% More Accurate (The AI News You Missed)

ChatGPT has improved its accuracy by 33%, marking a notable enhancement for users of the AI platform.

AI Tools & Products · 1 min · 28 minutes ago

Llms

Exclusive | The Sudden Fall of OpenAI’s Most Hyped Product Since ChatGPT

The content discusses the sudden decline of OpenAI's most anticipated product since ChatGPT.

AI Tools & Products · 1 min · 28 minutes ago

[2506.05334] Search Arena: Analyzing Search-Augmented LLMs

About this article

Related Articles

Apple to open Siri to rival AI services beyond ChatGPT

Claude's scheduled tasks finally fixed what ChatGPT, Gemini, and every other AI tool got wrong

ChatGPT Just Got 33% More Accurate (The AI News You Missed)

Exclusive | The Sudden Fall of OpenAI’s Most Hyped Product Since ChatGPT

No comments

Stay updated with AI News