[2604.03237] The Persuasion Paradox: When LLM Explanations Fail to Improve Human-AI Team Performance
Computer Science > Human-Computer Interaction
arXiv:2604.03237 (cs)
[Submitted on 31 Jan 2026]

Title: The Persuasion Paradox: When LLM Explanations Fail to Improve Human-AI Team Performance
Authors: Ruth Cohen, Lu Feng, Ayala Bloch, Sarit Kraus

Abstract: While natural-language explanations from large language models (LLMs) are widely adopted to improve transparency and trust, their impact on objective human-AI team performance remains poorly understood. We identify a Persuasion Paradox: fluent explanations systematically increase user confidence and reliance on AI without reliably improving task accuracy, and in some cases actively undermine it. Across three controlled human-subject studies spanning abstract visual reasoning (RAVEN matrices) and deductive logical reasoning (LSAT problems), we disentangle the effects of AI predictions and explanations using a multi-stage reveal design and between-subjects comparisons. In visual reasoning, LLM explanations increase confidence but do not improve accuracy beyond the AI prediction alone, and substantially suppress users' ability to recover from model errors. Interfaces exposing model uncertainty via predicted probabilities, as well as a selective automation policy that defers uncertain cases to humans, achieve significantly higher accuracy and error recovery […]
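The selective automation policy the abstract mentions amounts to confidence-gated deferral: the AI's answer is accepted automatically only when its predicted probability clears a threshold, and uncertain cases are routed to the human. The sketch below illustrates that idea in Python; the threshold value, the `ask_human` callback, and all names are illustrative assumptions, not details taken from the paper.

```python
# A minimal sketch of a confidence-gated deferral policy, assuming a fixed
# probability threshold. All names and the threshold are hypothetical.
from dataclasses import dataclass
from typing import Callable


@dataclass
class Decision:
    answer: str           # final answer for the task item
    source: str           # "ai" if auto-accepted, "human" if deferred
    ai_confidence: float  # model's predicted probability for its answer


def selective_automation(
    ai_answer: str,
    ai_confidence: float,
    ask_human: Callable[[str, float], str],
    threshold: float = 0.8,  # assumed cutoff; the paper does not specify one here
) -> Decision:
    """Accept the AI's answer when it is confident; otherwise defer to the human."""
    if ai_confidence >= threshold:
        return Decision(ai_answer, "ai", ai_confidence)
    # Below threshold: hand the item to the human, exposing the model's
    # uncertainty so they can weigh the AI's suggestion appropriately.
    human_answer = ask_human(ai_answer, ai_confidence)
    return Decision(human_answer, "human", ai_confidence)
```

Under this scheme, explanations are never the gating signal; only the model's predicted probability determines whether the human is brought into the loop, which is consistent with the abstract's finding that exposing uncertainty, rather than adding fluent explanations, improves accuracy and error recovery.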