[2603.29025] The Model Says Walk: How Surface Heuristics Override

[2603.29025] The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning

arXiv - AI April 01, 2026 4 min read

About this article

Abstract page for arXiv paper 2603.29025: The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning

Computer Science > Computation and Language arXiv:2603.29025 (cs) [Submitted on 30 Mar 2026] Title:The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Authors:Yubo Li, Lu Zhang, Tianchong Jiang, Ramayya Krishnan, Rema Padman View a PDF of the paper titled The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning, by Yubo Li and 4 other authors View PDF HTML (experimental) Abstract:Large language models systematically fail when a salient surface cue conflicts with an unstated feasibility constraint. We study this through a diagnose-measure-bridge-treat framework. Causal-behavioral analysis of the ``car wash problem'' across six models reveals approximately context-independent sigmoid heuristics: the distance cue exerts 8.7 to 38 times more influence than the goal, and token-level attribution shows patterns more consistent with keyword associations than compositional inference. The Heuristic Override Benchmark (HOB) -- 500 instances spanning 4 heuristic by 5 constraint families with minimal pairs and explicitness gradients -- demonstrates generality across 14 models: under strict evaluation (10/10 correct), no model exceeds 75%, and presence constraints are hardest (44%). A minimal hint (e.g., emphasizing the key object) recovers +15 pp on average, suggesting the failure lies in constraint inference rather than missing knowledge; 12/14 models perform worse when the constraint is removed (up to -39 pp), r...

Originally published on April 01, 2026. Curated by AI News.

Llms

Gemini caught a $280M crypto exploit before it hit the news, then retracted it as a hallucination because I couldn't verify it - because the news hadn't dropped yet

So this happened mere hours ago and I feel like I genuinely stumbled onto something worth documenting for people interested in AI behavio...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

GPT-4 vs Claude vs Gemini for coding — honest breakdown after 3 months of daily use

I am a solo developer who has been using all three seriously. Here is what I actually think: GPT-4o — Strengths: Large context window, st...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Llms

You're giving feedback on a new version of ChatGPT

So I will be paying attention to these system messages more now- the last time I got one of these not so long back the 'tone' changed to ...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Llms

Gemma 4 actually running usable on an Android phone (not llama.cpp)

I wanted a real local assistant on my phone, not a demo. First tried the usual llama.cpp in Termux — Gemma 4 was 2–3 tok/s and the phone ...

Reddit - Artificial Intelligence · 1 min · about 9 hours ago

[2603.29025] The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning

About this article

Related Articles

Gemini caught a $280M crypto exploit before it hit the news, then retracted it as a hallucination because I couldn't verify it - because the news hadn't dropped yet

GPT-4 vs Claude vs Gemini for coding — honest breakdown after 3 months of daily use

You're giving feedback on a new version of ChatGPT

Gemma 4 actually running usable on an Android phone (not llama.cpp)

No comments

Stay updated with AI News