[2602.10625] To Think or Not To Think, That is The Question for Large

[2602.10625] To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

arXiv - AI March 03, 2026 4 min read

About this article

Abstract page for arXiv paper 2602.10625: To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

Computer Science > Artificial Intelligence arXiv:2602.10625 (cs) [Submitted on 11 Feb 2026 (v1), last revised 28 Feb 2026 (this version, v2)] Title:To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks Authors:Nanxu Gong, Haotian Li, Sixun Dong, Jianxun Lian, Yanjie Fu, Xing Xie View a PDF of the paper titled To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks, by Nanxu Gong and 5 other authors View PDF HTML (experimental) Abstract:Theory of Mind (ToM) assesses whether models can infer hidden mental states such as beliefs, desires, and intentions, which is essential for natural social interaction. Although recent progress in Large Reasoning Models (LRMs) has boosted step-by-step inference in mathematics and coding, it is still underexplored whether this benefit transfers to socio-cognitive skills. We present a systematic study of nine advanced Large Language Models (LLMs), comparing reasoning models with non-reasoning models on three representative ToM benchmarks. The results show that reasoning models do not consistently outperform non-reasoning models and sometimes perform worse. A fine-grained analysis reveals three insights. First, slow thinking collapses: accuracy significantly drops as responses grow longer, and larger reasoning budgets hurt performance. Second, moderate and adaptive reasoning benefits performance: constraining reasoning length mitigates failure, while distinct ...

Originally published on March 03, 2026. Curated by AI News.

Llms

Anthropic’s Unreleased Claude Mythos Might Be The Most Advanced AI Model Yet

Anthropic is testing an unreleased artificial intelligence (AI) model with capabilities that exceed any system it has previously released...

AI Tools & Products · 5 min · 30 minutes ago

Llms

Anthropic leaks part of Claude Code's internal source code

Claude Code has seen massive adoption over the last year, and its run-rate revenue had swelled to more than $2.5 billion as of February.

AI Tools & Products · 3 min · 30 minutes ago

Llms

Australian government and Anthropic sign MOU for AI safety and research

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

AI Tools & Products · 5 min · 30 minutes ago

Llms

Penguin to sue OpenAI over ChatGPT version of German children’s book

Publisher alleges AI research company’s chatbot violated its copyright over Coconut the Little Dragon series

AI Tools & Products · 3 min · 30 minutes ago

[2602.10625] To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

About this article

Related Articles

Anthropic’s Unreleased Claude Mythos Might Be The Most Advanced AI Model Yet

Anthropic leaks part of Claude Code's internal source code

Australian government and Anthropic sign MOU for AI safety and research

Penguin to sue OpenAI over ChatGPT version of German children’s book

No comments

Stay updated with AI News