[2604.04720] What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features
Computer Science > Computation and Language

arXiv:2604.04720 (cs) [Submitted on 6 Apr 2026]

Title: What Makes Good Multilingual Reasoning? Disentangling Reasoning Traces with Measurable Features

Authors: Dayeon Ki, Kevin Duh, Marine Carpuat

Abstract: Large Reasoning Models (LRMs) still exhibit large performance gaps between English and other languages, yet much current work assumes these gaps can be closed simply by making reasoning in every language resemble English reasoning. This work challenges that assumption by asking instead: what actually characterizes effective reasoning in multilingual settings, and to what extent do English-derived reasoning features genuinely help in other languages? We first define a suite of measurable reasoning features spanning the multilingual-alignment, reasoning-step, and reasoning-flow aspects of reasoning traces, and use logistic regression to quantify how each feature associates with final-answer accuracy. We further train sparse autoencoders over multilingual traces to automatically discover latent reasoning concepts that instantiate or extend these features. Finally, we use the features as test-time selection policies to examine whether they can steer models toward stronger multilingual reasoning. Across two mathematical reasoning benchmarks, four LRMs, and 10 languages, w...
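The abstract's first analysis step, regressing per-trace features against final-answer correctness, can be illustrated with a minimal sketch. The feature names and the synthetic data below are invented for illustration; the paper's actual features and fitting setup are not specified on this page. The sketch fits a logistic regression by gradient descent and reads coefficient magnitudes as association strengths.

```python
# Hypothetical sketch: associate measurable reasoning-trace features
# with answer accuracy via logistic regression. Feature names and
# data are invented; only the technique follows the abstract.
import numpy as np

rng = np.random.default_rng(0)
n = 500
# Invented per-trace features: alignment score, step count, flow score
X = rng.normal(size=(n, 3))
# Synthetic correctness labels driven mostly by the first feature
y = (1.5 * X[:, 0] + 0.2 * rng.normal(size=n) > 0).astype(float)

# Fit logistic regression by full-batch gradient descent
w, b = np.zeros(3), 0.0
for _ in range(2000):
    p = 1 / (1 + np.exp(-(X @ w + b)))   # predicted P(correct)
    w -= 0.5 * (X.T @ (p - y) / n)       # gradient step on weights
    b -= 0.5 * (p - y).mean()            # gradient step on bias

# Coefficient magnitude ~ strength of association with accuracy
for name, coef in zip(["alignment", "step_count", "flow"], w):
    print(f"{name}: {coef:+.2f}")
```

By construction, the "alignment" coefficient dominates here; on real traces the fitted coefficients would rank the paper's features by how strongly each predicts a correct final answer.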