[2603.23971] The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More
About this article
Abstract page for arXiv paper 2603.23971: The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More
Computer Science > Computation and Language arXiv:2603.23971 (cs) [Submitted on 25 Mar 2026] Title:The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More Authors:Lingjiao Chen, Chi Zhang, Yeye He, Ion Stoica, Matei Zaharia, James Zou View a PDF of the paper titled The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More, by Lingjiao Chen and Chi Zhang and Yeye He and Ion Stoica and Matei Zaharia and James Zou View PDF HTML (experimental) Abstract:Developers and consumers increasingly choose reasoning language models (RLMs) based on their listed API prices. However, how accurately do these prices reflect actual inference costs? We conduct the first systematic study of this question, evaluating 8 frontier RLMs across 9 diverse tasks covering competition math, science QA, code generation, and multi-domain reasoning. We uncover the pricing reversal phenomenon: in 21.8% of model-pair comparisons, the model with a lower listed price actually incurs a higher total cost, with reversal magnitude reaching up to 28x. For example, Gemini 3 Flash's listed price is 78% cheaper than GPT-5.2's, yet its actual cost across all tasks is 22% higher. We trace the root cause to vast heterogeneity in thinking token consumption: on the same query, one model may use 900% more thinking tokens than another. In fact, removing thinking token costs reduces ranking reversals by 70% and raises the rank correlation (Kendall's $\tau$ ) between price and cos...