[2602.19404] One Size Fits None: Modeling NYC Taxi Trips

[2602.19404] One Size Fits None: Modeling NYC Taxi Trips

arXiv - AI 3 min read Article

Summary

This paper analyzes 280 million NYC taxi trips to compare tipping behaviors between traditional taxis and app-based services, revealing distinct modeling challenges.

Why It Matters

Understanding tipping patterns in NYC's evolving taxi landscape is crucial for both drivers and ride-sharing platforms. The findings highlight the limitations of universal predictive models, emphasizing the need for tailored approaches in data analysis and service optimization.

Key Takeaways

  • Traditional taxi tipping is predictable, with an R² of approximately 0.72.
  • App-based tipping shows randomness, with an R² of only about 0.17.
  • A single predictive model for both taxi types is ineffective due to Simpson's paradox.
  • Specialized models are necessary for accurate predictions in different taxi categories.
  • The study underscores the impact of technology on tipping culture in NYC.

Computer Science > Machine Learning arXiv:2602.19404 (cs) [Submitted on 10 Dec 2025] Title:One Size Fits None: Modeling NYC Taxi Trips Authors:Tomas Eglinskas View a PDF of the paper titled One Size Fits None: Modeling NYC Taxi Trips, by Tomas Eglinskas View PDF HTML (experimental) Abstract:The rise of app-based ride-sharing has fundamentally changed tipping culture in New York City. We analyzed 280 million trips from 2024 to see if we could predict tips for traditional taxis versus high-volume for-hire services. By testing methods from linear regression to deep neural networks, we found two very different outcomes. Traditional taxis are highly predictable ($R^2 \approx 0.72$) due to the in-car payment screen. In contrast, app-based tipping is random and hard to model ($R^2 \approx 0.17$). In conclusion, we show that building one universal model is a mistake and, due to Simpson's paradox, a combined model looks accurate on average but fails to predict tips for individual taxi categories requiring specialized models. Comments: Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI) Cite as: arXiv:2602.19404 [cs.LG]   (or arXiv:2602.19404v1 [cs.LG] for this version)   https://doi.org/10.48550/arXiv.2602.19404 Focus to learn more arXiv-issued DOI via DataCite (pending registration) Submission history From: Tomas Eglinskas [view email] [v1] Wed, 10 Dec 2025 23:20:40 UTC (2,678 KB) Full-text links: Access Paper: View a PDF of the paper titled One Size Fits None: Mod...

Related Articles

Machine Learning

[P] SpeakFlow - AI Dialogue Practice Coach with GLM 5.1

Built SpeakFlow for the Z.AI Builder Series hackathon. AI dialogue practice coach that evaluates your spoken responses in real-time. Two ...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

[R] ICML Anonymized git repos for rebuttal

A number of the papers I'm reviewing for have submitted additional figures and code through anonymized git repos (e.g. https://anonymous....

Reddit - Machine Learning · 1 min ·
Llms

[R] Reference model free behavioral discovery of AudiBench model organisms via Probe-Mediated Adaptive Auditing

Anthropic's AuditBench - 56 Llama 3.3 70B models with planted hidden behaviors - their best agent detects the behaviros 10-13% of the tim...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime