[2511.14827] Implicit Bias of the JKO Scheme
About this article
Abstract page for arXiv paper 2511.14827: Implicit Bias of the JKO Scheme
Statistics > Machine Learning arXiv:2511.14827 (stat) [Submitted on 18 Nov 2025 (v1), last revised 3 Mar 2026 (this version, v3)] Title:Implicit Bias of the JKO Scheme Authors:Peter Halmos, Boris Hanin View a PDF of the paper titled Implicit Bias of the JKO Scheme, by Peter Halmos and Boris Hanin View PDF HTML (experimental) Abstract:Wasserstein gradient flow provides a general framework for minimizing an energy functional $J$ over the space of probability measures on a Riemannian manifold $(M,g)$. Its canonical time-discretization, the Jordan-Kinderlehrer-Otto (JKO) scheme, produces for any step size $\eta>0$ a sequence of probability distributions $\rho_k^\eta$ that approximate to first order in $\eta$ Wasserstein gradient flow on $J$. But the JKO scheme also has many other remarkable properties not shared by other first order integrators, e.g. it preserves energy dissipation and exhibits unconditional stability for $\lambda$-geodesically convex functionals $J$. To better understand the JKO scheme we characterize its implicit bias at second order in $\eta$. We show that $\rho_k^\eta$ are approximated to order $\eta^2$ by Wasserstein gradient flow on a modified energy \[ J^{\eta}(\rho) = J(\rho) - \frac{\eta}{4}\int_M \Big\lVert \nabla_g \frac{\delta J}{\delta \rho} (\rho) \Big\rVert_{2}^{2} \,\rho(dx), \] obtained by subtracting from $J$ the squared metric curvature of $J$ times $\eta/4$. The JKO scheme therefore adds at second order in $\eta$ a deceleration in direction...