Generative Modeling by Minimizing the Wasserstein-2 Loss

View PDF HTML (experimental)

Abstract:This paper develops a generative model by minimizing the second-order Wasserstein loss (the $W_2$ loss) through a distribution-dependent ordinary differential equation (ODE), whose dynamics involves the Kantorovich potential associated with the true data distribution and a current estimate of it. A main result shows that the time-marginal laws of the ODE form a gradient flow for the $W_2$ loss, which converges exponentially to the true data distribution. An Euler scheme for the ODE is proposed and it is shown to recover the gradient flow for the $W_2$ loss in the limit. An algorithm is designed by following the scheme and applying persistent training, which naturally fits our gradient-flow approach. In both low- and high-dimensional experiments, our algorithm outperforms Wasserstein generative adversarial networks by increasing the level of persistent training appropriately.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
MSC classes:	34A06, 49Q22, 68T01
Cite as:	arXiv:2406.13619 [stat.ML]
	(or arXiv:2406.13619v4 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2406.13619 arXiv-issued DOI via DataCite

Submission history

From: Yu-Jui Huang [view email]
[v1] Wed, 19 Jun 2024 15:15:00 UTC (318 KB)
[v2] Sun, 14 Jul 2024 05:54:39 UTC (318 KB)
[v3] Sun, 28 Dec 2025 21:53:24 UTC (461 KB)
[v4] Tue, 12 May 2026 21:20:49 UTC (461 KB)