Fast Rates for Inverse Reinforcement Learning

Abstract: We establish novel structural and statistical results for entropy-regularized min-max inverse reinforcement learning (Min-Max-IRL) with linear reward classes in finite-horizon MDPs with Borel state and action spaces. On the structural side, we show that maximum likelihood estimation (MLE) and Min-Max-IRL are equivalent at the population level, and at the empirical level under deterministic dynamics. On the statistical side, exploiting pseudo-self-concordance of the Min-Max-IRL loss, we prove that both the trajectory-level KL divergence and the squared parameter error in the Hessian norm decay at the fast rate $\mathcal{O}(n^{-1})$, where $n$ is the number of expert trajectories. Our guarantees apply under misspecification and require no exploration assumptions. We further extend reward-identifiability results to general Borel spaces and derive novel results on the derivatives of the soft-optimal value function with respect to reward parameters.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2605.14599 [cs.LG]
	(or arXiv:2605.14599v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2605.14599 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Andreas Schlaginhaufen
[v1] Thu, 14 May 2026 09:07:31 UTC (34 KB)