Tight Sample Complexity Bounds for Entropic Best Policy Identification — AI News