Every Token-Based Language Model Is Throwing Away Information at the Last Step - AI News