DUET: Optimize Token-Budget Allocation for Reinforcement Learning with Verifiable Rewards — AI News