BranPO: Scalable Contrastive Branch Sampling for Long-Horizon Agentic Reinforcement Learning — AI News