Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding — AI News