Towards Generalizable Reasoning: Group Causal Counterfactual Policy Optimization for LLM Reasoning — AI News