MUON+: Towards More Effective Muon via One Additional Normalization Step for LLM Pre-training — AI News