On the Generalization of Knowledge Distillation: An Information-Theoretic View — AI News