Holistic Evaluation and Failure Diagnosis of AI Agents — AI News