MINOS: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text — AI News