A Semantic-Sampling Framework for Evaluating Calibration in Open-Ended Question Answering — AI News