PreFT: Prefill-only finetuning for efficient inference — AI News