RDKV: Rate-Distortion Bit Allocation for Joint Eviction and Quantization of the KV Cache — AI News