b3063
750f60c0 · CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (#7681) · Jun 01, 2024