b3063
750f60c0
·
CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (#7681)
·
Jun 01, 2024