Skip to content
GitLab
Explore
Sign in
b1299
f5ef5cfb
·
ggml-cuda : perform cublas mat mul of quantized types as f16 (#3412)
·
Sep 30, 2023