Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
dev
a169bb88
·
Gate signal support on being on a unixoid system. (#74)
·
Mar 13, 2023
tcp_server
3a0dcb39
·
Implement server mode.
·
Mar 22, 2023
q4_1_more_accel_loopsplit
71122470
·
Break up loop for numeric stability
·
Mar 23, 2023
q4_1_more_accel_kahan
66ea164e
·
Kahan summation on Q4_1
·
Mar 23, 2023
q4_1_more_accel
4aeee216
·
Regroup q4_1 dot addition for better numerics.
·
Mar 24, 2023
mmap
c9c820ff
·
Added support for _POSIX_MAPPED_FILES if defined in source (#564)
·
Mar 28, 2023
flash-attn
36ddd129
·
llama : add flash attention (demo)
·
Apr 05, 2023
mmap-pages-stats
15067374
·
Add mmap pages stats (disabled by default)
·
Apr 16, 2023
quant-attn
4b8d5e38
·
llama : quantize attention results
·
Apr 22, 2023
gg/rmse_quantization
a0242a83
·
Minor, plus rebase on master
·
Apr 22, 2023
q4_0-q4_2-range-fix
71e6ae37
·
ggml : continue from #729 (wip)
·
Apr 22, 2023
q4_3-range-fix
102cd980
·
ggml : Q4_3c using 2x "Full range" approach
·
Apr 23, 2023
ci_cublas
31ff9e2e
·
ci : add cublas to windows release
·
May 03, 2023
jed/spm-clblast
4baa8563
·
Fix build
·
May 06, 2023
remove-vzip
e116eb63
·
ggml : speed-up Q5_0 + Q5_1 at 4 threads
·
May 11, 2023
dequantize-matmul-3-gg
a3e6d622
·
cuda : alternative q4_q8 kernel
·
May 12, 2023
f16c
40ec4882
·
ggml : use F16C conversion when available
·
May 17, 2023
steering
95dc4d72
·
Merge 'origin/master' into steering
·
May 19, 2023
chunks
a1cdd29c
·
ggml : rms_norm in chunks
·
May 20, 2023
fix_clblast
20054a38
·
Fix directory name
·
May 27, 2023
Prev
1
2
3
4
5
…
13
Next