Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
passkey
d57cb9c2
·
passkey : add readme
·
Jan 08, 2024
gg/fix-vld1q_s8_x4-4872
7216af5c
·
ggml : fix 32-bit ARM compat (cont)
·
Jan 09, 2024
gg/server-infill-empty-prompt-4027
24096933
·
server : try to fix infill when prompt is empty
·
Jan 09, 2024
ik/iq2_2.31bpw
9bfcb16f
·
Add llama enum for IQ2_XS
·
Jan 11, 2024
gg/update-phi2-convert
1fb563eb
·
py : try to fix flake stuff
·
Jan 13, 2024
gg/add-phixtral
9998ecd1
·
llama : add phixtral support (wip)
·
Jan 13, 2024
ik/imatrix_legacy_quants
bb9abb5c
·
imatrix: guard Q4_0/Q5_0 against ffn_down craziness
·
Jan 16, 2024
gg/iq2-refactor-and-tests
49bafe09
·
tests : avoid creating RNGs for each tensor
·
Jan 17, 2024
ik/better_q2_k_s
9fd1e83f
·
Use Q4_K for attn_v for Q2_K_S when n_gqa >= 4
·
Jan 17, 2024
gg/fix-spm-added-tokens-dict-4958
23742deb
·
py : fix padded dummy tokens (I hope)
·
Jan 17, 2024
gg/imatrix-gpu-4931
2917e6b5
·
Merge branch 'master' into gg/imatrix-gpu-4931
·
Jan 17, 2024
ik/faster_hellaswag
ccc78a20
·
hellaswag: speed up even more by parallelizing log-prob evaluation
·
Jan 18, 2024
ceb/nomic-vulkan-fix-add
14532151
·
kompute : fix ggml_add kernel
·
Jan 19, 2024
ceb/restore-convert
4a3bc152
·
py : linting with mypy and isort
·
Jan 19, 2024
ceb/fix-msvc-build
32a392fe
·
try a differerent fix
·
Jan 19, 2024
gg/flash-attn-online
a9681feb
·
ggml : online attention (CPU)
·
Jan 20, 2024
gg/flash-attn-wip2
06c2d0d1
·
wip
·
Jan 23, 2024
gg/flash-attn-wip4
da23b56f
·
wip : no ic 8 step
·
Jan 24, 2024
gg/flash-attn-wip3
6ccbd177
·
wip
·
Jan 24, 2024
gg/flash-attn-simd
2bf91c53
·
metal : clean up
·
Jan 25, 2024
Prev
1
…
3
4
5
6
7
8
9
10
11
…
13
Next