Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
upd-issue-templates
b9bb4cbe
·
Separate bug and enhancement template + no default title
·
Oct 23, 2023
server-rev
c0f4d548
·
server : add comment about changing slot_state to bool
·
Oct 22, 2023
perf-study
cb79f8a2
·
llama : add SKIP_KQ_KQV option
·
Oct 22, 2023
sampling-refactor
56ba00b9
·
sampling : hide prev behind API and apply #3661
·
Oct 20, 2023
speculative-tree
ad2727d0
·
Merge branch 'master' into speculative-tree
·
Oct 18, 2023
llava-fix-offloading
932589c0
·
Honor -ngl option for Cuda offloading in llava
·
Oct 14, 2023
rev-sampling
5261aee8
·
sampling : one sequence per sampling context
·
Oct 12, 2023
batched-bench
2fcdf869
·
batched-bench : add mmq CLI arg
·
Oct 11, 2023
alloc-assert-fix
ee745692
·
ggml-alloc : fix assert in debug builds
·
Oct 09, 2023
fix-kv-cache-access
ee268b54
·
llama : no longer perform uninitialized access to the KV cache
·
Oct 08, 2023
fix-refact
acead654
·
Merge branch 'master' into fix-refact
·
Oct 08, 2023
metal-improve-batching
6b9554a7
·
metal : print more GPU info + disable mul_mm for MTLGPUFamiliy < Apple7
·
Oct 08, 2023
gguf-fix-publish
ba44776d
·
bump version
·
Oct 07, 2023
server-parallel
5ab6c213
·
server-parallel : add "--reverse-prompt" + compiler warning fixes
·
Oct 06, 2023
fix-sessions
5418932b
·
llama : fix comments for llama_kv_cache API
·
Oct 03, 2023
custom-attention-mask
c5650ed4
·
server : avoid context swaps by shifting the KV cache
·
Sep 28, 2023
cam-simple-fix
72e7ef4e
·
simple : fixes
·
Sep 26, 2023
custom-attention-mask-no-roped-cache
784d14ed
·
llama : store non-RoPEd K cache (WIP)
·
Sep 17, 2023
support-starcoder-fix
92a4f868
·
llama : make starcoder graph build more consistent with others
·
Sep 15, 2023
fix-cmake-out-of-source-install
c2217ca2
·
Fix llama.h location when built outside of root directory
·
Sep 14, 2023
Prev
1
…
6
7
8
9
10
11
12
13
Next