Branches · Till-Ole Herbst / Llama.Cpp · GitLab

upd-issue-templates

b9bb4cbe · Separate bug and enhancement template + no default title · Oct 23, 2023
server-rev

c0f4d548 · server : add comment about changing slot_state to bool · Oct 22, 2023
perf-study

cb79f8a2 · llama : add SKIP_KQ_KQV option · Oct 22, 2023
sampling-refactor

56ba00b9 · sampling : hide prev behind API and apply #3661 · Oct 20, 2023
speculative-tree

ad2727d0 · Merge branch 'master' into speculative-tree · Oct 18, 2023
llava-fix-offloading

932589c0 · Honor -ngl option for Cuda offloading in llava · Oct 14, 2023
rev-sampling

5261aee8 · sampling : one sequence per sampling context · Oct 12, 2023
batched-bench

2fcdf869 · batched-bench : add mmq CLI arg · Oct 11, 2023
alloc-assert-fix

ee745692 · ggml-alloc : fix assert in debug builds · Oct 09, 2023
fix-kv-cache-access

ee268b54 · llama : no longer perform uninitialized access to the KV cache · Oct 08, 2023
fix-refact

acead654 · Merge branch 'master' into fix-refact · Oct 08, 2023
metal-improve-batching

6b9554a7 · metal : print more GPU info + disable mul_mm for MTLGPUFamiliy < Apple7 · Oct 08, 2023
gguf-fix-publish

ba44776d · bump version · Oct 07, 2023
server-parallel

5ab6c213 · server-parallel : add "--reverse-prompt" + compiler warning fixes · Oct 06, 2023
fix-sessions

5418932b · llama : fix comments for llama_kv_cache API · Oct 03, 2023
custom-attention-mask

c5650ed4 · server : avoid context swaps by shifting the KV cache · Sep 28, 2023
cam-simple-fix

72e7ef4e · simple : fixes · Sep 26, 2023
custom-attention-mask-no-roped-cache

784d14ed · llama : store non-RoPEd K cache (WIP) · Sep 17, 2023
support-starcoder-fix

92a4f868 · llama : make starcoder graph build more consistent with others · Sep 15, 2023
fix-cmake-out-of-source-install

c2217ca2 · Fix llama.h location when built outside of root directory · Sep 14, 2023

Prev
1
…
6
7
8
9
10
11
12
13
Next