-
Notifications
You must be signed in to change notification settings - Fork 11.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
SYCL: Add all missing unary kernels
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13074
opened Apr 23, 2025 by
qnixsynapse
Loading…
Reduce enum sizes some are used in structs, which allowed them to be optimized.
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#13071
opened Apr 22, 2025 by
GermanAizek
Loading…
fix(rpc): Improve input validation and error handling
ggml
changes relating to the ggml tensor library for machine learning
#13069
opened Apr 22, 2025 by
thevilledev
Loading…
cmake : do not include ./src as public for libllama
examples
testing
Everything test related
#13062
opened Apr 22, 2025 by
ggerganov
Loading…
rpc : add command line option for number of threads for the CPU backend
examples
#13060
opened Apr 22, 2025 by
rgerganov
Loading…
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file
python
python script changes
#13058
opened Apr 22, 2025 by
glide-the
Loading…
Update README.md for tts example to use afplay on MacOS
examples
#13056
opened Apr 22, 2025 by
maxxam1221
Loading…
ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel
ggml
changes relating to the ggml tensor library for machine learning
#13053
opened Apr 21, 2025 by
eddnjjn
Loading…
llama-gemma3-cli: Sigint rework in gemma3 vision example
examples
#13043
opened Apr 21, 2025 by
pl752
Loading…
[CANN]Support OP MUL_MAT_ID
ggml
changes relating to the ggml tensor library for machine learning
#13042
opened Apr 21, 2025 by
noemotiovon
Loading…
gguf-py : avoid requiring PySide6 for packaged scripts
bugfix
fixes an issue or bug
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
#13036
opened Apr 20, 2025 by
compilade
Loading…
quantize: improve pattern matching for allowed tensors
examples
#13033
opened Apr 20, 2025 by
EAddario
Loading…
Bitnet: directly use scale instead of inverting it twice
python
python script changes
#13026
opened Apr 19, 2025 by
viraatdas
Loading…
Append mult-eos,half-rope,bos to GLM4-0414 and Z
python
python script changes
#13021
opened Apr 19, 2025 by
piDack
Loading…
2 of 4 tasks
vulkan: matmul gcn tuning
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#13016
opened Apr 18, 2025 by
netrunnereve
Loading…
Nix portability improvements
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#13005
opened Apr 18, 2025 by
hacker1024
Loading…
[SYCL][OPT] Fix reorder optimization for Q4_0
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13003
opened Apr 18, 2025 by
NeoZhangJianyu
Loading…
make memset range dynamic
ggml
changes relating to the ggml tensor library for machine learning
#13002
opened Apr 18, 2025 by
pockers21
Loading…
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling
examples
ggml
changes relating to the ggml tensor library for machine learning
#12995
opened Apr 17, 2025 by
max-krasnyansky
Loading…
[CANN] Add the n_graph_splits performance metric to llama-bench.
examples
#12994
opened Apr 17, 2025 by
bachelor-dou
Loading…
Fix convert script for non-hf GLM4 checkpoints
python
python script changes
#12992
opened Apr 17, 2025 by
Tianyue-Zhao
Loading…
2 of 4 tasks
sycl: use DNN in the first part of ggml_sycl_mul_mat_batched_sycl
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12972
opened Apr 16, 2025 by
lslusarczyk
•
Draft
Resolved half rope,multi-EOS issues in convert_hf_togguf.py for GLM4Z Model
python
python script changes
#12957
opened Apr 15, 2025 by
piDack
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.