-
Notifications
You must be signed in to change notification settings - Fork 13.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix garbled output with REPACK at high thread counts
ggml
changes relating to the ggml tensor library for machine learning
#16956
opened Nov 2, 2025 by
NoahOksuz
Loading…
CUDA: add implicit conv3d
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
Model: Minimax M2 - chat support
testing
Everything test related
#16946
opened Nov 2, 2025 by
pwilkin
Loading…
Model: add openPangu-Embedded
python
python script changes
#16941
opened Nov 2, 2025 by
Lpzhan931
Loading…
Add e2e tests for embedding raw flag
devops
improvements to build systems and github actions
examples
python
python script changes
testing
Everything test related
#16940
opened Nov 2, 2025 by
SamMalayek
Loading…
doc: Windows + clang/ninja build guide format cleanup
documentation
Improvements or additions to documentation
#16939
opened Nov 2, 2025 by
jsjtxietian
Loading…
CUDA: avoid mul + bias fusion when buffers are split
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16935
opened Nov 2, 2025 by
am17an
Loading…
server: add minimax-m2 reasoning format override for MiniMax-M2 compatibility
examples
server
#16933
opened Nov 2, 2025 by
ServeurpersoCom
•
Draft
common: Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS)
testing
Everything test related
#16932
opened Nov 2, 2025 by
hksdpc255
Loading…
hparams : add n_embd_full to support extended embed
examples
#16928
opened Nov 1, 2025 by
CISC
Loading…
vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16919
opened Nov 1, 2025 by
jeffbolznv
Loading…
add TheRock HIP backend build instructions
documentation
Improvements or additions to documentation
#16915
opened Nov 1, 2025 by
lihaofd
Loading…
opencl: support imrope
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
ggml-hexagon: replace sprintf with snprintf in changes relating to the ggml tensor library for machine learning
ops-utils.h
ggml
#16913
opened Nov 1, 2025 by
chraac
Loading…
Vulkan: improve mul_mat_vec_iq1_m
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16907
opened Nov 1, 2025 by
lovedheart
Loading…
Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16900
opened Oct 31, 2025 by
0cc4m
Loading…
rpc: join small packets in changes relating to the ggml tensor library for machine learning
send_msg and recv_msg
ggml
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.