Pull requests: ggml-org/llama.cpp

Pull requests list

Fix garbled output with REPACK at high thread counts (labels: ggml)
#16956 opened Nov 2, 2025 by NoahOksuz
CUDA: add implicit conv3d (labels: ggml, Nvidia GPU, testing)
#16948 opened Nov 2, 2025 by bssrdf (Draft)
Model: Minimax M2 - chat support (labels: testing)
#16946 opened Nov 2, 2025 by pwilkin
server : add props.model_alias (labels: examples, server)
#16943 opened Nov 2, 2025 by ggerganov
Model: add openPangu-Embedded (labels: python)
#16941 opened Nov 2, 2025 by Lpzhan931
Add e2e tests for embedding raw flag (labels: devops, examples, python, testing)
#16940 opened Nov 2, 2025 by SamMalayek
doc: Windows + clang/ninja build guide format cleanup (labels: documentation)
#16939 opened Nov 2, 2025 by jsjtxietian
CUDA: avoid mul + bias fusion when buffers are split (labels: ggml, Nvidia GPU)
#16935 opened Nov 2, 2025 by am17an
chat: Allow reasoning_content to be passed back (labels: examples, python, server)
#16934 opened Nov 2, 2025 by tarruda
benches : add folder with benchmarks
#16931 opened Nov 2, 2025 by ggerganov
Add initial devcontainer configuration
#16926 opened Nov 1, 2025 by FXJEFE
mtmd: add --image-min/max-tokens (labels: examples, server)
#16921 opened Nov 1, 2025 by ngxson
vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion (labels: ggml, Vulkan)
#16919 opened Nov 1, 2025 by jeffbolznv
add TheRock HIP backend build instructions (labels: documentation)
#16915 opened Nov 1, 2025 by lihaofd
opencl: support imrope (labels: ggml, OpenCL)
#16914 opened Nov 1, 2025 by lhez (Draft)
ggml-hexagon: replace sprintf with snprintf in ops-utils.h (labels: ggml)
#16913 opened Nov 1, 2025 by chraac
Vulkan: improve mul_mat_vec_iq1_m (labels: ggml, Vulkan)
#16907 opened Nov 1, 2025 by lovedheart
Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support (labels: ggml, Vulkan)
#16900 opened Oct 31, 2025 by 0cc4m
rpc: join small packets in send_msg and recv_msg (labels: ggml)
#16892 opened Oct 31, 2025 by jukofyork (Draft)