Skip to content

Conversation

@jan-service-account
Copy link

Updates dev branch with latest release (b6795) from ggml-org/llama.cpp

shawngu-quic and others added 4 commits October 17, 2025 17:55
* opencl: transposed gemm/gemv moe kernel with mxfp4,f32

* add restore kernel for moe transpose

* fix trailing whitespaces

* resolve compilation warnings
Uses the technique used in the vulkan PR ggml-org#16641. Neat trick!
…6641)

This is similar to the CUDA shader from ggml-org#16130, but doesn't use shared memory
and handles different subgroup sizes.
@jan-service-account jan-service-account merged commit adfcaf2 into dev Oct 19, 2025
1 check passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-10-19-00-38 branch October 19, 2025 00:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants