sycl: Cleanup codepaths in Get Rows in sycl backend #14215
@@ -277,13 +188,8 @@ void ggml_sycl_op_get_rows(ggml_backend_sycl_context & ctx, ggml_tensor * dst) {
src1_i32, (float *)dst->data, ctx.stream());
break;
case GGML_TYPE_Q4_0:
if (ctx.opt_feature.reorder && dst->op == GGML_OP_MUL_MAT) {
Highlighting this for future reference: the condition `dst->op == GGML_OP_MUL_MAT` is always false, because execution only gets here through the `GGML_OP_GET_ROWS` case of this switch:
llama.cpp/ggml/src/ggml-sycl/ggml-sycl.cpp
Lines 3612 to 3614 in 0dbcabd
case GGML_OP_GET_ROWS:
ggml_sycl_get_rows(ctx, dst);
break;
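To make the reasoning concrete, here is a minimal standalone sketch of why the inner check can never pass. All names in it (`op_kind`, `tensor`, `path_taken`, `get_rows_impl`, `dispatch`) are hypothetical stand-ins and not the real ggml types or functions; it only assumes that the get-rows function is called from a switch on the same `op` field that the inner condition tests.

```cpp
// Hypothetical stand-ins for the ggml types; only the field needed here.
enum op_kind { OP_GET_ROWS, OP_MUL_MAT };
struct tensor { op_kind op; };

enum path_taken { PATH_PLAIN, PATH_REORDER };

// Mirrors the shape of the removed check inside the Q4_0 case.
inline path_taken get_rows_impl(bool reorder_enabled, const tensor & dst) {
    if (reorder_enabled && dst.op == OP_MUL_MAT) {
        return PATH_REORDER;  // unreachable when called through dispatch()
    }
    return PATH_PLAIN;
}

// Mirrors the outer switch: get_rows_impl is only called for OP_GET_ROWS.
// Returns false when the op is not handled by this sketch.
inline bool dispatch(bool reorder_enabled, const tensor & dst, path_taken & out) {
    switch (dst.op) {
        case OP_GET_ROWS:
            out = get_rows_impl(reorder_enabled, dst);
            return true;
        default:
            return false;
    }
}
```

In this sketch, calling `dispatch` with the reorder flag enabled still yields `PATH_PLAIN` for a get-rows tensor, and a mul-mat tensor never reaches `get_rows_impl` at all.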
Pull Request Overview
This PR cleans up the SYCL backend’s Get Rows implementation by removing unused reorder codepaths and simplifying the dispatch logic.
- Eliminates the `k_get_rows_reorder` kernel and its host wrapper `get_rows_sycl_reorder`
- Removes the conditional branch for `ctx.opt_feature.reorder` in the Q4_0 case
- Leaves only the core `k_get_rows` and `get_rows_sycl` paths for all quantization types
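
For readers unfamiliar with the operation, a get-rows op gathers rows of a source matrix by index. The following is a rough CPU reference sketch for plain float data only; `get_rows_ref` is a hypothetical helper and not part of the SYCL backend, and it skips the dequantization step that the quantized types need.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical CPU reference: row i of the result is row ids[i] of src.
// src is row-major with n_cols values per row; no bounds checking is done.
inline std::vector<float> get_rows_ref(const std::vector<float> & src,
                                       std::size_t n_cols,
                                       const std::vector<std::int32_t> & ids) {
    std::vector<float> dst;
    dst.reserve(ids.size() * n_cols);
    for (std::int32_t id : ids) {
        const std::size_t start = static_cast<std::size_t>(id) * n_cols;
        // Append the n_cols values of the selected source row.
        dst.insert(dst.end(), src.begin() + start, src.begin() + start + n_cols);
    }
    return dst;
}
```

For a 3x2 source holding the values 0 through 5 and the indices 2 and 0, this sketch returns the third row followed by the first.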
Comments suppressed due to low confidence (1)
ggml/src/ggml-sycl/getrows.cpp:62
- Since `k_get_rows_reorder` and its host wrapper have been removed, verify whether `dequantize_kernel_t_reorder` is still referenced elsewhere; if not, remove its typedef and related includes to eliminate unused code.
template<int qk, int qr, dequantize_kernel_t_reorder dequantize_kernel_recorder, typename dst_t>
This PR removes some dead codepaths and unused functions from the Get Rows implementation in the SYCL backend.