Introduce block_softmax_adjustment kernel (#163) #263

Draft
wants to merge 1 commit into
base: deepseek_r1

Conversation

@kdamaszk (Contributor) commented Jul 8, 2025

Cherry-pick from main #163

* Add option to call block_softmax_adjustment op

* Enable block_softmax_adjustment by default for testing

* Add additional type conversion and checks for fp32_softmax

* Change default for VLLM_FUSED_BLOCK_SOFTMAX_ADJUSTMENT

* Reorder version checks and reorganize kernel specification

---------

Co-authored-by: Michal Adamczyk <[email protected]>
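
For reviewers who want a quick mental model of the toggle, here is a minimal sketch of how an environment flag such as `VLLM_FUSED_BLOCK_SOFTMAX_ADJUSTMENT` could gate a fused op with an unfused fallback. Only the flag name comes from this PR. The helper names (`env_flag`, `reference_block_adjustment`), the accepted flag spellings, the `fused_op` callback, and the assumption that the adjustment rescales per-block softmax partial sums to a shared maximum are all hypothetical and are not taken from the actual kernel.

```python
import math
import os

# Flag name as it appears in the PR description; everything else is illustrative.
FLAG = "VLLM_FUSED_BLOCK_SOFTMAX_ADJUSTMENT"


def env_flag(name, default=False):
    """Parse a boolean environment variable (accepted spellings are an assumption)."""
    raw = os.environ.get(name)
    if raw is None:
        return default
    return raw.strip().lower() in ("1", "true", "yes", "on")


def reference_block_adjustment(block_max, block_sums):
    """Hypothetical unfused reference.

    Each block holds a softmax partial sum computed relative to its own maximum.
    Multiplying by exp(block_max - group_max) re-expresses every partial sum
    relative to the single shared maximum, so the sums can be added safely.
    """
    group_max = max(block_max)
    adjustment = [math.exp(m - group_max) for m in block_max]
    adjusted_sums = [s * a for s, a in zip(block_sums, adjustment)]
    return adjustment, adjusted_sums


def block_softmax_adjustment(block_max, block_sums, fused_op=None):
    """Use the fused op only when the flag is on and an op is supplied."""
    if env_flag(FLAG) and fused_op is not None:
        return fused_op(block_max, block_sums)
    return reference_block_adjustment(block_max, block_sums)
```

Under these assumptions, the adjusted block sums add up to the same denominator a single-pass softmax over all scores would produce, which gives a simple way to compare a fused path against the fallback.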
czhu15 added a commit that referenced this pull request Jul 16, 2025

2 participants