
Conversation

@dxqb dxqb (Collaborator) commented Dec 21, 2025

Split attention can significantly speed up higher-resolution training, and low-resolution training to a lesser extent:
(screenshot: benchmark results showing the speed-up)
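
Rough sketch of the idea only, not the actual implementation (that lives in the diffusers PR linked below and may differ in detail); `split_attention` is just an illustrative name, not part of this PR's API:

```python
# Illustration only: run SDPA once per batch element instead of one
# large batched call. The real implementation is in the diffusers PR
# referenced below; `split_attention` is a hypothetical name.
import torch
import torch.nn.functional as F

def split_attention(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor) -> torch.Tensor:
    # q, k, v: (batch, heads, seq_len, head_dim)
    outputs = [
        F.scaled_dot_product_attention(q_i, k_i, v_i)
        for q_i, k_i, v_i in zip(q.split(1), k.split(1), v.split(1))
    ]
    return torch.cat(outputs, dim=0)
```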

To test:

git fetch origin pull/1218/head:pr-1218
git switch pr-1218

then update.

Limitations:

  • only implemented for Qwen, even though it could easily be extended to Chroma, Z-Image, and more
  • probably Linux-only, unless torch has recently improved SDPA in 2.8 or 2.9; still to be tested (see the backend check after this list)
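
If you want to see which SDPA backends your torch build enables (relevant to the Linux/Windows caveat above), a quick check like this works (not part of this PR):

```python
# Not part of this PR: report which SDPA backends the current torch
# build has enabled.
import torch

print("torch:", torch.__version__, "CUDA:", torch.version.cuda)
print("flash SDP enabled:", torch.backends.cuda.flash_sdp_enabled())
print("mem-efficient SDP enabled:", torch.backends.cuda.mem_efficient_sdp_enabled())
print("math SDP enabled:", torch.backends.cuda.math_sdp_enabled())
```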

On Linux, select the split attention option in the attention selection setting:
(screenshot: attention selection UI)

On Windows, install Flash Attention and select FLASH_SPLIT.
Pre-built wheels for Windows by @zzlol63: https://github.com/zzlol63/flash-attention-prebuild-wheels/releases
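
To confirm the Flash Attention wheel installed correctly before selecting FLASH_SPLIT, a plain import check is enough (again, not part of this PR):

```python
# Verify the flash_attn package imports and report versions; if this
# fails, FLASH_SPLIT will not be usable.
import torch
import flash_attn

print("torch:", torch.__version__, "CUDA:", torch.version.cuda)
print("flash_attn:", flash_attn.__version__)
```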

Uses huggingface/diffusers#12870.

@dxqb dxqb linked an issue Dec 21, 2025 that may be closed by this pull request: [Feat]: Splitting batched attention
dxqb added 2 commits December 26, 2025 19:03:

  • … attention
  • update
  • merge with attention selection, add FLASH_SPLIT
@dxqb dxqb (Collaborator, Author) commented Dec 26, 2025

This should now also work on Windows, with a similar speed-up, though I have no way to test it.

