-
Notifications
You must be signed in to change notification settings - Fork 5.9k
Stable Cascade Controlnet training #8390
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Sorry, we won't have the bandwidth to work on that script right now. I will leave it open to the community in case someone from the community wants to pick it up. |
Cc: @kashif as well since he worked on the SD Cascade scripts. |
Hi @sayakpaul, could you pls tell how may I start working on this? Like how may I proceed in doing such tasks? Any help will be appreciated. |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
Closing due to inactivity. |
Is your feature request related to a problem? Please describe.
Last year I wrote a long article on how to train controlnets using diffusers, and trained two controlnets using diffusers. A few months ago Stable Cascade was released which requires less resources than SDXL and also should make controlnet training for high quality diffusion models more viable. I tried to run the official training script (batchsize=4, bf16) but ran into OOM. I'd hope that a diffusers implementation would provide more optimization and lower VRAM requirements.
Describe the solution you'd like.
Please provide a
examples/controlnet/train_controlnet_stablecascade.py
Describe alternatives you've considered.
The official training script => OOM
Additional context.
A training script for the Würstchen architecture was already anticipated a long time ago (see #5071).
Pretty please!
The text was updated successfully, but these errors were encountered: