-
Notifications
You must be signed in to change notification settings - Fork 5.9k
using zero stage3 finetune sd2, dimension error occurs #1865
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I don't really know much about deepspeed maybe @williamberman knows more here. |
I don't know off the top of my head. Someone opened an issue with the same error message using deepspeed stage 3 with transformers. deepspeedai/DeepSpeed#2746 Maybe best to see if the deepspeed team knows before digging in :) |
Fixed in #3076 (but zero-3 support is only partial) Please carefully read the OP of the PR for details. |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
Describe the bug
An error is reported when using deepspeed's zero stage3 finetune diffusers/examples/text_to_image/train_text_to_image.py script. My machine's GPU is 4*2080ti, and because a single GPU cannot accommodate all SD2 parameters, the deepspeed zero stage3 strategy must be used.
Reproduction
accelerate.yaml
/home/kas/zero_stage3_offload_config.json
launch script
Logs
System Info
diffusers
version: 0.11.1The text was updated successfully, but these errors were encountered: