Add sdxl prompt embeddings #3995

patrickvonplaten · 2023-07-07T13:14:20Z

What does this PR do?

Make sure prompt embeddings can be passed to SD-XL.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

sayakpaul · 2023-07-07T13:23:44Z

src/diffusers/pipelines/stable_diffusion_xl/__init__.py

@@ -17,13 +16,9 @@ class StableDiffusionXLPipelineOutput(BaseOutput):
        images (`List[PIL.Image.Image]` or `np.ndarray`)
            List of denoised PIL images of length `batch_size` or numpy array of shape `(batch_size, height, width,
            num_channels)`. PIL images or numpy array present the denoised images of the diffusion pipeline.
-        nsfw_content_detected (`List[bool]`)


Why are we removing this?

sayakpaul

I would have expected to see a change in encode_prompt() to accommodate the pooled embeddings, but I didn't notice one.

What am I missing? We should maybe also include tests for those.

HuggingFaceDocBuilderDev · 2023-07-07T13:27:34Z

The documentation is not available anymore as the PR was closed or merged.

pcuenca

I don't fully follow. How would this be used with the compel library?

pcuenca · 2023-07-07T13:28:47Z

src/diffusers/pipelines/stable_diffusion_xl/__init__.py

@@ -17,13 +16,9 @@ class StableDiffusionXLPipelineOutput(BaseOutput):
        images (`List[PIL.Image.Image]` or `np.ndarray`)
            List of denoised PIL images of length `batch_size` or numpy array of shape `(batch_size, height, width,
            num_channels)`. PIL images or numpy array present the denoised images of the diffusion pipeline.
-        nsfw_content_detected (`List[bool]`)


@sayakpaul There is no safety checker in these pipelines.

patrickvonplaten · 2023-07-07T14:50:37Z

Having chatted offline with @pcuenca, this is really the only way we can do things.

If the user passes prompt_embeds, we assume that the text encoder is not needed anymore and that the user does not have to pass prompt. This, however, means that the user also has to pass pooled_prompt_embeds.
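
The rule described in this comment can be sketched as a small input check. This is a minimal illustration, not the pipeline's actual code: the function name `check_embed_inputs`, its return values, and its error messages are hypothetical. Only the parameter names `prompt`, `prompt_embeds`, and `pooled_prompt_embeds` come from the discussion.

```python
from typing import Any, Optional


def check_embed_inputs(
    prompt: Optional[str] = None,
    prompt_embeds: Optional[Any] = None,
    pooled_prompt_embeds: Optional[Any] = None,
) -> str:
    """Hypothetical helper: decide which path supplies the text conditioning.

    Returns "encode" when the text encoders must run on `prompt`, and
    "precomputed" when the caller supplied both embedding inputs.
    """
    if prompt_embeds is None:
        # No precomputed embeddings, so a prompt is required to run the encoders.
        if prompt is None:
            raise ValueError("Pass either `prompt` or `prompt_embeds`.")
        return "encode"

    # Precomputed embeddings skip the text encoders, so the pooled output
    # cannot be derived here and has to be supplied by the caller.
    if pooled_prompt_embeds is None:
        raise ValueError(
            "`pooled_prompt_embeds` must be passed together with `prompt_embeds`."
        )
    return "precomputed"
```

With this sketch, `check_embed_inputs(prompt_embeds=x)` raises a `ValueError`, while passing both `prompt_embeds` and `pooled_prompt_embeds` without a `prompt` is accepted.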

patrickvonplaten · 2023-07-07T14:50:47Z

Merging

adhikjoshi · 2023-07-07T15:37:50Z

So pooled_prompt_embeds is also needed together with prompt_embeds.

We understand how to create the embeds, but are the pooled embeds the same, or are they different?

Can we pass the same prompt embeddings as both pooled_prompt_embeds and prompt_embeds?

If not, how can we create the pooled embeds? Can compel do it?

sayakpaul · 2023-07-07T15:46:43Z

Hi @adhikjoshi.

If you check out

diffusers/src/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl.py

Line 242 in 78922ed

def encode_prompt(

you will notice the code for computing the pooled prompt embeddings.

Let me know if that helps.
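
As a rough, shape-only sketch of why the two inputs are not interchangeable: for a single CLIP-style text encoder, the per-token embeddings hold one vector per token, while the pooled embedding is one vector per prompt. The helper name `embedding_shapes` and the example sizes (77 tokens, width 1280) are illustrative assumptions, not values taken from this PR.

```python
from typing import Tuple


def embedding_shapes(
    batch_size: int, seq_len: int, hidden_dim: int
) -> Tuple[Tuple[int, int, int], Tuple[int, int]]:
    """Hypothetical helper returning (per-token shape, pooled shape)."""
    per_token = (batch_size, seq_len, hidden_dim)  # one vector per token
    pooled = (batch_size, hidden_dim)  # one vector per prompt
    return per_token, pooled


# Example sizes are assumptions chosen for illustration only.
per_token_shape, pooled_shape = embedding_shapes(batch_size=1, seq_len=77, hidden_dim=1280)
print(per_token_shape, pooled_shape)
```

Because the two shapes differ in rank, the same tensor cannot serve as both inputs; the pooled one has to come from the encoder's pooled output, as encode_prompt does.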

bghira · 2023-07-07T16:26:13Z

                # We are only ALWAYS interested in the pooled output of the final text encoder
                negative_pooled_prompt_embeds = negative_prompt_embeds[0]

seems like it's not needed and we can just pass in the negative prompt embeds etc?

bghira · 2023-07-07T16:39:22Z

docs for encode_prompt say prompt is optional, but signature has no default value of None

patrickvonplaten added 4 commits July 7, 2023 15:13

Add sdxl prompt embeddings

27c4de0

Fix more

70da70a

Merge branch 'main' of https://github.com/huggingface/diffusers into …

431b324

…sdxl_prompt_embeddings

fix some slow tests

5344d4b

patrickvonplaten requested review from pcuenca and sayakpaul July 7, 2023 13:20

sayakpaul reviewed Jul 7, 2023

pcuenca reviewed Jul 7, 2023

patrickvonplaten merged commit 78922ed into main Jul 7, 2023

patrickvonplaten deleted the sdxl_prompt_embeddings branch July 7, 2023 14:50

patrickvonplaten mentioned this pull request Jul 9, 2023

[SD-XL] Ability to easily split prompt over the two text encoders #4004

Closed

xiaohu2015 mentioned this pull request Jul 14, 2023

fix a bug of prompt embeds in sdxl #4099

Merged

6 tasks

yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023

Add sdxl prompt embeddings (huggingface#3995)

9d3d85f

* Add sdxl prompt embeddings * Fix more * fix some slow tests

AmericanPresidentJimmyCarter pushed a commit to AmericanPresidentJimmyCarter/diffusers that referenced this pull request Apr 26, 2024

Add sdxl prompt embeddings (huggingface#3995)

201c743

* Add sdxl prompt embeddings * Fix more * fix some slow tests
