DiT Pipeline #1806


Merged

merged 107 commits into huggingface:main from kashif:dit on Jan 17, 2023

Conversation

@kashif (Contributor) commented Dec 22, 2022

DiT pipeline: https://github.com/facebookresearch/DiT

  • model using CrossAttention
  • convert weights
  • pipeline
  • scheduler
from diffusers import DiTPipeline

# use "kashif/DiT-XL-2-512" for the 512 x 512 version
pipe = DiTPipeline.from_pretrained("kashif/DiT-XL-2-256")
pipe = pipe.to("cuda")

# class_labels are ImageNet-1k class ids; one image is generated per id
output = pipe(class_labels=[207, 360, 387, 974, 88, 979, 417, 279])

# output.images is a list of PIL images
output.images[4]

@HuggingFaceDocBuilderDev commented Dec 22, 2022

The documentation is not available anymore as the PR was closed or merged.

@patil-suraj (Contributor) left a comment

Amazing work @kashif, thanks a lot for quickly integrating this! I left a few comments, mostly nits. I'm wondering if it's possible to leverage the existing Transformer2DModel for this, or if we should leave DiT as a new model class. Given the model's performance, I think there will be many more models like this, so I'm leaning toward the latter (see the sketch after the checklist below). Curious to hear what you think @patrickvonplaten @pcuenca @anton-l @williamberman @yiyixuxu

A few more things need to be addressed before we can merge:

  • Check if it works with the existing scheduler or needs any changes.
  • Add tests.
  • Add docs.
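
For reference, a hedged sketch of how the DiT-XL/2 configuration could map onto the existing Transformer2DModel. The hyperparameter values come from the DiT class excerpted below; the norm_type and embedding arguments are assumptions about what such an integration might look like, though the "use Transformer2D" commit in the final log suggests this is roughly the direction the PR took:

from diffusers import Transformer2DModel

# Sketch only: DiT-XL/2 hyperparameters mapped onto the generic
# Transformer2DModel (hidden_size 1152 = 16 heads x 72 dims each).
transformer = Transformer2DModel(
    num_attention_heads=16,    # num_heads
    attention_head_dim=72,     # hidden_size // num_heads
    num_layers=28,             # depth
    in_channels=4,
    out_channels=8,            # noise prediction + learned sigma
    sample_size=32,            # input_size of the latent grid
    patch_size=2,
    num_embeds_ada_norm=1000,  # num_classes, for the class embedding
    norm_type="ada_norm_zero", # assumed: adaLN-Zero conditioning from the DiT paper
)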

Comment on lines 181 to 197
class DiT(ModelMixin, ConfigMixin):
    """
    Diffusion model with a Transformer backbone.
    """

    @register_to_config
    def __init__(
        self,
        input_size=32,
        patch_size=2,
        in_channels=4,
        hidden_size=1152,
        depth=28,
        num_heads=16,
        mlp_ratio=4.0,
        class_dropout_prob=0.1,
        num_classes=1000,

Very cool!

from ..pipeline_utils import DiffusionPipeline, ImagePipelineOutput


class DiTPipeline(DiffusionPipeline):

Very clean!
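
For context, a minimal sketch of the denoising loop a pipeline like this wraps. This is simplified and hypothetical, not the actual DiTPipeline implementation: it assumes the facebook/DiT-XL-2-256 weights and omits classifier-free guidance.

import torch
from diffusers import DiTPipeline

pipe = DiTPipeline.from_pretrained("facebook/DiT-XL-2-256").to("cuda")
transformer, vae, scheduler = pipe.transformer, pipe.vae, pipe.scheduler

class_labels = torch.tensor([207], device="cuda")
latents = torch.randn(1, transformer.config.in_channels, 32, 32, device="cuda")

scheduler.set_timesteps(25)
for t in scheduler.timesteps:
    timestep = t.expand(latents.shape[0]).to("cuda")
    # predict the noise residual for the current timestep
    model_out = transformer(latents, timestep=timestep, class_labels=class_labels).sample
    # DiT also predicts a learned variance as extra channels; keep the noise part
    noise_pred, _ = model_out.chunk(2, dim=1)
    latents = scheduler.step(noise_pred, t, latents).prev_sample

# decode the latents to pixel space (0.18215 is the VAE scaling the pipeline uses)
image = vae.decode(latents / 0.18215).sample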

@patrickvonplaten (Contributor) commented:

The failing tests seem to be flaky; I can't reproduce them locally. All slow pipeline tests passed locally, so let's merge it ❤️

@kashif I made some changes. The official model weights now live under facebook/DiT-XL-2-256 (and facebook/DiT-XL-2-512 for the 512 x 512 version).

Also added the ImageNet class labels directly to the model config.
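
With the labels in the config, human-readable ImageNet class names can be mapped to ids through the pipeline's get_label_ids helper. A minimal sketch (the class names are arbitrary examples):

import torch
from diffusers import DiTPipeline

pipe = DiTPipeline.from_pretrained("facebook/DiT-XL-2-256")
pipe = pipe.to("cuda")

# look up ImageNet-1k ids from the labels stored in the model config
class_ids = pipe.get_label_ids(["white shark", "umbrella"])

generator = torch.manual_seed(33)
output = pipe(class_labels=class_ids, generator=generator, num_inference_steps=25)
image = output.images[0]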

Great job on the PR!

@patrickvonplaten patrickvonplaten merged commit 37d113c into huggingface:main Jan 17, 2023
@kashif kashif deleted the dit branch January 17, 2023 22:09
@kashif (Contributor, Author) commented Jan 17, 2023

I should be the one to thank you!!

yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023
* added dit model
* import
* initial pipeline
* initial convert script
* initial pipeline
* make style
* raise ValueError
* single function
* rename classes
* use DDIMScheduler
* timesteps embedder
* samples to cpu
* fix var names
* fix numpy type
* use timesteps class for proj
* fix typo
* fix arg name
* flip_sin_to_cos and better var names
* fix C shape calc
* make style
* remove unused imports
* cleanup
* add back patch_size
* initial dit doc
* typo
* Update docs/source/api/pipelines/dit.mdx (Co-authored-by: Suraj Patil <[email protected]>)
* added copyright license headers
* added example usage and toc
* fix variable names asserts
* remove comment
* added docs
* fix typo
* upstream changes
* set proper device for drop_ids
* added initial dit pipeline test
* update docs
* fix imports
* make fix-copies
* isort
* fix imports
* get rid of more magic numbers
* fix code when guidance is off
* remove block_kwargs
* cleanup script
* removed to_2tuple
* use FeedForward class instead of another MLP
* style
* work on merging DiTBlock with BasicTransformerBlock
* added missing final_dropout and args to BasicTransformerBlock
* use norm from block
* fix arg
* remove unused arg
* fix call to class_embedder
* use timesteps
* make style
* attn_output gets multiplied
* removed commented code
* use Transformer2D
* use self.is_input_patches
* fix flags
* fixed conversion to use Transformer2DModel
* fixes for pipeline
* remove dit.py
* fix timesteps device
* use randn_tensor and fix fp16 inference
* timesteps_emb already the right dtype
* fix dit test class
* fix test and style
* fix norm2 usage in vq-diffusion
* added author names to pipeline and ImageNet labels link
* fix tests
* use norm_type as string
* rename dit to transformer
* fix name
* fix test
* set norm_type = "layer" by default
* fix tests
* do not skip common tests
* Update src/diffusers/models/attention.py (Co-authored-by: Suraj Patil <[email protected]>)
* revert AdaLayerNorm API
* fix norm_type name
* make sure all components are in eval mode
* revert norm2 API
* compact
* finish deprecation
* add slow tests
* remove @
* refactor some stuff
* upload
* Update src/diffusers/pipelines/dit/pipeline_dit.py
* finish more
* finish docs
* improve docs
* finish docs

Co-authored-by: Suraj Patil <[email protected]>
Co-authored-by: William Berman <[email protected]>
Co-authored-by: Patrick von Platen <[email protected]>