[Pipelines] Support for T2I-Adapter #2390

wfng92 · 2023-02-17T05:49:35Z

Model/Pipeline/Scheduler description

From the official repository, T2I-Adapter by @TencentARC is

... a simple and small (~70M parameters, ~300M storage space) network that can provide extra guidance to pre-trained text-to-image models while freezing the original large text-to-image models.
T2I-Adapter aligns internal knowledge in T2I models with external control signals. We can train various adapters according to different conditions, and achieve rich control and editing effects.

Would be great to have this plug and play adapters in diffusers module.

Open source status

The model implementation is available
The model weights are available (Only relevant if addition is not a scheduler).

Provide useful links for the implementation

Original code: https://github.com/TencentARC/T2I-Adapter
Pre-trained models: https://huggingface.co/TencentARC/T2I-Adapter

The text was updated successfully, but these errors were encountered:

HimariO · 2023-02-19T07:38:54Z

I have made a quick attempt to implement the T2I-Adapter in diffusers, which can be found over here.
Based on the results obtained from the t2iadapter_seg_sd14v1.pth adapter, it appears to be working correctly.:

The adapter module itself is quite simple, so I think the main consideration of integrating the adapter into diffusers will be:

Should the design of the adapter module allow it to inject the adapter hidden state to any layer inside UNet (in the official implementation the adapter state is always added after the last ResnetBlock from each downsample block)
Should the adapter be integrated into UNet, since the number of feature maps and size of feature maps the adapter output all depend on the UNet model it is working with

sayakpaul · 2023-02-20T08:18:10Z

Thanks so much for your hard work, @HimariO! Your questions are quite valid. Do you think the design philosophy of how integrated LoRA into diffusers would be of help (PR)? I mentioned it because LoRA also falls under the adapter series of neural nets.

Let me see what other members think.

Cc: @patil-suraj @patrickvonplaten @williamberman

HimariO · 2023-02-21T17:27:36Z

@sayakpaul, thank you for directing me to the LoRA PR. It has been very helpful in giving me a general idea of the design philosophy of similar features. After reviewing the PR for LoRA and the draft PR for ControlNet, I believe we can create a more versatile API that can support T2I-Adapter, ControlNet, and other similar modules that have independent input and will inject the output into diffusion model. PoC can be found here.

haofanwang · 2023-02-21T18:27:42Z

Agree. As T2I and ControlNet (they share similar designs) both require some changes of UNet, more similar pipelines in the future may lead to crash. It is necessary to consider how to efficiently merge them into one framework.

sayakpaul · 2023-02-22T02:46:08Z

@HimariO I left a comment directly on your commit. Thanks so much!

We usually consider the code-level impact we might have before accommodating a large change in the API. So, I request @patil-suraj @williamberman @patrickvonplaten @yiyixuxu to chime in here too.

Note that this is a lighter-than-usual week for us, so there might be some delay in our response.

takuma104 · 2023-02-22T16:43:03Z

My PR for ControlNet (#2407) has been open for a while now. I am also in favor of the Sideload-related changes. The T2I-Adapter and ControlNet share a similar concept in that they both interfere with UNet. I think that the Sideload concept could be a common foundation and have good potential for future extensions. (I have left a comment on the ControlNet thread.)

williamberman · 2023-02-23T21:06:20Z

My understanding from a preliminary read of the t2i adapter paper is that the outputs from the adapter model are just added with the intermediate features of the encoder of the unet. This shouldn't require any hacking of the existing block definitions and could be done just by passing the outputs of the adapter to the forward method of the unet.

HimariO · 2023-02-25T16:43:05Z

@williamberman your understanding is correct, and what you describe is exactly what I do with my first prototype, The main motivation for trying out new concepts like sideloading is to avoid modifying every sub-module the adapter/controlnet-like model interacts with, especially when those modules are buried deep in the module hierarchy or there are different adapter variation targeting different modules.

sayakpaul · 2023-02-28T11:24:01Z

Thanks for thinking this through @HimariO! Let us know whenever you're read with a PR and / or if you need any help.

AK391 · 2023-02-28T23:26:43Z

related: https://github.com/cloneofsimo/t2i-adapter-diffusers

HimariO · 2023-03-01T04:10:37Z

Hi @sayakpaul, just a quick note to let you know that I'm planning on creating the PR this week, and I'll let you know if there are any design-related issues that require further discussion. Thanks!

github-actions · 2023-03-25T15:03:59Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

This was referenced Feb 21, 2023

[Pipelines] Add a ControlNet pipeline #2331

Closed

[Pipelines] Add a T2I-Adapter pipeline #2437

Closed

HimariO mentioned this issue Mar 5, 2023

Add T2I-Adapter model and pipeline #2555

Closed

8 tasks

github-actions bot added the stale Issues that haven't received updates label Mar 25, 2023

github-actions bot closed this as completed Apr 4, 2023

sayakpaul reopened this Apr 4, 2023

github-actions bot closed this as completed Apr 13, 2023

geroldmeisinger mentioned this issue Nov 2, 2023

How to combine t2i module in StableDiffusionControlNetPipeline lllyasviel/ControlNet#570

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Pipelines] Support for T2I-Adapter #2390

[Pipelines] Support for T2I-Adapter #2390

wfng92 commented Feb 17, 2023

HimariO commented Feb 19, 2023 •

edited

Loading

sayakpaul commented Feb 20, 2023

HimariO commented Feb 21, 2023 •

edited

Loading

haofanwang commented Feb 21, 2023 •

edited

Loading

sayakpaul commented Feb 22, 2023

takuma104 commented Feb 22, 2023

williamberman commented Feb 23, 2023

HimariO commented Feb 25, 2023 •

edited

Loading

sayakpaul commented Feb 28, 2023

AK391 commented Feb 28, 2023

HimariO commented Mar 1, 2023

github-actions bot commented Mar 25, 2023

[Pipelines] Support for T2I-Adapter #2390

[Pipelines] Support for T2I-Adapter #2390

Comments

wfng92 commented Feb 17, 2023

Model/Pipeline/Scheduler description

Open source status

Provide useful links for the implementation

HimariO commented Feb 19, 2023 • edited Loading

sayakpaul commented Feb 20, 2023

HimariO commented Feb 21, 2023 • edited Loading

haofanwang commented Feb 21, 2023 • edited Loading

sayakpaul commented Feb 22, 2023

takuma104 commented Feb 22, 2023

williamberman commented Feb 23, 2023

HimariO commented Feb 25, 2023 • edited Loading

sayakpaul commented Feb 28, 2023

AK391 commented Feb 28, 2023

HimariO commented Mar 1, 2023

github-actions bot commented Mar 25, 2023

HimariO commented Feb 19, 2023 •

edited

Loading

HimariO commented Feb 21, 2023 •

edited

Loading

haofanwang commented Feb 21, 2023 •

edited

Loading

HimariO commented Feb 25, 2023 •

edited

Loading