
Conversation

@junejae
Contributor

@junejae junejae commented Sep 9, 2024

What does this PR do?

Add T5 GGUF loading support

Due to the nature of T5's architecture, I decided to replicate gguf's conversion logic, so the final code is a bit messy.
I tried to avoid any logical conflicts between T5 and the existing model architectures, but feel free to edit the code if you find any mistakes I haven't noticed.

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@SunMarc @LysandreJik @ArthurZucker , could you please review this PR?

@bonlime

bonlime commented Sep 11, 2024

Does your code allow loading T5 Encoder XXL (the one used in Flux)?
Example files here:
https://huggingface.co/city96/t5-v1_1-xxl-encoder-gguf/tree/main

@junejae
Contributor Author

junejae commented Sep 11, 2024

@bonlime Yes, it works with the AutoModelForTextEncoding class. I've tested with the T5 encoder from the exact repo you linked, but I didn't commit a separate test block for the T5 encoder, since the code would get messy with an embedding-vector-style example output.

Member

@SunMarc SunMarc left a comment


Thanks for your work @junejae, and sorry for the delay! Just a few nits. There are also a few merge conflicts; can you fix those as well?

@junejae junejae requested a review from SunMarc October 3, 2024 11:37
@junejae
Contributor Author

junejae commented Oct 3, 2024

@SunMarc
I've resolved conflicts and added more tests. Could you please review it again?

@SunMarc
Member

SunMarc commented Oct 3, 2024

Make sure to fix the CI as well!

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@junejae junejae requested a review from SunMarc October 23, 2024 02:52
Member

@SunMarc SunMarc left a comment


Nice, thanks a lot !

@SunMarc SunMarc requested a review from LysandreJik October 23, 2024 12:24
Member

@LysandreJik LysandreJik left a comment


Boom, awesome! Thanks @junejae

@SunMarc SunMarc merged commit dd267fc into huggingface:main Oct 24, 2024
BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
* add: GGUFT5Converter

* add: tensormapping for t5

* add: test code for t5

* fix: Remove whitespace from blank line

* add: t5 fp16 tests

* fix: whitespace formatting

* fix: minor formatting

* fix: testing every weights
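The commit list above mentions a tensor mapping for T5. As a rough illustration of what such a mapping does (the actual table lives in transformers' GGUF integration; the GGUF tensor names below are assumptions based on llama.cpp's T5 export and may not match it exactly), a GGUF-to-transformers rename could be sketched as:

```python
import re

# Hypothetical sketch: translate llama.cpp-style GGUF tensor names for a
# T5 encoder into transformers' T5 parameter names. The exact GGUF names
# here are assumptions, not the mapping shipped in transformers.
DIRECT = {
    "token_embd.weight": "shared.weight",
    "enc.output_norm.weight": "encoder.final_layer_norm.weight",
}
BLOCK_RULES = [
    (r"enc\.blk\.(\d+)\.attn_q\.weight", r"encoder.block.\1.layer.0.SelfAttention.q.weight"),
    (r"enc\.blk\.(\d+)\.attn_k\.weight", r"encoder.block.\1.layer.0.SelfAttention.k.weight"),
    (r"enc\.blk\.(\d+)\.attn_v\.weight", r"encoder.block.\1.layer.0.SelfAttention.v.weight"),
    (r"enc\.blk\.(\d+)\.attn_o\.weight", r"encoder.block.\1.layer.0.SelfAttention.o.weight"),
    (r"enc\.blk\.(\d+)\.attn_norm\.weight", r"encoder.block.\1.layer.0.layer_norm.weight"),
    (r"enc\.blk\.(\d+)\.ffn_norm\.weight", r"encoder.block.\1.layer.1.layer_norm.weight"),
]

def rename(gguf_name: str) -> str:
    """Map one GGUF tensor name onto its transformers equivalent."""
    if gguf_name in DIRECT:
        return DIRECT[gguf_name]
    for pattern, repl in BLOCK_RULES:
        new_name, count = re.subn(pattern, repl, gguf_name)
        if count:
            return new_name
    raise KeyError(f"unmapped tensor: {gguf_name}")
```

The per-block regex rules keep the mapping compact: the block index is captured once and re-used on the transformers side, rather than enumerating every layer.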
@mirix

mirix commented Mar 12, 2025

Is the commit merged? I still get the error:

ValueError: GGUF model with architecture t5encoder is not supported yet.

import torch
from transformers import T5EncoderModel

t5_gguf = 'city96/t5-v1_1-xxl-encoder-gguf'
t5_file = 't5-v1_1-xxl-encoder-Q8_0.gguf'

text_encoder_2 = T5EncoderModel.from_pretrained(
    t5_gguf,
    gguf_file=t5_file,
    torch_dtype=torch.bfloat16,
)

@SunMarc
Member

SunMarc commented Mar 13, 2025

cc @Isotr0py

@Isotr0py Isotr0py mentioned this pull request Mar 13, 2025
@SlimRG

SlimRG commented Aug 10, 2025

Please add this example to the comments in UMT5EncoderModel's code.
Using from_pretrained instead of from_single_file was new to me.

Is the commit merged? I still get the error:

ValueError: GGUF model with architecture t5encoder is not supported yet.

t5_gguf = 'city96/t5-v1_1-xxl-encoder-gguf'
t5_file = 't5-v1_1-xxl-encoder-Q8_0.gguf'

text_encoder_2 = T5EncoderModel.from_pretrained(
    t5_gguf,
    gguf_file=t5_file,
    torch_dtype=torch.bfloat16,
)

@zwukong

zwukong commented Dec 19, 2025

Too slow 😭. It needs to convert first:
Converting and de-quantizing GGUF tensors...: 100%|██████████████████████████████████████████████████████| 219/219 [00:56<00:00, 3.89it/s]

@SunMarc
Member

SunMarc commented Dec 19, 2025

Yeah, we need to make this multi-threaded to go faster ;D
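As a rough sketch of what "multi-threaded" could mean here (this is not transformers' actual loader; it only assumes the documented GGUF Q8_0 layout of one fp16 scale followed by 32 int8 quants per block, de-quantized as `scale * q`):

```python
import struct
from concurrent.futures import ThreadPoolExecutor

# Hedged sketch: GGUF Q8_0 packs weights into 34-byte blocks
# (2-byte little-endian fp16 scale + 32 signed int8 quants).
# De-quantization is independent per block, so it parallelizes trivially.
BLOCK_BYTES = 34

def dequant_block(block: bytes) -> list[float]:
    """De-quantize one Q8_0 block: value = scale * int8_quant."""
    scale = struct.unpack("<e", block[:2])[0]
    quants = struct.unpack("<32b", block[2:])
    return [scale * q for q in quants]

def dequant_q8_0(raw: bytes, workers: int = 8) -> list[float]:
    """De-quantize a Q8_0 tensor buffer using a thread pool."""
    blocks = [raw[i:i + BLOCK_BYTES] for i in range(0, len(raw), BLOCK_BYTES)]
    out: list[float] = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for vals in pool.map(dequant_block, blocks):
            out.extend(vals)
    return out
```

Caveat: pure-Python threads like this won't actually beat a single thread because of the GIL; a real speedup would need the per-block kernel to release the GIL (e.g. numpy or C code), the sketch only shows where the parallelism lives.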

@zwukong

zwukong commented Dec 19, 2025

ComfyUI is much faster; it doesn't need to convert on each run. For some reason, I have to test GGUF in diffusers 😄

