Add image classifier donut & update loss calculation for all swins #37224

eljandoubi · 2025-04-02T22:47:36Z

What does this PR do?

Add classifier head to Donut
Add image classifier loss to LOSS_MAPPING
Update classifier loss for all swin models

Models:

vision models: @amyeroberts, @qubvel

github-actions · 2025-04-02T22:47:47Z

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

src/transformers/models/donut/modeling_donut_swin.py

zucchini-nlp · 2025-04-03T07:48:14Z

src/transformers/models/donut/modeling_donut_swin.py

 _EXPECTED_OUTPUT_SHAPE = [1, 49, 768]

+# Image classification docstring
+_IMAGE_CLASS_CHECKPOINT = "eljandoubi/donut-base-encoder"


is this a checkpoint trained for classification or base model ckpt?

Traind base model with randomly initialized head. It is meant for further fine tuning.

zucchini-nlp

Thanks, looks good to me! Requesting review from core maintainer

eljandoubi · 2025-04-08T18:07:04Z

@zucchini-nlp @ArthurZucker @ydshieh any feedback?

zucchini-nlp · 2025-04-10T12:59:57Z

sorry, let's merge it. Doesn't touch much core code, should be good to go

…uggingface#37224) * add classifier head to donut * add to transformers __init__ * add to auto model * fix typo * add loss for image classification * add checkpoint * remove no needed import * reoder import * format * consistency * add test of classifier * add doc * try ignore * update loss for all swin models

eljandoubi and others added 9 commits April 2, 2025 21:39

add classifier head to donut

043e8de

add to transformers __init__

cca3cb4

add to auto model

8eac550

fix typo

32fc7d8

add loss for image classification

cced1c1

Merge branch 'huggingface:main' into add_image_classif_donut

a9d7e42

add checkpoint

2a3d02c

remove no needed import

00c9ba6

reoder import

f1d5dd0

github-actions bot marked this pull request as draft April 2, 2025 22:47

eljandoubi added 4 commits April 3, 2025 00:58

format

fc54637

consistency

f0f215e

add test of classifier

07148f6

add doc

7b0c45d

eljandoubi marked this pull request as ready for review April 3, 2025 06:29

github-actions bot requested review from ydshieh and zucchini-nlp April 3, 2025 06:29

zucchini-nlp reviewed Apr 3, 2025

View reviewed changes

eljandoubi added 2 commits April 3, 2025 11:15

try ignore

d5d501e

update loss for all swin models

675c2c1

eljandoubi changed the title ~~Add image classifier donut~~ Add image classifier donut & update loss calculation for all swins Apr 3, 2025

eljandoubi added 2 commits April 3, 2025 16:12

Merge branch 'main' into add_image_classif_donut

cb045a7

Merge branch 'main' into add_image_classif_donut

022590c

zucchini-nlp approved these changes Apr 3, 2025

View reviewed changes

zucchini-nlp requested a review from ArthurZucker April 3, 2025 15:45

zucchini-nlp merged commit 7ecc5b8 into huggingface:main Apr 10, 2025
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add image classifier donut & update loss calculation for all swins #37224

Add image classifier donut & update loss calculation for all swins #37224

Uh oh!

eljandoubi commented Apr 2, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Apr 2, 2025

Uh oh!

Uh oh!

zucchini-nlp Apr 3, 2025

Uh oh!

eljandoubi Apr 3, 2025 •

edited

Loading

Uh oh!

zucchini-nlp left a comment

Uh oh!

eljandoubi commented Apr 8, 2025 •

edited

Loading

Uh oh!

zucchini-nlp commented Apr 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add image classifier donut & update loss calculation for all swins #37224

Add image classifier donut & update loss calculation for all swins #37224

Uh oh!

Conversation

eljandoubi commented Apr 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

github-actions bot commented Apr 2, 2025

Uh oh!

Uh oh!

zucchini-nlp Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

eljandoubi Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp left a comment

Choose a reason for hiding this comment

Uh oh!

eljandoubi commented Apr 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zucchini-nlp commented Apr 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

eljandoubi commented Apr 2, 2025 •

edited

Loading

eljandoubi Apr 3, 2025 •

edited

Loading

eljandoubi commented Apr 8, 2025 •

edited

Loading