Pass num_items_in_batch directly to loss computation #36753
Conversation
Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).
qubvel left a comment:
Thank you @eljandoubi for opening a PR and fixing the issue! Just a small nit
```python
loss = fixed_cross_entropy(
    logits.reshape(-1, self.decoder.config.vocab_size),
    labels.reshape(-1),
    num_items_in_batch=num_items_in_batch,
)
```
We can use `self.loss_function` instead, see llama for example. All reshape/view ops will happen under the hood:
```python
loss = self.loss_function(logits=logits, labels=labels, vocab_size=self.config.vocab_size, num_items_in_batch=num_items_in_batch)
```
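(For readers following along: below is a simplified sketch of what `self.loss_function` typically does under the hood for a causal LM head in transformers — the shift/flatten ops plus the num_items_in_batch-aware normalization. Names follow transformers' loss utilities; exact details vary by version and model, and the shift step applies to causal LM heads specifically.)

```python
import torch.nn.functional as F

def fixed_cross_entropy(source, target, num_items_in_batch=None, ignore_index=-100):
    # With gradient accumulation, the Trainer supplies the total number of
    # label tokens across all micro-batches; summing and dividing by that
    # count gives the correct normalization (a per-micro-batch mean would not).
    reduction = "sum" if num_items_in_batch is not None else "mean"
    loss = F.cross_entropy(source, target, ignore_index=ignore_index, reduction=reduction)
    if num_items_in_batch is not None:
        loss = loss / num_items_in_batch
    return loss

def causal_lm_loss(logits, labels, vocab_size, num_items_in_batch=None):
    # Shift so tokens < n predict token n, then flatten -- these are the
    # reshape/view ops that "happen under the hood" in self.loss_function.
    shift_logits = logits[..., :-1, :].contiguous()
    shift_labels = labels[..., 1:].contiguous()
    return fixed_cross_entropy(
        shift_logits.view(-1, vocab_size),
        shift_labels.view(-1),
        num_items_in_batch=num_items_in_batch,
    )
```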
@qubvel like this?
A bit different: there is no need to make a view while passing to the function, see my example above.
@qubvel I've updated my code; what do you think about it now?
Thanks @eljandoubi
* Pass num_items_in_batch directly to loss computation
* use self loss instead
* fix loss kwargs
* fix vocab size
What does this PR do?
Fixes #36744
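For context, num_items_in_batch exists so the loss is normalized by the total number of label tokens in the full batch rather than per micro-batch — the standard motivation for this parameter in transformers under gradient accumulation. A minimal numeric illustration (mine, not code from this PR):

```python
import torch

# Two gradient-accumulation micro-batches with different numbers of
# label tokens (e.g. after padding/masking): 3 tokens vs. 1 token.
micro_batch_losses = [torch.tensor([2.0, 2.0, 2.0]), torch.tensor([6.0])]

# Without num_items_in_batch: mean per micro-batch, then averaged
# across accumulation steps -> (2.0 + 6.0) / 2 = 4.0
naive = torch.stack([l.mean() for l in micro_batch_losses]).mean()

# With num_items_in_batch: sum all per-token losses and divide by the
# total token count -> 12.0 / 4 = 3.0, the true per-token mean.
num_items_in_batch = sum(l.numel() for l in micro_batch_losses)
correct = torch.cat(micro_batch_losses).sum() / num_items_in_batch

print(naive.item(), correct.item())  # 4.0 3.0
```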
Models: