System Info
Two issues:
- It seems the current implementation of the FLOPs counter accumulates the total FLOPs across all training steps, which causes the TFLOPS/s/GPU metric to go up as the number of training steps increases (a minimal sketch of the suspected behavior follows this list; reference: flops counter error with PyTorch 2.5 and 2.6, pytorch/pytorch#145947 (comment)).
- Actually, this one is a question: in the training config, what is the impact of the default num_freeze_layers: int = 1 on the mllama model (Llama 3.2 11B Vision)? I suspect it may be contributing to the underestimate of total model TFLOPS (see the layer-freezing sketch below).
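
For the first issue, here is a minimal sketch of the suspected accumulation behavior, using torch.utils.flop_counter.FlopCounterMode directly rather than the repo's wrapper; the toy model and the metric arithmetic are illustrative assumptions, not the actual training-loop code.

```python
# Minimal sketch of the suspected bug: FlopCounterMode.get_total_flops() is
# cumulative across steps, but the timing is per step, so the ratio grows.
# The toy model and loop below are illustrative, not the repo's training loop.
import time

import torch
from torch.utils.flop_counter import FlopCounterMode

model = torch.nn.Linear(1024, 1024)
inputs = torch.randn(8, 1024)

flop_counter = FlopCounterMode(display=False)
with flop_counter:
    for step in range(3):
        t0 = time.time()
        model(inputs).sum().backward()
        elapsed = time.time() - t0
        # BUG: total FLOPs over *all* steps so far divided by *this* step's
        # time -> the reported TFLOPS/s roughly doubles, triples, ... per step.
        tflops_per_s = flop_counter.get_total_flops() / elapsed / 1e12
        print(f"step {step}: {tflops_per_s:.6f} TFLOPS/s (inflated)")
```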
Thank you!
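
Regarding the second point, a hedged sketch of how a num_freeze_layers-style option is commonly implemented; the helper name and the `model.layers` attribute are assumptions for illustration, not the repo's actual code.

```python
import torch


def freeze_first_n_layers(model: torch.nn.Module, num_freeze_layers: int) -> None:
    """Hypothetical helper: disable gradients for the first N transformer blocks.

    Assumes the blocks live at `model.layers`; the real mllama / llama-recipes
    code may organize and name things differently.
    """
    for i, layer in enumerate(model.layers):
        if i < num_freeze_layers:
            for param in layer.parameters():
                param.requires_grad = False
```

If the FLOPs counter only observes ops from the executed backward pass, frozen layers contribute no weight-gradient FLOPs, which could plausibly show up as a lower total-model TFLOPS estimate than full fine-tuning.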
Information
- The official example scripts
- My own modified scripts
🐛 Describe the bug
Steps to reproduce can be found in this ticket: pytorch/pytorch#145947 (comment)
Error logs
Explained above and in this ticket: pytorch/pytorch#145947 (comment)
Expected behavior
FLOPs should be counted per training step (not accumulated across the whole run), so the TFLOPS/s/GPU metric stays roughly constant as training progresses.
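
A hedged sketch of what the expected per-step metric could look like, tracking the delta of the cumulative counter each step; same illustrative toy model as above, not the repo's actual code.

```python
# Sketch of the expected behavior: derive per-step FLOPs as a delta of the
# cumulative counter so TFLOPS/s stays roughly constant across steps.
import time

import torch
from torch.utils.flop_counter import FlopCounterMode

model = torch.nn.Linear(1024, 1024)
inputs = torch.randn(8, 1024)

flop_counter = FlopCounterMode(display=False)
prev_total = 0
with flop_counter:
    for step in range(3):
        t0 = time.time()
        model(inputs).sum().backward()
        elapsed = time.time() - t0
        total = flop_counter.get_total_flops()
        flops_this_step = total - prev_total  # per-step delta, not cumulative
        prev_total = total
        tflops_per_s = flops_this_step / elapsed / 1e12
        print(f"step {step}: {tflops_per_s:.6f} TFLOPS/s (steady)")
```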