Skip to content

Conversation

@vkuzo
Copy link
Contributor

@vkuzo vkuzo commented Nov 12, 2025

Summary:

This is needed for LLaMa 4 MoE model weight loading,
such as here:
https://github.com/vllm-project/vllm/blob/34553b9d2702dd2a27a578fec819e88e76dcbfb4/vllm/model_executor/models/llama4.py#L455

Test Plan:

pytest test/quantization/quantize_/workflows/float8/test_float8_tensor.py -s -x -k chunk

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]
@vkuzo
Copy link
Contributor Author

vkuzo commented Nov 12, 2025

Stack from ghstack (oldest at bottom):

@pytorch-bot
Copy link

pytorch-bot bot commented Nov 12, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3334

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 027eadf with merge base b5587e4 (image):

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vkuzo added a commit that referenced this pull request Nov 12, 2025
Summary:

This is needed for LLaMa 4 MoE model weight loading,
such as here:
https://github.com/vllm-project/vllm/blob/34553b9d2702dd2a27a578fec819e88e76dcbfb4/vllm/model_executor/models/llama4.py#L455

Test Plan:

```
pytest test/quantization/quantize_/workflows/float8/test_float8_tensor.py -s -x -k chunk
```

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 843a40b
ghstack-comment-id: 3523627462
Pull-Request: #3334
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 12, 2025
[ghstack-poisoned]
vkuzo added a commit that referenced this pull request Nov 12, 2025
Summary:

This is needed for LLaMa 4 MoE model weight loading,
such as here:
https://github.com/vllm-project/vllm/blob/34553b9d2702dd2a27a578fec819e88e76dcbfb4/vllm/model_executor/models/llama4.py#L455

Test Plan:

```
pytest test/quantization/quantize_/workflows/float8/test_float8_tensor.py -s -x -k chunk
```

Reviewers:

Subscribers:

Tasks:

Tags:
ghstack-source-id: 54dd86f
ghstack-comment-id: 3523627462
Pull-Request: #3334
@vkuzo vkuzo added the topic: improvement Use this tag if this PR is an improvement (doesn't fit into any of the other categories) label Nov 12, 2025
@implements(aten.split.Tensor)
def _(func, types, args, kwargs):
tensor, split_size_or_sections, dim = args
assert isinstance(split_size_or_sections, int), "unimplemented"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: better error message?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. topic: improvement Use this tag if this PR is an improvement (doesn't fit into any of the other categories)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants