[ROCm] update rocm to 6.4.2 #114

Merged
YanWenKun merged 1 commit into YanWenKun:main from SergeyFilippov:rocm-6.4.2 on Jul 24, 2025

@SergeyFilippov (Contributor) commented Jul 24, 2025

Update ROCm dependencies to the latest stable version.

Highlights:

  • This release adds experimental support for "Memory Efficient Flash Attention"
    (noticeable performance improvements starting with the second run of a workflow).

    To enable it, add this to the command line:

    -e TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1
  • It also adds official support for RDNA 4 (9060/9070) (link)
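
The `-e` form above is the syntax `docker run` uses to pass an environment variable into a container. A minimal sketch of a full invocation, assuming a placeholder image name (`your-rocm-image:tag` is hypothetical, not a tag published by this repository) and the usual ROCm device mappings (`/dev/kfd` and `/dev/dri`). The command is printed rather than executed, so it can be reviewed first:

```shell
#!/bin/sh
# Placeholder image name (an assumption for illustration); substitute the
# ROCm image you actually run.
IMAGE="${IMAGE:-your-rocm-image:tag}"

# Print a `docker run` command that exposes the AMD GPU devices and sets the
# experimental flag from this PR inside the container.
rocm_run_cmd() {
    printf '%s\n' "docker run -it --rm --device=/dev/kfd --device=/dev/dri -e TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 $IMAGE"
}

rocm_run_cmd
```

Any port or volume options the image needs would go alongside the `-e` flag; they are left out here because they depend on the setup.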

@YanWenKun (Owner)

Thank you! Merging :-)
I'll update the doc later, and test torch 2.7.1 + rocm 6.4.2.

@YanWenKun merged commit 71cff27 into YanWenKun:main on Jul 24, 2025
@YanWenKun (Owner)

It seems that TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1 is no longer necessary since PyTorch 2.4:

https://github.com/Comfy-Org/docs/blob/main/troubleshooting/overview.mdx

But in this doc I found another env var that could tweak performance:

# Enable tunable operations (slow first run, but faster subsequent runs)
PYTORCH_TUNABLEOP_ENABLED=1 python main.py
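
The one-line form above sets the variable for that single `python` process only. A small sketch of that scoping, using a throwaway `sh -c` child in place of `python main.py` so it runs anywhere. It demonstrates shell behaviour only and says nothing about the speed-up itself:

```shell
#!/bin/sh
# Inline assignment: the variable is placed in the environment of the one
# command it prefixes (a stand-in child shell here, not the real main.py).
child_value=$(PYTORCH_TUNABLEOP_ENABLED=1 sh -c 'printf %s "${PYTORCH_TUNABLEOP_ENABLED:-unset}"')
echo "child process sees: $child_value"

# The calling shell itself is unchanged by the inline form.
echo "calling shell sees: ${PYTORCH_TUNABLEOP_ENABLED:-unset}"

# To keep it set for the rest of the session instead, export it:
# export PYTORCH_TUNABLEOP_ENABLED=1
```

For a container launch, the equivalent would be an extra `-e PYTORCH_TUNABLEOP_ENABLED=1` on the `docker run` command line.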

@SergeyFilippov deleted the rocm-6.4.2 branch on January 16, 2026 20:36