-
Notifications
You must be signed in to change notification settings - Fork 308
update TensorFlow easyblock for CUDA support in TensorFlow 2.18+ #3765
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update TensorFlow easyblock for CUDA support in TensorFlow 2.18+ #3765
Conversation
|
As for building TensorFlow 2.18+ with our CUDA: They don't support that (anymore) officially and strongly suggest to use the "hermetic" one, i.e. let Bazel download it during build. They argue that the build already "downloads half the internet" so one more doesn't hurt and they use checksums too for verification. Would that be acceptable for us or shall we still pursue using our CUDA? See easybuilders/easybuild-easyconfigs#22921 (comment) Edit: Solution implemented in #3791 : Symlink CUPTI files in CUDA module so they will be found |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 0 out of 1 (1 easyconfigs in total) |
|
@lexming Can you search the log for the error? With easybuilders/easybuild-framework#4942 the test report would likely contain it so maybe we can that in soon |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 0 out of 1 (1 easyconfigs in total) |
|
@Flamefire my tests on 2022a and 2023a failed due to linking issues with OpenSSL in the system: This error is not caused by this PR though. The problem is that these old toolchains use OpenSSL v1.1, while my system (Rocky 9) has OpenSSL v3 and just some compat libs with OpenSSL v1.1. This means that the headers for OpenSSL under So, rebuilding OpenSSL v1.1 from source and testing again... |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 1 out of 1 (1 easyconfigs in total) |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 1 out of 1 (1 easyconfigs in total) |
|
Test report by @lexming Overview of tested easyconfigs (in order)
Build succeeded for 1 out of 1 (1 easyconfigs in total) |
lexming
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
Merging, thanks @Flamefire ! |
(created using
eb --new-pr)A few variables were renamed or removed. Most importantly are the
LOCAL_*variables to find the installed CUDA.I also upgraded the warning of unknown variables "passed" to configure to a more visible one as that was easy to miss
Requires rebuild of CUDA with
Test report: easybuilders/easybuild-easyconfigs#22921 (comment)