Use env. allocators for initializers (#25108) #25281

AndreyOrb · 2025-07-03T17:11:36Z

Description

Pass environment allocators into the session state, if the "session.use_env_allocators" flag was activated (#25108)

Motivation and Context

Initializers use session-local allocators even if env. allocators to be used.

…se_env_allocators" flag was activated (microsoft#25108)

tianleiwu · 2025-07-07T20:55:58Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2025-07-07T20:56:18Z

Azure Pipelines successfully started running 5 pipeline(s).

AndreyOrb · 2025-07-08T12:33:58Z

@tianleiwu Could you rerun the failed test, please?

fs-eire · 2025-07-08T20:59:12Z

I triggered the re-run. If the error still occur, need to investigate why it happened. Error message seems showing it's related to the change:

onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRuntimeError] : 1 : FAIL : WebGPU validation failed. [Buffer (unlabeled)] used in submit while mapped.
 - While calling [Queue].Submit([[CommandBuffer]])

The debug build passes because test case onnx_backend_test_series.py does not run on a Debug build.

AndreyOrb · 2025-07-10T14:04:47Z

@fs-eire @tianleiwu
Is there a way to run this specific test locally without running all tests?

tianleiwu · 2025-07-10T17:09:21Z

@fs-eire @tianleiwu Is there a way to run this specific test locally without running all tests?

You can specify a test name like

python onnx_backend_test_series.py -t test_affine_grid_2d_align_corners_expanded_cpu

AndreyOrb · 2025-07-10T18:54:57Z

Thanks. I'm still working on setting up the env. to check the issue.
Do I have to build the --build_wheel for this test, or --use_webgpu is enough?

I'm currently building with
E:\3rdParties\onnxruntime_v1.22.0>.\build.bat --update --config Debug --build_dir ./build_web --parallel --use_binskim_compliant_compile_flags --build_shared_lib --use_webgpu --cmake_generator "Visual Studio 17 2022" --compile_no_warning_as_error --cmake_path E:\3rdParties\cmake-4.0.3\build\bin\Release\cmake.exe --windows_sdk_version 10.0.26100.0

qjia7 · 2025-07-11T10:00:43Z

I triggered the re-run. If the error still occur, need to investigate why it happened. Error message seems showing it's related to the change:
onnxruntime.capi.onnxruntime_pybind11_state.Fail: [ONNXRuntimeError] : 1 : FAIL : WebGPU validation failed. [Buffer (unlabeled)] used in submit while mapped.
 - While calling [Queue].Submit([[CommandBuffer]])
The debug build passes because test case onnx_backend_test_series.py does not run on a Debug build.

I remember I ever met a similar error for webgpu. The error was that it went to UMA path after the session initialization which is not expected. The UMA path should only work for the weights uploading since it's in a mapped state. The reason that I went wrongly into UMA path is that I didn't use the session's default allocator which can correctly record whether the session initialization is finished. I used a new webgpu allocator which's session_initialized_ is false but the session has finished the initialization. That's why it went to the UMA path. After switching to the the session's allocator, that issue was resolved. Just for your reference for my case.
You can simply to comment out https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/core/providers/webgpu/allocator.cc#L19-L21 to see whether your issue still exists.

AndreyOrb · 2025-07-11T13:50:00Z

@qjia7 Thanks a lot for pointing this out!
@fs-eire @tianleiwu Is there a way to run the failed test in VS in c++ directly? It will help me in debugging the issue.
onnxruntime_test_all.exe --gtest_filter= ???

AndreyOrb · 2025-07-11T16:21:10Z

I see now that all c++ tests have passed, but the python tests have failed.
So, the onnxruntime_test_all.exe will not help me.

Is there any way to debug the c++ issue when running from the onnx_backend_test_series.py?

yuslepukhin · 2025-07-11T16:53:34Z

I see now that all c++ tests have passed, but the python tests have failed. So, the onnxruntime_test_all.exe will not help me.

Is there any way to debug the c++ issue when running from the onnx_backend_test_series.py?

You can do mixed debugging using Python C++ Debugger extension for VS Code.

AndreyOrb · 2025-07-11T16:55:49Z

Thanks, Dmitri, will try.
I'm using VS. As it turns out, VS also has this capability: https://learn.microsoft.com/en-us/visualstudio/python/debugging-mixed-mode-c-cpp-python-in-visual-studio?view=vs-2022

Pass environment allocators into the session state, if the "session.u…

0b6fb04

…se_env_allocators" flag was activated (microsoft#25108)

tianleiwu requested a review from yuslepukhin July 7, 2025 19:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use env. allocators for initializers (#25108) #25281

Use env. allocators for initializers (#25108) #25281

AndreyOrb commented Jul 3, 2025

Uh oh!

tianleiwu commented Jul 7, 2025

Uh oh!

azure-pipelines bot commented Jul 7, 2025

Uh oh!

AndreyOrb commented Jul 8, 2025

Uh oh!

fs-eire commented Jul 8, 2025

Uh oh!

AndreyOrb commented Jul 10, 2025

Uh oh!

tianleiwu commented Jul 10, 2025

Uh oh!

AndreyOrb commented Jul 10, 2025

Uh oh!

qjia7 commented Jul 11, 2025

Uh oh!

AndreyOrb commented Jul 11, 2025 •

edited

Loading

Uh oh!

AndreyOrb commented Jul 11, 2025

Uh oh!

yuslepukhin commented Jul 11, 2025

Uh oh!

AndreyOrb commented Jul 11, 2025 •

edited

Loading

Uh oh!

Uh oh!

Use env. allocators for initializers (#25108) #25281

Are you sure you want to change the base?

Use env. allocators for initializers (#25108) #25281

Conversation

AndreyOrb commented Jul 3, 2025

Description

Motivation and Context

Uh oh!

tianleiwu commented Jul 7, 2025

Uh oh!

azure-pipelines bot commented Jul 7, 2025

Uh oh!

AndreyOrb commented Jul 8, 2025

Uh oh!

fs-eire commented Jul 8, 2025

Uh oh!

AndreyOrb commented Jul 10, 2025

Uh oh!

tianleiwu commented Jul 10, 2025

Uh oh!

AndreyOrb commented Jul 10, 2025

Uh oh!

qjia7 commented Jul 11, 2025

Uh oh!

AndreyOrb commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AndreyOrb commented Jul 11, 2025

Uh oh!

yuslepukhin commented Jul 11, 2025

Uh oh!

AndreyOrb commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

AndreyOrb commented Jul 11, 2025 •

edited

Loading

AndreyOrb commented Jul 11, 2025 •

edited

Loading