test(autoware_tensorrt_plugins): add reference kernel tests by mojomex · Pull Request #12561 · autowarefoundation/autoware_universe

mojomex · 2026-05-08T05:37:25Z

Summary

Adds CUDA kernel vs. known-good CPU reference tests for autoware_tensorrt_plugins:

verifies argsort output against a CPU stable-sort reference
verifies unique values, inverse indices, and counts against a CPU reference

Compatibility note

While validating the test, the unique counts check exposed an existing sentinel write bug, so this PR includes the minimal fix needed for the new test to pass: write the final range sentinel to range_ptr + num_out, not range_ptr + num_out * sizeof(int64_t) (range_ptr is an int64_t*).

See

fix(autoware_tensorrt_plugins): correct CustomUnique count offset #12502

for a standalone version of that fix.

Validation

colcon build --packages-up-to autoware_tensorrt_plugins && \
colcon test-result --delete-yes &&
colcon test --packages-select autoware_tensorrt_plugins --event-handlers console_cohesion+ && \
colcon test-result --verbose

Look out for:

[==========] Running 2 tests from 1 test suite.
1: [----------] Global test environment set-up.
1: [----------] 2 tests from ReferenceKernelsTest
1: [ RUN      ] ReferenceKernelsTest.ArgsortMatchesCpuReference
1: [       OK ] ReferenceKernelsTest.ArgsortMatchesCpuReference (277 ms)
1: [ RUN      ] ReferenceKernelsTest.UniqueMatchesCpuReference
1: [       OK ] ReferenceKernelsTest.UniqueMatchesCpuReference (1 ms)
1: [----------] 2 tests from ReferenceKernelsTest (278 ms total)
1: 
1: [----------] Global test environment tear-down
1: [==========] 2 tests from 1 test suite ran. (278 ms total)
1: [  PASSED  ] 2 tests.

github-actions · 2026-05-08T05:37:45Z

Thank you for contributing to the Autoware project!

🚧 If your pull request is in progress, switch it to draft mode.

Please ensure:

You've checked our contribution guidelines.
Your PR follows our pull request guidelines.
All required CI checks pass before marking the PR ready for review.

Co-authored-by: Copilot <copilot@github.com> Signed-off-by: Max SCHMELLER <max.schmeller@tier4.jp>

Signed-off-by: Max SCHMELLER <max.schmeller@tier4.jp>

Co-authored-by: Copilot <copilot@github.com> Signed-off-by: Max SCHMELLER <max.schmeller@tier4.jp>

codecov · 2026-05-08T07:23:53Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 0.22%. Comparing base (54af299) to head (5002ac5).

❗ There is a different number of reports uploaded between BASE (54af299) and HEAD (5002ac5). Click for more details.

HEAD has 1 upload less than BASE

Flag BASE (54af299) HEAD (5002ac5)

daily 1 0

Additional details and impacted files

@@             Coverage Diff             @@
##             main   #12561       +/-   ##
===========================================
- Coverage   18.64%    0.22%   -18.42%     
===========================================
  Files        1918       97     -1821     
  Lines      131362     3500   -127862     
  Branches    44502       25    -44477     
===========================================
- Hits        24489        8    -24481     
+ Misses      86760     3491    -83269     
+ Partials    20113        1    -20112

Flag	Coverage Δ
daily	`?`
full-suite	`0.22% <ø> (-18.42%)`	⬇️

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

manato

@mojomex
Thank you very much for improving the unit tests. In terms of test strictness, I left small suggestions. I would appreciate it if you consider them!

manato · 2026-05-08T15:17:26Z

+  copy_to_device(input_d.get(), input);
+
+  ASSERT_EQ(
+    argsort(
+      input_d.get(), output_d.get(), workspace_d.get(), input.size(),
+      get_argsort_workspace_size(input.size()), stream.get()),
+    cudaSuccess);


Suggested change

copy_to_device(input_d.get(), input);

ASSERT_EQ(

argsort(

input_d.get(), output_d.get(), workspace_d.get(), input.size(),

get_argsort_workspace_size(input.size()), stream.get()),

cudaSuccess);

cudaEvent_t copy_event;

cudaEventCreate(&copy_event);

copy_to_device(input_d.get(), input);

cudaEventRecord(copy_event, 0); // record event on the default stream

cudaStreamWaitEvent(stream.get(), copy_event, cudaEventWaitDefault);

ASSERT_EQ(

argsort(

input_d.get(), output_d.get(), workspace_d.get(), input.size(),

get_argsort_workspace_size(input.size()), stream.get()),

cudaSuccess);

According to the memcpy synchronous behavior:

For transfers from pageable host memory to device memory, a stream sync is performed before the copy is initiated. The function will return once the pageable buffer has been copied to the staging memory for DMA transfer to device memory, but the DMA to final destination may not have completed.

Since input is pageable host memory, it would be better to insert explicit synchronization before operating on input_d.

Alternatively, we can skip this kind of explicit synchronization if copy_to_device takes a CUDA stream as an argument and performs cudaMemcpyAsync inside.

manato · 2026-05-08T15:17:26Z

+  DeviceBuffer<std::uint8_t> workspace_d(get_unique_workspace_size(input.size()));
+
+  copy_to_device(input_d.get(), input);
+


same as the the case of argsort-_kernel_test.cpp. better to insert explicit sync

test(autoware_tensorrt_plugins): add reference kernel tests

b13d749

github-project-automation Bot added this to Software Working Group May 8, 2026

github-project-automation Bot moved this to To Triage in Software Working Group May 8, 2026

github-actions Bot added the component:perception Advanced sensor data processing and environment understanding. (auto-assigned) label May 8, 2026

mojomex mentioned this pull request May 8, 2026

perf(autoware_tensorrt_plugins): remove Thrust from sort kernels #12554

Draft

style(pre-commit): autofix

757c360

mojomex self-assigned this May 8, 2026

refactor(autoware_tensorrt_plugins): split tests into logical units

2fa44b3

Co-authored-by: Copilot <copilot@github.com> Signed-off-by: Max SCHMELLER <max.schmeller@tier4.jp>

mojomex added the run:build-and-test-differential Mark to enable build-and-test-differential workflow. (used-by-ci) label May 8, 2026

mojomex and others added 2 commits May 8, 2026 15:21

chore: add include guard

676007a

Co-authored-by: Copilot <copilot@github.com> Signed-off-by: Max SCHMELLER <max.schmeller@tier4.jp>

style(pre-commit): autofix

fd53532

mojomex force-pushed the test/tensorrt-reference-kernels branch from 30a2371 to fd53532 Compare May 8, 2026 06:26

chore: remove unused includes

a46dee5

Signed-off-by: Max SCHMELLER <max.schmeller@tier4.jp>

mojomex marked this pull request as ready for review May 8, 2026 06:29

mojomex requested review from MasatoSaeki, amadeuszsz, knzo25 and ktro2828 as code owners May 8, 2026 06:29

mojomex requested a review from manato May 8, 2026 07:05

test: hopefully made tests GPU-less CI friendly

5002ac5

Co-authored-by: Copilot <copilot@github.com> Signed-off-by: Max SCHMELLER <max.schmeller@tier4.jp>

manato reviewed May 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(autoware_tensorrt_plugins): add reference kernel tests#12561

test(autoware_tensorrt_plugins): add reference kernel tests#12561
mojomex wants to merge 7 commits intoautowarefoundation:mainfrom
mojomex:test/tensorrt-reference-kernels

mojomex commented May 8, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 8, 2026 •

edited

Loading

Uh oh!

codecov Bot commented May 8, 2026

Uh oh!

manato left a comment

Uh oh!

manato May 8, 2026

Uh oh!

manato May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		DeviceBuffer<std::uint8_t> workspace_d(get_unique_workspace_size(input.size()));

		copy_to_device(input_d.get(), input);

Conversation

mojomex commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Compatibility note

Validation

Uh oh!

github-actions Bot commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov Bot commented May 8, 2026

Codecov Report

Uh oh!

manato left a comment

Choose a reason for hiding this comment

Uh oh!

manato May 8, 2026

Choose a reason for hiding this comment

Uh oh!

manato May 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mojomex commented May 8, 2026 •

edited

Loading

github-actions Bot commented May 8, 2026 •

edited

Loading