Evaluation with GitHub Copilot #593

Manually triggered May 28, 2026 14:58
Status Success
Total duration 1h 12m 42s
Artifacts 5

copilot-evaluation.yml

on: workflow_dispatch
get-entries / get-entries (14s)
Matrix: evaluate-with-copilot-cli
summarize-results / Results (54s)
requeue / cleanup-ephemeral-tag
requeue / requeue-if-needed

Annotations

2 errors and 4 warnings
bcbench.evaluate.testgeneration
Tests failed during evaluation of microsoftInternal__NAV-218253
Traceback (most recent call last):
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\evaluate\testgeneration.py", line 109, in evaluate
    run_test_suite(generated_tests, "Fail", container)
    ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\operations\bc_operations.py", line 202, in run_test_suite
    raise TestExecutionError(expectation, e.stderr, e.stdout) from None
bcbench.exceptions.TestExecutionError: Test result did not meet expectation (expected: Fail)
Setting test codeunit range '137405'
[16:03:37] Tests passed for Codeunit 137405
bcbench.evaluate.testgeneration
Tests failed during evaluation of microsoftInternal__NAV-208748
Traceback (most recent call last):
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\evaluate\testgeneration.py", line 119, in evaluate
    run_test_suite(generated_tests, "Pass", container)
    ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\operations\bc_operations.py", line 202, in run_test_suite
    raise TestExecutionError(expectation, e.stderr, e.stdout) from None
bcbench.exceptions.TestExecutionError: Test result did not meet expectation (expected: Pass)
Setting test codeunit range '134377'
Codeunit 134377 ERM Sales Blanket Order
Testfunction SalesOrderFromBlanketOrderCarriesExtendedTextForAllItemLines Failure (2.9 seconds)
Error: Assert.TableIsNotEmpty failed. Table <Sales Line> with filter <Document Type: Order, Document No.: 101010, Type: ' ', Description: E1> must contain records.
Call Stack:
  Assert(CodeUnit 130000).RecRefIsNotEmpty line 3 - Tests-TestLibraries by Microsoft version 26.0.0.0
  Assert(CodeUnit 130000).RecordIsNotEmpty line 5 - Tests-TestLibraries by Microsoft version 26.0.0.0
  "ERM Sales Blanket Order"(CodeUnit 134377).VerifySalesOrderExtendedText line 8 - Tests-ERM by Microsoft version 26.0.0.0
  "ERM Sales Blanket Order"(CodeUnit 134377).VerifySalesOrderExtendedTexts line 2 - Tests-ERM by Microsoft version 26.0.0.0
  "ERM Sales Blanket Order"(CodeUnit 134377).SalesOrderFromBlanketOrderCarriesExtendedTextForAllItemLines line 31 - Tests-ERM by Microsoft version 26.0.0.0
  "Test Runner - Mgt"(CodeUnit 130454).RunTests line 26 - Test Runner by Microsoft version 26.0.0.0
  "Test Runner - Isol. Codeunit"(CodeUnit 130450).OnRun(Trigger) line 4 - Test Runner by Microsoft version 26.0.0.0
  "Test Suite Mgt."(CodeUnit 130456).RunTests line 2 - Test Runner by Microsoft version 26.0.0.0
  "Test Suite Mgt."(CodeUnit 130456).RunSelectedTests line 35 - Test Runner by Microsoft version 26.0.0.0
  "Command Line Test Tool"(Page 130455)."RunSelectedTests - OnAction"(Trigger) line 7 - Test Runner by Microsoft version 26.0.0.0
bcbench.results.base
Result for microsoftInternal__NAV-193853 missing metrics: llm_duration
bcbench.results.base
Result for microsoftInternal__NAV-204450 missing metrics: llm_duration
bcbench.results.base
Result for microsoftInternal__NAV-218253 missing metrics: llm_duration
bcbench.results.base
Result for microsoftInternal__NAV-208748 missing metrics: llm_duration
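
The two error annotations above both come from an expectation check: the evaluation runs the generated tests once expecting "Fail" (testgeneration.py line 109) and once expecting "Pass" (line 119), and raises when the outcome differs. A minimal Python sketch of that pattern follows. It is not the BC-Bench implementation: `ExpectationError`, `check_expectation`, and `evaluate_generated_tests` are hypothetical names, and the reading that the "Fail" run targets the unfixed code while the "Pass" run targets the fixed code is an assumption the log does not state.

```python
class ExpectationError(Exception):
    """Hypothetical stand-in for bcbench.exceptions.TestExecutionError."""

    def __init__(self, expectation: str, actual: str) -> None:
        super().__init__(
            f"Test result did not meet expectation (expected: {expectation})"
        )
        self.expectation = expectation
        self.actual = actual


def check_expectation(tests_passed: bool, expectation: str) -> None:
    """Raise if the observed outcome differs from the expected one."""
    actual = "Pass" if tests_passed else "Fail"
    if actual != expectation:
        raise ExpectationError(expectation, actual)


def evaluate_generated_tests(passes_first_run: bool, passes_second_run: bool) -> bool:
    """Return True only if the tests fail on the first run and pass on the second.

    Assumption: the first run is against unfixed code, the second against fixed code.
    """
    try:
        check_expectation(passes_first_run, "Fail")
        check_expectation(passes_second_run, "Pass")
    except ExpectationError:
        return False
    return True
```

Under this reading, NAV-218253 corresponds to `evaluate_generated_tests(True, ...)`, where the tests passed when a failure was expected, and NAV-208748 corresponds to `evaluate_generated_tests(False, False)`, where the tests failed when a pass was expected.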

Artifacts

Produced during runtime
Name                                                           Status   Size       Digest
evaluation-results-26582746860-microsoftInternal__NAV-193853  Expired  2.71 KB    sha256:afaf023176f242ac46ad55fab3a496898ab4ad9d88df257d464ce1a27f8235e8
evaluation-results-26582746860-microsoftInternal__NAV-204450  Expired  2.59 KB    sha256:5553a8eb66cf37fe3f208e3cec44168cbde5b0b025addc58209aecef3f346c50
evaluation-results-26582746860-microsoftInternal__NAV-208748  Expired  2.63 KB    sha256:a055181387a9f0bb83cd1cb0847df2e031ea16c997df66ab16b68693cc582d75
evaluation-results-26582746860-microsoftInternal__NAV-218253  Expired  2.62 KB    sha256:e02ed2239f52d5d0a02aa6491fb519ee19ded834c5a658f2ce12a3991f0eacc6
evaluation-summary                                             Expired  608 Bytes  sha256:38f17a75670b31acc62a7e9289c54ddd3b0fe019056d189e49e087ea057c9aad