Evaluation with GitHub Copilot #593
copilot-evaluation.yml
on: workflow_dispatch
get-entries / get-entries (14s)
Matrix: evaluate-with-copilot-cli
requeue / cleanup-ephemeral-tag
requeue / requeue-if-needed
Annotations
2 errors and 4 warnings
bcbench.evaluate.testgeneration
Tests failed during evaluation of microsoftInternal__NAV-218253
Traceback (most recent call last):
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\evaluate\testgeneration.py", line 109, in evaluate
    run_test_suite(generated_tests, "Fail", container)
    ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\operations\bc_operations.py", line 202, in run_test_suite
    raise TestExecutionError(expectation, e.stderr, e.stdout) from None
bcbench.exceptions.TestExecutionError: Test result did not meet expectation (expected: Fail)
Setting test codeunit range '137405'
[16:03:37] Tests passed for Codeunit 137405

bcbench.evaluate.testgeneration
Tests failed during evaluation of microsoftInternal__NAV-208748
Traceback (most recent call last):
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\evaluate\testgeneration.py", line 119, in evaluate
    run_test_suite(generated_tests, "Pass", container)
    ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\vss-agent\2.334.0\_work\BC-Bench\BC-Bench\src\bcbench\operations\bc_operations.py", line 202, in run_test_suite
    raise TestExecutionError(expectation, e.stderr, e.stdout) from None
bcbench.exceptions.TestExecutionError: Test result did not meet expectation (expected: Pass)
Setting test codeunit range '134377'
Codeunit 134377 ERM Sales Blanket Order
Testfunction SalesOrderFromBlanketOrderCarriesExtendedTextForAllItemLines Failure (2.9 seconds)
Error:
Assert.TableIsNotEmpty failed. Table <Sales Line> with filter <Document Type: Order, Document No.: 101010, Type: ' ', Description: E1> must contain records.
Call Stack:
Assert(CodeUnit 130000).RecRefIsNotEmpty line 3 - Tests-TestLibraries by Microsoft version 26.0.0.0
Assert(CodeUnit 130000).RecordIsNotEmpty line 5 - Tests-TestLibraries by Microsoft version 26.0.0.0
"ERM Sales Blanket Order"(CodeUnit 134377).VerifySalesOrderExtendedText line 8 - Tests-ERM by Microsoft version 26.0.0.0
"ERM Sales Blanket Order"(CodeUnit 134377).VerifySalesOrderExtendedTexts line 2 - Tests-ERM by Microsoft version 26.0.0.0
"ERM Sales Blanket Order"(CodeUnit 134377).SalesOrderFromBlanketOrderCarriesExtendedTextForAllItemLines line 31 - Tests-ERM by Microsoft version 26.0.0.0
"Test Runner - Mgt"(CodeUnit 130454).RunTests line 26 - Test Runner by Microsoft version 26.0.0.0
"Test Runner - Isol. Codeunit"(CodeUnit 130450).OnRun(Trigger) line 4 - Test Runner by Microsoft version 26.0.0.0
"Test Suite Mgt."(CodeUnit 130456).RunTests line 2 - Test Runner by Microsoft version 26.0.0.0
"Test Suite Mgt."(CodeUnit 130456).RunSelectedTests line 35 - Test Runner by Microsoft version 26.0.0.0
"Command Line Test Tool"(Page 130455)."RunSelectedTests - OnAction"(Trigger) line 7 - Test Runner by Microsoft version 26.0.0.0

bcbench.results.base
Result for microsoftInternal__NAV-193853 missing metrics: llm_duration

bcbench.results.base
Result for microsoftInternal__NAV-204450 missing metrics: llm_duration

bcbench.results.base
Result for microsoftInternal__NAV-218253 missing metrics: llm_duration

bcbench.results.base
Result for microsoftInternal__NAV-208748 missing metrics: llm_duration

Artifacts
Produced during runtime
| Name | Size | Digest |
|---|---|---|
| evaluation-results-26582746860-microsoftInternal__NAV-193853 (Expired) | 2.71 KB | sha256:afaf023176f242ac46ad55fab3a496898ab4ad9d88df257d464ce1a27f8235e8 |
| evaluation-results-26582746860-microsoftInternal__NAV-204450 (Expired) | 2.59 KB | sha256:5553a8eb66cf37fe3f208e3cec44168cbde5b0b025addc58209aecef3f346c50 |
| evaluation-results-26582746860-microsoftInternal__NAV-208748 (Expired) | 2.63 KB | sha256:a055181387a9f0bb83cd1cb0847df2e031ea16c997df66ab16b68693cc582d75 |
| evaluation-results-26582746860-microsoftInternal__NAV-218253 (Expired) | 2.62 KB | sha256:e02ed2239f52d5d0a02aa6491fb519ee19ded834c5a658f2ce12a3991f0eacc6 |
| evaluation-summary (Expired) | 608 Bytes | sha256:38f17a75670b31acc62a7e9289c54ddd3b0fe019056d189e49e087ea057c9aad |