Skip to content

Add claude-opus-4.8 model and bump version to 0.5.4 #2226

Add claude-opus-4.8 model and bump version to 0.5.4

Add claude-opus-4.8 model and bump version to 0.5.4 #2226

Triggered via pull request May 31, 2026 21:11
Status Success
Total duration 2m 15s
Artifacts 5

CI.yml

on: pull_request
select-category
19s
select-category
lint-and-test
35s
lint-and-test
get-entries  /  get-entries
20s
get-entries / get-entries
Matrix: mock-evaluation
summarize-results  /  Results
56s
summarize-results / Results
Fit to window
Zoom out
Zoom in

Annotations

3 warnings
lint-and-test
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/cache@v4. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
bcbench.results.base
Result for microsoftInternal__NAV-192565 missing metrics: prompt_tokens, completion_tokens
bcbench.results.base
Result for microsoftInternal__NAV-218995 missing metrics: execution_time, llm_duration, turn_count, prompt_tokens, completion_tokens, tool_usage

Artifacts

Produced during runtime
Name Size Digest
evaluation-summary
513 Bytes
sha256:c29c943a9f8307da3c0bc063221f1a23cd2252355f112e9e44d99b8c7dc682b1
microsoftInternal__NAV-175577
494 Bytes
sha256:7a458000dd7d33e5733d118d652f7b6415605a09862230df54b2a1944ffd6555
microsoftInternal__NAV-192565
520 Bytes
sha256:93a96fd214c7fbd1c0b154f05a828b4a95a1e60247109cc74098da0369c633dd
microsoftInternal__NAV-218995
512 Bytes
sha256:3e119e8af5eb37b8107f314e4609d385cc30e9ab5e0a59eaa001193aaed065e6
microsoft__BCApps-4822
541 Bytes
sha256:97b27c4da107770307e0acddd5db1a0925329d84fc755c963813d5ecf24020e6