Optional weight tying for Qwen3 and Llama3.2 pretraining #535

Triggered via pull request January 14, 2026 08:54
Status Success
Total duration 3m 47s

basic-tests-macos-uv.yml

on: pull_request
Code tests (macOS)
3m 43s