bd7811f
Closed

Record: PR #1886 base + per-block MLP output gate (Linear, weight-learnable) — val_bpb 1.06872 (3-seed mean) #1941
