Commit e587d76
PR openai#180 SOTA: 10L Int5-MLP + BigramHash(10240) + SWA(0.4) + WD=0.04
Reproduce openai/parameter-golf PR openai#180 (val_bpb 1.14276, 3-seed mean).
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent 0f51451 commit e587d76
3 files changed
Lines changed: 397 additions & 267 deletions