Commit 201f563
committed
DEPRECATE specs 060G and 060H — refuted by PR openai#1898 measurement
PR openai#1898 (X-Abhishek-X) ran Partial SpinQuant + EMBED_BITS=6 reinvest on
the same chain and reported val_bpb 1.06614 vs their base openai#1851's 1.06128
= +0.00486 REGRESSION. Their PR framed it as -0.01486 vs the 2-week-old
merged SOTA openai#1493 (1.0810) instead of vs their actual parent.
Implications:
- 060G (Partial SpinQuant): empirically null/negative on this chain.
- 060H (EMBED_BITS=6 alone or with LQER reinvest): even riskier without
SpinQuant's rotation protection.
Both specs marked as DEPRECATED at the top. Not deleted (kept as
documentation for if conditions change later, e.g., deploy-time repair
specifically targeting tok_emb precision).1 parent 827f5ab commit 201f563
2 files changed
Lines changed: 31 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
3 | 18 | | |
4 | 19 | | |
5 | 20 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
3 | 19 | | |
4 | 20 | | |
5 | 21 | | |
| |||
0 commit comments