Commit 4057714
spec 055: default PPM_OMP_CHUNK_TOKENS=0 (single-pass, matches 1850)
Earlier default 4194304 (OMP-chunked) was suboptimal — saves ~230s eval time
but loses ~0.010 BPB sidecar from chunk-reset penalty. PR openai#1850 chose single-
pass deliberately and pays the 252s scoring cost for the bigger gain.
Single-pass timing on 8H per 1850's measurements:
pre-quant + gptq + ema: ~85s
diagnostic quantized eval: ~60s
non-overlap forward (8-way): ~20s
file gather: ~5s
single-pass PPM scoring: ~250s (CPU-bound, not GPU)
────────────────────────────────────
total eval phase: ~420s under 600s cap
Smokes (where wallclock matters more than gain) can override with
PPM_OMP_CHUNK_TOKENS=4194304.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>1 parent 80967e4 commit 4057714
1 file changed
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
79 | 79 | | |
80 | 80 | | |
81 | 81 | | |
82 | | - | |
| 82 | + | |
83 | 83 | | |
84 | 84 | | |
85 | 85 | | |
| |||
0 commit comments