
WIP Record: SP8192 + CaseOps + Depth Curriculum + FreqGPTQ + PPM adaptive-λ mixture — val_bpb 0.90687688 (1-seed)#1833

Draft
pragnyanramtha wants to merge 17 commits into openai:main from pragnyanramtha:record/attempt3

Conversation

@pragnyanramtha

Summary

Builds on romeerp's #1756 depth curriculum stack. Adds two techniques:

  1. FreqGPTQ — upweights top-100 most frequent calibration tokens by 2×
    during Hessian collection, improving int6 quantization quality on
    high-frequency vocabulary items.

  2. PPM-D adaptive-λ mixture (from #1785, "OE-GOD Record: SP4096 + byte-level
    PPM adaptive-λ mixture — val_bpb 1.01925 (3-seed)") — byte-level order-5
    PPM predictor mixed with the NN log-probs at eval time via an adaptive
    gate: λ=0.05 when PPM confidence >0.9, λ=0.9 otherwise. Zero artifact cost.
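For readers unfamiliar with the FreqGPTQ tweak, here is a minimal sketch of the idea: GPTQ accumulates a per-layer Hessian proxy H = Σ wᵢ·xᵢxᵢᵀ from calibration activations, and FreqGPTQ sets wᵢ = 2 for rows produced by the top-100 most frequent calibration tokens. Function name, argument names, and shapes below are illustrative, not the actual implementation:

```python
import numpy as np

def freq_weighted_hessian(acts, token_ids, token_counts, top_k=100, boost=2.0):
    """Hypothetical FreqGPTQ-style Hessian accumulation.

    acts:         (n_samples, d) layer-input activations, one row per token
    token_ids:    (n_samples,) vocab id that produced each activation row
    token_counts: (vocab_size,) frequency of each token in the calibration set
    """
    # Ids of the top_k most frequent calibration tokens.
    top = set(np.argsort(token_counts)[::-1][:top_k].tolist())
    d = acts.shape[1]
    H = np.zeros((d, d))
    for x, t in zip(acts, token_ids):
        # Upweight the outer-product contribution of frequent tokens.
        w = boost if int(t) in top else 1.0
        H += w * np.outer(x, x)
    return H
```

The effect is that quantization error on high-frequency vocabulary items is penalized more heavily when GPTQ solves for the quantized weights.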

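The adaptive-λ gate can be sketched as below, assuming λ weights the NN distribution (so a confident PPM prediction pulls the mixture toward PPM) and confidence is the PPM's max next-byte probability; all names and the confidence definition are assumptions, not the PR's actual code:

```python
import numpy as np

def adaptive_lambda_mix(nn_probs, ppm_probs, conf_thresh=0.9,
                        lam_confident=0.05, lam_default=0.9):
    """Hypothetical adaptive-λ mixture of next-byte distributions.

    nn_probs, ppm_probs: (..., 256) probability distributions per position.
    Assumption: λ is the NN weight, so λ=0.05 when the PPM is confident
    (max prob > conf_thresh) and λ=0.9 otherwise, matching the gate above.
    """
    conf = ppm_probs.max(axis=-1)
    lam = np.where(conf > conf_thresh, lam_confident, lam_default)[..., None]
    return lam * nn_probs + (1.0 - lam) * ppm_probs
```

Since the gate runs only at eval time on the two models' outputs, it adds no parameters to the artifact, consistent with the "zero artifact cost" claim.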
Results (1-seed, 8×H100 SXM)

Metric                       Value
Pre-quant post-EMA val_bpb   1.07238
Post-TTT val_bpb             1.06902
Artifact size                ~24.5 MB ⚠️ (over cap, WIP)
Eval time                    ~658s ⚠️ (over cap, WIP)

Status

Single-seed screening run. Two known issues are being fixed:

  • Artifact size over 16MB cap (investigating NUM_LOOPS reduction + more
    aggressive int8 passthrough quantization)
  • Eval time over 600s cap (investigating TTT chunk reduction)

Full 3-seed compliant submission pending fixes.

Base

Fork of romeerp's #1756 (CaseOps + depth curriculum 1→3→4)

