|
| 1 | +# Parameter Golf Daily Research - 2026-04-15 |
| 2 | + |
| 3 | +## PR #771 STATUS: CLOSED (REJECTED) — no change |
| 4 | + |
| 5 | +--- |
| 6 | + |
| 7 | +## N-GRAM PR STATUS |
| 8 | + |
| 9 | +| PR | Score | Status | Notes | |
| 10 | +|----|-------|--------|-------| |
| 11 | +| #727 | 0.9674 | **CLOSED** (illegal) | Hashed n-gram cache — no change | |
| 12 | +| #741 | 0.9850 | **CLOSED** (illegal) | No change | |
| 13 | +| #758 | 1.0465 | **OPEN** (dead) | No new activity. XOR hash includes target token; effectively dead. | |
| 14 | +| #731 | 1.0400 | **OPEN** | Dense-count tables + Laplace smoothing. Awaiting seeds 1337+2024. No update. | |
| 15 | + |
| 16 | +--- |
| 17 | + |
| 18 | +## Leaderboard |
| 19 | + |
| 20 | +**Merged SOTA: 1.0810 (bigbag, PR #1493) — DAY 6 UNCHANGED.** |
| 21 | + |
| 22 | +Last upstream commit: `75700cb 2026-04-09 15:22 PDT` (PR #1511, leaderboard README). Zero new records since Apr 9. |
| 23 | + |
| 24 | +This is the longest plateau since the Apr 5–9 acceleration wave (4 records in 4 days). Either the field is stuck, or a wave of PRs is being prepared for an end-of-month push. **15 days to deadline.** |
| 25 | + |
| 26 | +Best open PRs (no changes from Apr 14): |
| 27 | + |
| 28 | +| PR | Score | Author | Technique | Legal? | |
| 29 | +|----|-------|--------|-----------|--------| |
| 30 | +| #1585 | **1.0639** | codemath3000 | Casefold Tokenizer + Parallel Residuals + Systems Opt | **AWAIT RULING** | |
| 31 | +| #1578 | **1.0668** | mikeapedia | Casefold BPE retrain | **AWAIT RULING** | |
| 32 | +| #1560 | **1.07406** | dexhunter | VarLen Attention + Doc-TTT | **YES** | |
| 33 | +| #1586 | **1.07493** | dexhunter | Per-Layer Adaptive GPTQ + int7 Emb + MLR=0.026 | **YES** | |
| 34 | +| #1584 | **1.0752** | codemath3000 | Systems Opt (fused Muon + batched EMA + loader prealloc) | **YES** | |
| 35 | +| #1555 | **1.07636** | andrewbaggio1 | TMA Megakernel + Tap-In (min_match=1) | Tap-In unconfirmed | |
| 36 | +| #1541 | **1.07785** | bigbag | Improved Parallel Residuals + Muon 0.97 | ⚠️ hash embed flag | |
| 37 | +| #1540 | **1.0777** | aryanbhosale | VarLen + Doc-Independent LoRA TTT rank-96 | **YES** | |
| 38 | +| #1610 | **1.0728** | romeerp | VarLenAttn + PhasingTTT | **YES** (low EV) | |
| 39 | + |
| 40 | +**Target**: ≤1.0760 bpb. 15 days remaining. |
| 41 | + |
| 42 | +--- |
| 43 | + |
| 44 | +## What Changed (GitHub — Apr 14–15, 2026) |
| 45 | + |
| 46 | +**No new merges. No new high-priority PRs detected via web search.** Day 6 plateau continues. |
| 47 | + |
| 48 | +Checked via: `git log upstream/main -5` (Apr 9 is most recent) + web search for new submissions. |
| 49 | + |
| 50 | +### PRs to watch for movement: |
| 51 | +- PR #1586 (per-layer GPTQ) — highest probability of merging next given 3-seed confirmation + zero flags |
| 52 | +- PR #1541 (bigbag improved residuals) — hash embed flag must clear first; bigbag is the merged-SOTA author so organizers watch his PRs closely |
| 53 | +- Casefold PRs (#1585, #1578) — ruling pending from @valerio-oai; if ruled legal, would reset our target to ≤1.0589 |
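
The targets quoted in this entry appear to sit 0.005 bpb below a reference score (1.0810 gives ≤1.0760, 1.0639 gives ≤1.0589). That margin is inferred from those pairs rather than stated as a rule here, so the sketch below treats it as an assumption; `MARGIN` and `target_for` are hypothetical names used only for illustration.

```python
from decimal import Decimal

# Assumption: margin inferred from the score/target pairs quoted in this entry,
# not taken from any official competition rule text.
MARGIN = Decimal("0.005")


def target_for(reference_bpb: str) -> Decimal:
    """Return the bpb needed to beat `reference_bpb` by MARGIN."""
    return Decimal(reference_bpb) - MARGIN


if __name__ == "__main__":
    print(target_for("1.0810"))  # reference: current merged SOTA (PR #1493)
    print(target_for("1.0639"))  # reference: PR #1585, if Casefold is ruled legal
```

Using `Decimal` with string inputs keeps the four-decimal scores exact, which avoids binary floating-point rounding when comparing against a threshold.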
| 54 | + |
| 55 | +--- |
| 56 | + |
| 57 | +## New Research Papers |
| 58 | + |
| 59 | +| Priority | Paper | arXiv ID | Date | Key Technique | Competition Relevance | |
| 60 | +|----------|-------|----------|------|---------------|----------------------| |
| 61 | +| **Add to plan** | **Newton-Muon Optimizer** | **2604.01472** | Apr 2, 2026 | Right-preconditioning by input second moment; surrogate quadratic model. Reaches target val loss in **6% fewer steps**, 4% less wall-clock vs standard Muon | **NOT YET IN PLAN.** Drop-in Muon replacement. At our budget (~4800 steps), a 6% step reduction ≈ 288 extra effective steps. Small but free. Compatible with MuonEq-R base; verify they don't conflict before adding. | |
| 62 | +| Already tracked | In-Place TTT | 2604.06169 | Apr 7, 2026 | MLP final-projection fast weights + NTP-aligned loss + chunk-wise updates | Score-first compatible. Key distinction from Session 3: uses NTP loss not reconstruction loss. Lesson #13 ("HARMFUL") used reconstruction loss on a different model. Could retry with NTP-aligned loss before dismissing permanently. Low priority until base stack is confirmed. | |
| 63 | +| Already tracked | PRISM | 2602.10796 | Feb 2026 | Parallelizable iterative residual correction; 174× vs serial | Architectural inspiration for Triple Loop improvement — read before next recurrence change | |
| 64 | +| Already tracked | Ouroboros | 2604.02051 | Apr 2, 2026 | Hypernetwork-generated per-step LoRA modulation for recursive blocks | 9.2M extra params overhead; likely too expensive for 16MB budget. Watch for competition PR. | |
| 65 | +| Already tracked | Mousse | 2603.09697 | Mar 2026 | Kronecker-factored preconditioning for Muon; ~12% fewer steps | Higher EV than Newton-Muon but more overhead | |
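
The "288 effective steps" figure for Newton-Muon is simple arithmetic on the reported 6% step reduction and the ~4800-step budget. A minimal sketch follows; the second estimate (how many standard-Muon steps a full Newton-Muon run would be worth) is an alternative reading added for comparison, not a number from the paper, and both assume the speedup holds uniformly across training.

```python
BUDGET_STEPS = 4800     # approximate step budget quoted in the table
STEP_REDUCTION = 0.06   # reported: target val loss reached in 6% fewer steps


def naive_step_gain(budget: float, reduction: float) -> float:
    """Steps saved if the reduction applies directly to the budget."""
    return budget * reduction


def equivalent_muon_steps(budget: float, reduction: float) -> float:
    """Standard-Muon steps needed to match `budget` steps of the faster optimizer.

    Assumption: progress per step scales uniformly by 1 / (1 - reduction).
    """
    return budget / (1.0 - reduction)


if __name__ == "__main__":
    gain = naive_step_gain(BUDGET_STEPS, STEP_REDUCTION)
    equiv = equivalent_muon_steps(BUDGET_STEPS, STEP_REDUCTION)
    print(f"naive gain: {gain:.0f} steps")
    print(f"equivalent standard-Muon steps: {equiv:.0f} (+{equiv - BUDGET_STEPS:.0f})")
```

Either way the gain is a few hundred steps, which supports the "small but free" assessment only if the per-step overhead stays near the reported 4% wall-clock saving.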
| 66 | + |
| 67 | +--- |
| 68 | + |
| 69 | +## HuggingFace / Community |
| 70 | + |
| 71 | +No new relevant blog posts or model releases. A web search for "parameter-golf 1.06 OR 1.05" returned only the PR list page; no new scores below 1.06 have surfaced publicly. |
| 72 | + |
| 73 | +--- |
| 74 | + |
| 75 | +## Recommended Action |
| 76 | + |
| 77 | +**No strategy change from Apr 14. One addition: add Newton-Muon to technique tracking.** |
| 78 | + |
| 79 | +Priority order: |
| 80 | +1. **Next GPU run: Implement PR #1586** (per-layer GPTQ + int7 emb + MLR=0.026). Expected: ~1.068–1.070 bpb. Config changes only: `clip_sigmas={'mlp': 12.0, 'attn': 13.0, 'emb': 15.0}, MATRIX_LR=0.026, emb_bits=7`. |
| 81 | +2. **Same run: Add VarLen Attention + Doc-TTT (PR #1560 approach).** Combined expected: ~1.062–1.068 bpb. |
| 82 | +3. **Watch PR #1541** — if hash embed flag clears and it merges, new target becomes ≤1.0728. |
| 83 | +4. **Newton-Muon (arXiv:2604.01472)**: Evaluate as a Muon swap in a follow-up run. +288 effective steps at our scale. Check if MuonEq-R and Newton-Muon are additive or redundant before GPU spend. |
| 84 | +5. **Do NOT implement**: Casefold (#1585, await ruling), PR #758 (dead), any AdamW TTT. |
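
The PR #1586 settings in item 1 could be captured as a small validated config object before the GPU run. This is a sketch only: `GptqRunConfig` and its field names are hypothetical, the validation rules are our own sanity checks, and only the numeric values come from the list above. The actual training script may expose these settings differently.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class GptqRunConfig:
    """Hypothetical container for the PR #1586 config values quoted above."""

    # Per-group clipping thresholds, in standard deviations.
    clip_sigmas: dict = field(
        default_factory=lambda: {"mlp": 12.0, "attn": 13.0, "emb": 15.0}
    )
    matrix_lr: float = 0.026  # MATRIX_LR
    emb_bits: int = 7         # embedding quantization width

    def __post_init__(self) -> None:
        # Sanity checks (assumptions, not rules from the PR).
        missing = {"mlp", "attn", "emb"} - set(self.clip_sigmas)
        if missing:
            raise ValueError(f"clip_sigmas missing groups: {sorted(missing)}")
        if any(v <= 0 for v in self.clip_sigmas.values()):
            raise ValueError("clip_sigmas values must be positive")
        if self.matrix_lr <= 0:
            raise ValueError("matrix_lr must be positive")
        if self.emb_bits <= 0:
            raise ValueError("emb_bits must be positive")


if __name__ == "__main__":
    print(GptqRunConfig())
```

Freezing the dataclass makes it harder to mutate settings mid-run by accident, which helps when comparing results across seeds.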
| 85 | + |
| 86 | +--- |
| 87 | + |
| 88 | +_Updated: 2026-04-15 (merged SOTA 1.0810 Day 6 no change; no new PRs; Newton-Muon arXiv:2604.01472 added as new tracked technique (+6% effective steps); 15 days remaining)_ |
| 89 | + |
| 90 | +--- |
| 91 | + |
1 | 92 | # Parameter Golf Daily Research - 2026-04-14 |
2 | 93 |
|
3 | 94 | ## PR #771 STATUS: CLOSED (REJECTED) — no change |
|