PR #287 base + Overtone init + 12L option #5

Closed
machdragon wants to merge 1 commit into main from submission/xsa-overtone-ema997
@machdragon machdragon commented Mar 21, 2026

Summary

Run plan

| Run | Config delta | Goal |
|-----|--------------|------|
| A | Overtone only (11L) | Baseline: does Overtone help? |
| B | Overtone + NUM_LAYERS=12 | 12L capacity headroom |
| C | Overtone + EMA_DECAY=0.996 | Bracket EMA optimum |
| D | Overtone + EMA_DECAY=0.998 | Bracket EMA optimum |
| E | Best config, 3 seeds | Submission validation |
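
For context on runs C and D, here is a minimal sketch of what the `EMA_DECAY` setting typically controls. It assumes the standard exponential-moving-average update of model weights; the training script's actual EMA code is not shown in this PR, so `ema_update` and `ema_horizon` are hypothetical helper names used only for illustration.

```python
def ema_update(ema, param, decay):
    """One EMA step (assumed standard form): blend the running average with the new value."""
    return decay * ema + (1.0 - decay) * param


def ema_horizon(decay):
    """Approximate number of recent steps the average effectively covers: 1 / (1 - decay)."""
    return 1.0 / (1.0 - decay)


# Compare the two decay values bracketed in runs C and D.
for decay in (0.996, 0.998):
    print(f"EMA_DECAY={decay}: ~{ema_horizon(decay):.0f} steps")
```

Under this assumption, 0.996 averages over roughly 250 steps and 0.998 over roughly 500, so the two runs probe a shorter and a longer averaging window.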

Test plan

🤖 Generated with Claude Code


PR openai#287 (SOTA val_bpb=1.1271) with Overtone embedding init added.
Overtone reshapes tok_emb singular values to power-law decay for
better int6 quantization. No KURE, R2, tanh, or TTT — clean minimal
delta over proven SOTA. Modal launcher with CLI flags for EMA decay,
num_layers, and XSA sweep.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
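
As an illustration of the idea in the commit message, the following is a rough NumPy sketch of reshaping an embedding matrix's singular values to a power-law decay. It is not the PR's implementation: the function name `overtone_init`, the exponent `alpha`, the `std` of the random start, and the choice to keep the largest singular value as the scale are all assumptions made for this example.

```python
import numpy as np


def overtone_init(vocab_size, dim, alpha=0.5, std=0.02, seed=0):
    """Hypothetical sketch: random embedding whose singular values follow k**(-alpha)."""
    rng = np.random.default_rng(seed)
    w = rng.normal(0.0, std, size=(vocab_size, dim))

    # Decompose the random matrix; keep its singular vectors, replace its spectrum.
    u, s, vt = np.linalg.svd(w, full_matrices=False)

    # Power-law target spectrum, scaled to the original top singular value (assumption).
    k = np.arange(1, s.shape[0] + 1, dtype=np.float64)
    target = s[0] * k ** (-alpha)

    # Recompose: scale each left singular vector by its new singular value.
    return (u * target) @ vt


tok_emb = overtone_init(vocab_size=64, dim=16)
spectrum = np.linalg.svd(tok_emb, compute_uv=False)
print(spectrum[:4] / spectrum[0])  # ratios follow k**(-alpha) for k = 1..4
```

A steeper spectrum concentrates the matrix's energy in a few directions, which is presumably why the commit message links it to int6 quantization; the sketch only demonstrates the spectral reshaping, not any quantization benefit.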

@devin-ai-integration devin-ai-integration Bot left a comment

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 5 additional findings.

@machdragon machdragon closed this Apr 25, 2026