[Non-Record] LegendreGPT: Legendre polynomial depth parameterization#1337

Merged
valerio-oai merged 5 commits into openai:main from sergimichi:legendregpt-submission
May 3, 2026

@sergimichi
Contributor

Non-record submission (1x RTX 5090, ~27h training).

Pre-quant val_bpb: 1.2079
Post-quant val_bpb: 1.2266 (mixed INT8/INT7+LZMA, 15.70 MB)

Transformer weights are parameterized as smooth functions of depth using Legendre polynomials. The 2-group architecture derives 24 virtual layers from 12 coefficient matrices.
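
For readers unfamiliar with the idea, here is a minimal NumPy sketch of a depth parameterization of this kind. It is not the submission's code. The function name `virtual_layer_weights`, the 8x8 matrix size, and the split into 6 coefficient matrices and 12 virtual layers per group are assumptions chosen only to match the 12-to-24 ratio stated above. Each virtual layer's weight is taken as a sum of coefficient matrices weighted by Legendre polynomials evaluated at that layer's depth, rescaled to [-1, 1].

```python
import numpy as np
from numpy.polynomial import legendre


def virtual_layer_weights(coeffs: np.ndarray, num_layers: int) -> np.ndarray:
    """Expand K coefficient matrices into num_layers weight matrices.

    coeffs has shape (K, d_out, d_in). Layer l receives
    W_l = sum_k P_k(t_l) * coeffs[k], where t_l is the layer depth
    mapped onto [-1, 1] and P_k is the k-th Legendre polynomial.
    """
    # Evenly spaced depths on the interval where Legendre polynomials are orthogonal.
    depths = np.linspace(-1.0, 1.0, num_layers)
    # basis[l, k] = P_k(depths[l]); shape (num_layers, K).
    basis = legendre.legvander(depths, coeffs.shape[0] - 1)
    # Contract over the polynomial index k.
    return np.einsum("lk,kij->lij", basis, coeffs)


# Hypothetical sizes: one group with 6 coefficient matrices and 12 virtual layers.
rng = np.random.default_rng(0)
coeffs = rng.standard_normal((6, 8, 8))
weights = virtual_layer_weights(coeffs, num_layers=12)
print(weights.shape)
```

In this sketch only `coeffs` would be trained and stored, so the number of stored matrices is set by the polynomial degree rather than by the number of virtual layers.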

Contributor

@valerio-oai valerio-oai left a comment

Selected for the notable non-record submissions section.

@valerio-oai valerio-oai merged commit bf339c1 into openai:main May 3, 2026