[Non-record] Meta-Learned TTT + Error-Guided Adaptation Analysis (val_bpb=1.1645)#296
[Non-record] Meta-Learned TTT + Error-Guided Adaptation Analysis (val_bpb=1.1645)#296sseanliu wants to merge 2 commits intoopenai:mainfrom
Conversation
Combines PR openai#287 (XSA + EMA + Int6 QAT) with PR openai#254 TTT adaptation. Changes: FA2 fallback import, TTT hyperparameters, ttt_adapt function, TTT call before torch.compile in eval section.
Community Review — [Non-record] Meta-Learned TTT + Error-Guided Adaptation Analysis (val_bpb=1.1645)Compliance flag: Pre-Quant TTT violation Head SHA: e3a7958 AnalysisPR #296 contains two separate submissions. Both are disqualified. File 1:
|
Summary
Non-record research submission exploring test-time adaptation strategies for compressed language models at 16MB scale.
Key findings
Score
See README for full methodology and analysis.