Add standard training script with selective precision and sliding win…

b76cf36
Open

9L MLP3x + STE int6 QAT + ROPE=200K + warmdown=14K: val_bpb=0.9588 — 0.2656 nats over baseline #1

There are no checks for this commit