Update: val_bpb=1.1648, tuned LRs (scalar=0.02, embed=0.03)

014814c
Open

Int6+zstd MLP1488 + Sliding Window + QAT + Tuned LR (val_bpb=1.1648) #107
