Skip to content

Add DenseSwiGLUMLP for dense baseline models (num_experts=1) #170

Add DenseSwiGLUMLP for dense baseline models (num_experts=1)

Add DenseSwiGLUMLP for dense baseline models (num_experts=1) #170

Annotations

2 errors and 1 warning

test (3.12)

cancelled Apr 3, 2026 in 1m 39s