Skip to content

Add DenseSwiGLUMLP for dense baseline models (num_experts=1) #170

Add DenseSwiGLUMLP for dense baseline models (num_experts=1)

Add DenseSwiGLUMLP for dense baseline models (num_experts=1) #170