Hi @dbolya, very solid work. I'm currently applying ToMe to some diffusion-based models, and I found that when it is applied to the MLP module, the results are incorrect even with a very small pruning ratio (0.1), like this. Do you have any suggestions?
