In the interest of avoiding surprises, let's remove the automatic ONNX-ification of models immediately after training.
Often the trained model is not the one that will go to production, and thus we're wasting compute time ONNX-ifying them prepmaturely.