-
Notifications
You must be signed in to change notification settings - Fork 7.1k
[Feature Request] Pass multiple images to randomCrop, randomFlip, etc. #533
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Why don't you just use functional transforms? In my opinion it gives you better control as well for augmenting targets instead of passing to transforms. |
I believe joint transformation of input and target is pretty common, and thus probably worth inclusion in vision |
Indeed, they are very common and would deserve a place here. Unfortunately, all the alternatives I've seem / came up with were not generic nor good enough to so that I could add it to In the end, it just seemed easier to have the transforms be written by the user, leveraging the functional interface. That's the most generic way, even though it is a bit more verbose. |
This feature request keeps popping up. I think we should improve the docs and the tutorial (especially the data loader tutorial) to cover this case better. |
@sotte that's a great point! Would you be willing to improve the documentation? |
@fmassa sure, I'll try to do it on the weekend. Maybe I would add a few words to the api docs of functional transforms along the lines "functional transforms give you fine grained control...yada yada") plus a short tutorial for for functional transforms. Sounds good? |
@sotte yes, please! |
I think the current situation is better now than what it was before, thanks to the note from @sotte |
Recently I've worked with some semantic segmentation algorithms and there I've needed to randomly crop the images and apply the same crop to the mask/label image(to make sure that they are still aligned) and I couldn't do this with the current torchvision transformations.
Therefore I wanted to ask if anybody else would be interested in an implementation of these transforms for multiple images. If you are interested in a feature like this I could implement this :)
The text was updated successfully, but these errors were encountered: