[Feature Request] Pass multiple images to randomCrop, randomFlip, etc. #533

skief · 2018-06-20T17:47:22Z

Recently I've worked with some semantic segmentation algorithms and there I've needed to randomly crop the images and apply the same crop to the mask/label image(to make sure that they are still aligned) and I couldn't do this with the current torchvision transformations.
Therefore I wanted to ask if anybody else would be interested in an implementation of these transforms for multiple images. If you are interested in a feature like this I could implement this :)

Naman-ntc · 2018-06-27T07:00:53Z

Why don't you just use functional transforms? In my opinion it gives you better control as well for augmenting targets instead of passing to transforms.

Randl · 2018-08-05T07:52:54Z

I believe joint transformation of input and target is pretty common, and thus probably worth inclusion in vision

fmassa · 2018-08-13T21:49:55Z

Indeed, they are very common and would deserve a place here.

Unfortunately, all the alternatives I've seem / came up with were not generic nor good enough to so that I could add it to torchvision.

In the end, it just seemed easier to have the transforms be written by the user, leveraging the functional interface. That's the most generic way, even though it is a bit more verbose.

sotte · 2018-09-09T11:02:37Z

This feature request keeps popping up. I think we should improve the docs and the tutorial (especially the data loader tutorial) to cover this case better.

fmassa · 2018-09-11T13:53:45Z

@sotte that's a great point! Would you be willing to improve the documentation?
Maybe not in the beginner's tutorial, but maybe in an intermediate tutorial?

sotte · 2018-09-11T14:44:03Z

@fmassa sure, I'll try to do it on the weekend.

Maybe I would add a few words to the api docs of functional transforms along the lines "functional transforms give you fine grained control...yada yada") plus a short tutorial for for functional transforms. Sounds good?

fmassa · 2018-09-11T15:13:01Z

@sotte yes, please!

sotte · 2018-09-14T21:24:57Z

I think (and I'm clearly biased) that the ddb145b / #602 improved the multiple images transformation situation. Maybe we don't need the tutorial.

@skief you asked the initial question. What do you think? Is the improvement good enough or is a tutorial still needed?

fmassa · 2018-09-17T09:25:46Z

I think the current situation is better now than what it was before, thanks to the note from @sotte
As I mentioned before, I think it's best for more complex cases to be handled using the functional interface, so I'm closing this issue now. But please fell free to comment if you disagree.

sotte mentioned this issue Sep 12, 2018

Improve docs of functional transforms #602

Merged

fmassa closed this as completed Sep 17, 2018

fmassa mentioned this issue Jul 18, 2019

Keypoint transform #1131

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature Request] Pass multiple images to randomCrop, randomFlip, etc. #533

[Feature Request] Pass multiple images to randomCrop, randomFlip, etc. #533

skief commented Jun 20, 2018

Naman-ntc commented Jun 27, 2018

Uh oh!

Randl commented Aug 5, 2018

Uh oh!

fmassa commented Aug 13, 2018

Uh oh!

sotte commented Sep 9, 2018

Uh oh!

fmassa commented Sep 11, 2018

Uh oh!

sotte commented Sep 11, 2018

Uh oh!

fmassa commented Sep 11, 2018

Uh oh!

sotte commented Sep 14, 2018

Uh oh!

fmassa commented Sep 17, 2018

Uh oh!

[Feature Request] Pass multiple images to randomCrop, randomFlip, etc. #533

[Feature Request] Pass multiple images to randomCrop, randomFlip, etc. #533

Comments

skief commented Jun 20, 2018

Naman-ntc commented Jun 27, 2018

Uh oh!

Randl commented Aug 5, 2018

Uh oh!

fmassa commented Aug 13, 2018

Uh oh!

sotte commented Sep 9, 2018

Uh oh!

fmassa commented Sep 11, 2018

Uh oh!

sotte commented Sep 11, 2018

Uh oh!

fmassa commented Sep 11, 2018

Uh oh!

sotte commented Sep 14, 2018

Uh oh!

fmassa commented Sep 17, 2018

Uh oh!