
port FiveCrop and TenCrop to prototype API #5513


Merged: 10 commits into pytorch:main on Mar 7, 2022

Conversation

@pmeier (Collaborator) commented Mar 2, 2022

Both transforms are outliers in the current stable API, as discussed in #5500 (comment). Even if we eventually want to deprecate them in favor of a different approach, we need to support them on day 0 of the rollout.

The issues with these transforms arise because they always need to be followed by another transform that batches the crops together:

Example:
>>> transform = Compose([
...     FiveCrop(size),  # this produces a list of PIL Images
...     Lambda(lambda crops: torch.stack([ToTensor()(crop) for crop in crops])),  # returns a 4D tensor
... ])

We cannot reuse this lambda approach, since the crops might be buried inside a nested container. Thus, we need a custom transformation that picks up on the result of such a multi-crop transform and batches it. To communicate which elements need to be batched between the two transforms, I've opted to wrap the return values of the multi-crop kernels in named tuples, because they are fully JIT-scriptable. The old FiveCrop returned a regular tuple, so this is 100% compatible. TenCrop returned a list, but this is still fine unless someone is using list-specific methods like append.
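The backwards-compatibility claim is easy to verify with a small sketch. The class and field names below are illustrative, not the ones used in the PR: the point is simply that a named tuple passes every check a plain tuple does.

```python
from typing import NamedTuple

# Hypothetical stand-in for the wrapped five-crop result; the field
# names are illustrative, not the ones from the PR.
class FiveCropResult(NamedTuple):
    top_left: str
    top_right: str
    bottom_left: str
    bottom_right: str
    center: str

crops = FiveCropResult("tl", "tr", "bl", "br", "c")

# A named tuple still behaves like the regular tuple the stable
# FiveCrop used to return, so existing user code keeps working.
assert isinstance(crops, tuple)
tl, tr, bl, br, c = crops   # positional unpacking still works
assert crops[-1] == "c"     # indexing still works
```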

Example usage:

from torchvision.prototype import datasets, transforms, features

transform = transforms.Compose(
    transforms.DecodeImage(),
    transforms.TenCrop(16),
    transforms.BatchMultiCrop(),
)

for sample in datasets.load("imagenet").map(transform):
    print(sample["image"].shape)
    break
torch.Size([10, 3, 16, 16])

To test with PIL images, insert transforms.Lambda(to_pil_image, features.Image) after the decoding.

@facebook-github-bot commented Mar 2, 2022

💊 CI failures summary and remediations

As of commit f1a5003 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

return [apply_recursively(fn, item) for item in obj]
elif isinstance(obj, collections.abc.Mapping):
return {key: apply_recursively(fn, item) for key, item in obj.items()}
exclude_sequence_types: Collection[Type] = (str,),
@pmeier (Collaborator, Author)

We need this addition to be able to exclude named tuples as sequences in the BatchMultiCrop transform. My gut says we are going to need this fine-grained control again for other transforms that are not yet ported / implemented. If that turns out not to be true, I'm happy to implement a custom solution only in the BatchMultiCrop transform, given that we will probably deprecate it anyway.

@datumbox (Contributor)

We should start from the assumption that this is not needed and add it later. Starting with a simple solution first is a good prior. Please simplify as much as possible.

return apply_recursively(
functools.partial(self._transform, params=self._get_params(sample)),
sample,
exclude_sequence_types=(str, *self._MULTI_CROP_TYPES),
@pmeier (Collaborator, Author)

We need this exclusion here because named tuples would by default be recognized as sequences, and thus we would only get the individual elements rather than everything at once.
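The underlying Python behavior is straightforward to check: a named tuple registers as an abstract Sequence, so a naive recursive traversal would descend into it and visit the crops one by one. The named tuple below is illustrative only.

```python
import collections
import collections.abc

# Illustrative named tuple standing in for a multi-crop result.
Crops = collections.namedtuple("Crops", ["tl", "tr", "bl", "br", "center"])
crops = Crops(1, 2, 3, 4, 5)

# Named tuples are both tuples and abstract Sequences, so without an
# explicit exclusion a recursive traversal treats them like any list
# and processes the individual crops instead of the whole group.
assert isinstance(crops, tuple)
assert isinstance(crops, collections.abc.Sequence)
assert list(crops) == [1, 2, 3, 4, 5]
```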

@datumbox (Contributor)

I think that's one more reason not to use named tuples.

@datumbox (Contributor) left a review

@pmeier As we discussed previously offline, these specific transforms are a bit problematic, as they don't fit nicely into the new API.
The only reason we provide support for them is backwards compatibility, so I would avoid introducing any new features. Let's offer the simplest possible solution, just to ensure we won't break anyone's code.


@pmeier (Collaborator, Author) commented Mar 7, 2022

After some offline discussion, @datumbox and I agreed that we do indeed need the BatchMultiCrop transform. Nevertheless, with the new commits the implementation is a lot simpler: the "result wrapper" moved onto the transformations, which makes it possible to use a single subtype of list rather than the two namedtuples that would have been necessary to satisfy torchscript. Plus, instead of making apply_recursively more general, BatchMultiCrop now implements a custom version of it to keep things simple.
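The simplified design might be sketched roughly as follows. All names here are illustrative rather than the exact prototype API, and a plain tuple stands in for the stacked tensor that torch.stack would produce in the real transform.

```python
# Minimal sketch of the simplified design: a single list subclass marks
# multi-crop output, and a custom recursion batches it, instead of
# relying on a generalized apply_recursively.
class MultiCropResult(list):
    """Marker type: a list of crops that should be batched together."""

def batch_multi_crop(obj):
    if isinstance(obj, MultiCropResult):
        # In the real transform the crops are image tensors and this
        # would be torch.stack(obj); a tuple stands in for the batch here.
        return tuple(obj)
    elif isinstance(obj, dict):
        return {key: batch_multi_crop(value) for key, value in obj.items()}
    elif isinstance(obj, (list, tuple)):
        # Recurse into ordinary containers, preserving their type.
        return type(obj)(batch_multi_crop(item) for item in obj)
    else:
        return obj

sample = {"image": MultiCropResult(["crop0", "crop1", "crop2"]), "label": 7}
batched = batch_multi_crop(sample)
assert batched == {"image": ("crop0", "crop1", "crop2"), "label": 7}
```

Because the marker is a list subclass, the check for it must come before the generic list/tuple branch; otherwise the crops would be recursed into individually instead of batched as a group.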

@pmeier pmeier requested a review from datumbox March 7, 2022 10:39
@datumbox (Contributor) left a review

LGTM.

I think we should also review in the near future whether the very detailed nested sample is something we need to support, or whether it's something we don't need in the datasets. If it turns out it's not needed, it will allow us to massively simplify the transforms.

cc @NicolasHug

@pmeier pmeier merged commit 7039c2c into pytorch:main Mar 7, 2022
facebook-github-bot pushed a commit that referenced this pull request Mar 15, 2022
Summary:
* port FiveCrop and TenCrop to prototype API

* fix ten crop for pil

* Update torchvision/prototype/transforms/_geometry.py

* simplify implementation

* minor cleanup

Reviewed By: vmoens

Differential Revision: D34878994

fbshipit-source-id: d1220091f9da4515e831bc44e873041ab33f3ce0

Co-authored-by: Philip Meier <[email protected]>
Co-authored-by: Vasilis Vryniotis <[email protected]>
Successfully merging this pull request may close these issues.

Port transforms.TenCrop to prototype.transforms
Port transforms.FiveCrop to prototype.transforms
3 participants