Don't hardcode 255 unless uint8 is enforced #6825

Closed
pmeier opened this issue Oct 24, 2022 · 3 comments · Fixed by #6826 or #6830

Comments

@pmeier
Collaborator

pmeier commented Oct 24, 2022

Across our transformations we sometimes hardcode the value 255. This is justified if we make sure that only torch.uint8 images are allowed at that point, like

if interpolation == "bicubic" and out_dtype == torch.uint8:
    img = img.clamp(min=0, max=255)

However, there are a few instances where uint8 is implied but never enforced:

bound = 1.0 if img1.is_floating_point() else 255.0
return (ratio * img1 + (1.0 - ratio) * img2).clamp(0, bound).to(img1.dtype)

bound = torch.tensor(1 if img.is_floating_point() else 255, dtype=img.dtype, device=img.device)
return bound - img

bound = 1.0 if img.is_floating_point() else 255.0

Instead of hardcoding 255 here, we should either use _max_value(dtype) or, if uint8 is actually required, enforce it.
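For illustration, a minimal sketch of the first option applied to the blend snippet above, assuming the private _max_value helper from torchvision.transforms.functional_tensor is reused (the import path and function name here are illustrative, not the actual fix):

import torch

from torchvision.transforms.functional_tensor import _max_value  # private helper, path may differ across versions


def _blend(img1: torch.Tensor, img2: torch.Tensor, ratio: float) -> torch.Tensor:
    # Derive the clamping bound from the actual dtype instead of hardcoding 255.
    bound = 1.0 if img1.is_floating_point() else float(_max_value(img1.dtype))
    return (ratio * img1 + (1.0 - ratio) * img2).clamp(0, bound).to(img1.dtype)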

cc @vfdev-5 @datumbox

@datumbox
Contributor

Good spot. It's also worth noting that the current stable uses inconsistent bounds across methods. The ones that you highlighted use 255 for all integers. On the other hand, convert_image_dtype uses a different bound depending on the integer type:

def _max_value(dtype: torch.dtype) -> int:
    if dtype == torch.uint8:
        return 255
    elif dtype == torch.int8:
        return 127
    elif dtype == torch.int16:
        return 32767
    elif dtype == torch.int32:
        return 2147483647
    elif dtype == torch.int64:
        return 9223372036854775807

These two need to align. Either we continue assuming that 255 is the right bound for all integers, or, as you propose, we use _max_value(dtype) everywhere else.
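To make the mismatch concrete, a small example with an int16 image (bounds taken from the _max_value table above; the real kernels differ, this only shows the clamping behavior):

import torch

img = torch.full((1, 4, 4), 20_000, dtype=torch.int16)

# Bound used by the highlighted ops: everything above 255 is crushed.
print(img.clamp(0, 255).max().item())    # 255

# Bound implied by convert_image_dtype for int16: values are preserved.
print(img.clamp(0, 32767).max().item())  # 20000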

@pmeier
Collaborator Author

pmeier commented Oct 24, 2022

I prefer the behavior of convert_image_dtype for two reasons:

  1. If we were to enforce [0, 255] for all image dtypes, it makes no sense to allow integer dtypes other than uint8 in the first place. While float16 or float64 share the same value range as the default float32, their precision varies, and thus it makes sense for them to have the same range. However, there is no such effect for integer dtypes: they would have the exact same number of valid values at the cost of increased memory in the case of int{16, 32, 64}.
  2. Changing the behavior of convert_image_dtype is BC-breaking, whereas changing the behavior of the ops mentioned above can be regarded as a bug fix: they were never doing the right thing unless one used uint8 or floating point images.

@pmeier
Collaborator Author

pmeier commented Oct 28, 2022

One more thing came to mind: although an edge case at best, by hardcoding 255 we not only ignore large portions of the range for larger dtypes, we also don't handle the fact that torch.int8 images can only store [0, 127]. For example, with the current implementation F.posterize(image_int8, bits=4) is a no-op.
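For reference, the dtype limits behind this point (this only illustrates the value ranges, not the posterize kernel itself):

import torch

print(torch.iinfo(torch.uint8).max)  # 255
print(torch.iinfo(torch.int8).max)   # 127: a bound or bit mask derived from 255 already exceeds the int8 range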
