assert error len(grid_sizes) == len(strides) == len(cell_anchors) #3246

ghost · 2021-01-13T03:30:16Z

It looks like a bug. When I do not set the AnchorGenerator() in FasterRCNN, the default anchor_sizes in ### detection/faster_rcnn.py line182 shows that 'anchor_sizes = ((32,), (64,), (128,), (512,))' which cause len(cell_anchors) == 5. And I found that in the detection/faster_rcnn.py line120 the anchor_size set '((32, 64, 128, 256, 512), )' and len(cell_anchors) == 1

oke-aditya · 2021-01-13T09:14:23Z

Hi !

I think this error is fixed on master. #2971 #2960 #2983 #2947.

This would be error message on master / next release

if not (len(grid_sizes) == len(strides) == len(cell_anchors)):
    raise ValueError("Achors should be Tuple[Tuple[int]] because each feature "
    "map could potentially have different sizes and aspect ratios. "
    "There needs to be a match between the number of "
    "feature maps passed and the number of sizes / aspect ratios specified.")

In short, you need to pass a Tuple[Typle[int]] instead of a Tuple[int] to Anchor Generator.
This was done to avoid potentially bad results.

Also, I think we should change Line 121 from FRCNN to Tuple[Tuple[]] ?

It think that above line is causing confusion

datumbox · 2021-01-15T17:27:48Z

@alpha-gradient As @oke-aditya mentioned, the error message has been updated to make the situation less confusing.

Here is a simplified version of the code that you are quoting:

backbone = torchvision.models.mobilenet_v2(pretrained=True).features
backbone.out_channels = 1280

anchor_generator = AnchorGenerator(sizes=((32, 64, 128, 256, 512),),
                                   aspect_ratios=((0.5, 1.0, 2.0),))

model = FasterRCNN(backbone, num_classes=2, rpn_anchor_generator=anchor_generator)

The above snippet uses sizes=((32, 64, 128, 256, 512),), or in other words defines 1 level/group of 5 anchor-sizes. Why 1 level/group? Because the backbone provides only 1 ouput.

On the other hand the default anchors used in faster-rcnn is ((32,), (64,), (128,), (256,), (512,)) which means we have 5 levels/groups with 1 anchor size:

vision/torchvision/models/detection/faster_rcnn.py

Lines 186 to 188 in 8ebfd2f

    
           anchor_sizes = ((32,), (64,), (128,), (256,), (512,)) 
        
           aspect_ratios = ((0.5, 1.0, 2.0),) * len(anchor_sizes) 
        
           rpn_anchor_generator = AnchorGenerator(

Why is that? This is because by default it uses a Feature Pyramid as a backbone which returns 5 outputs (intermediate layers of the original backbone).

The error message that you got basically indicates that the number of outputs on the backbone should match the number of levels of anchor sizes.

fmassa · 2021-01-20T11:06:09Z

Closing following @oke-aditya and @datumbox great answers.

datumbox added the question label Jan 15, 2021

fmassa closed this as completed Jan 20, 2021

oke-aditya mentioned this issue Oct 12, 2021

Add typing to anchor utils #4599

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

assert error len(grid_sizes) == len(strides) == len(cell_anchors) #3246

assert error len(grid_sizes) == len(strides) == len(cell_anchors) #3246

ghost commented Jan 13, 2021

oke-aditya commented Jan 13, 2021 •

edited

Loading

Uh oh!

datumbox commented Jan 15, 2021

Uh oh!

fmassa commented Jan 20, 2021

Uh oh!

assert error len(grid_sizes) == len(strides) == len(cell_anchors) #3246

assert error len(grid_sizes) == len(strides) == len(cell_anchors) #3246

Comments

ghost commented Jan 13, 2021

oke-aditya commented Jan 13, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

datumbox commented Jan 15, 2021

Uh oh!

fmassa commented Jan 20, 2021

Uh oh!

oke-aditya commented Jan 13, 2021 •

edited

Loading