`GeneralizedRCNN` raises an exception when encountering a bad bounding box during `forward` #6787

jangop · 2022-10-18T13:12:41Z

vision/torchvision/models/detection/generalized_rcnn.py

Line 95 in 0610b13

torch._assert(

The current implementation of GeneralizedRCNN checks for degenerate bounding boxes during its forward pass. While -- in principle -- that is great, it can make training brittle, especially when randomized data augmentation is involved, because an exception is raised (via assert by torch). I see that the current implementation is experimental (it says # TODO: Move this to a function a couple of lines up) but I am wondering what could be more helpful ways of dealing with those bad bounding boxes:

Raise an explicit exception that can be handled during training to, for example, skip the current batch?
Filter out degenerate bounding boxes?
Keep the current implementation because torch's _assert is great and I just don't understand it properly? 🙄

The text was updated successfully, but these errors were encountered:

datumbox · 2022-10-20T10:00:52Z

Degenerate BBoxes can mess up the training phase of the models and this will lead to a very hard to debug situation. This is why I believe originally this assertion was added. We typically avoid such assertions when possible to improve performance but in cases where the debugging can become extremely difficult we maintain them.

I think the best way to avoid the issue is to filter out degenerate bboxes. Our new Transforms V2 prototype offers a built-in method for doing so by utilizing the ClampBoundingBoxes followed by the RemoveSmallBoundingBoxes. The first will clamp the bboxes to valid values (ensuring xy1<=xy2) and the latter will eliminate any small bboxes that don't pass the expected threshold.

We are preparing a blogpost to announce the Transforms V2 API. Some initial info are recorded at #6753

jangop · 2022-10-21T12:21:01Z

Agreed, “the best way” surely is filtering out degenerate bounding boxes. But the issue of bad data and how to deal with it is old and it will likely stick around. See, for example, nonechucks. Its latest release is from 2019, but I find the concept interesting, and in its README, a few points are outlined for why one might want to deal with bad data on the fly.

I struggle with such issues when trying to rely on frameworks intended to make life easier. The pipeline, in which I now noticed the issue of degenerate bounding boxes, loads data using activeloop deeplake, augments it using albumentations, and performs training using pytorch lightning. In theory, everything should just work. In practice, training crashes because of an odd bounding box and there is nothing I can do about it.

Supposedly, by the way, albumentations filters out bounding boxes, too:

https://albumentations.ai/docs/getting_started/bounding_boxes_augmentation/#min_area-and-min_visibility

This is not-so-easy, though:

albumentations-team/albumentations#1322

If Transforms V2 obsoletes albumentations and turns out more robust: 👍

ehratjon mentioned this issue Oct 3, 2023

GeneralizedRCNN on GPU always detects degenerate bounding boxes #8015

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`GeneralizedRCNN` raises an exception when encountering a bad bounding box during `forward` #6787

`GeneralizedRCNN` raises an exception when encountering a bad bounding box during `forward` #6787

jangop commented Oct 18, 2022

datumbox commented Oct 20, 2022 •

edited

Loading

jangop commented Oct 21, 2022

GeneralizedRCNN raises an exception when encountering a bad bounding box during forward #6787

GeneralizedRCNN raises an exception when encountering a bad bounding box during forward #6787

Comments

jangop commented Oct 18, 2022

datumbox commented Oct 20, 2022 • edited Loading

jangop commented Oct 21, 2022

`GeneralizedRCNN` raises an exception when encountering a bad bounding box during `forward` #6787

`GeneralizedRCNN` raises an exception when encountering a bad bounding box during `forward` #6787

datumbox commented Oct 20, 2022 •

edited

Loading