Introduce Textnet backbone (needed for Fast model) #27425
Conversation
Force-pushed from bb5f3b1 to ab2aad5 (compare)
@amyeroberts I have pulled out all the changes for the TextNet backbone needed for the Fast model (#26657) into a separate PR. Requesting a first round of review.
Thanks for adding!
I've done an initial pass; it'll likely take at least one more review round before this is ready for approval. My main comment is about the deletion and addition of model parameters. The priority when adding models to transformers is to make sure that they're easy to read and understand, so the current fused kernel logic should be simplified or removed entirely. Have you run tests to compare running times with and without the fused kernels?
Fair point, I just ran the tests. The fused kernel only saves about 5% of time during eval. I am going to remove the complex logic.
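For reference, the kind of conv+BN fusion being discussed can be checked numerically: at eval time the BatchNorm statistics fold into the preceding conv's weight and bias, so a fused kernel is an optimization, not a behavioral change. A minimal PyTorch sketch of that folding (illustrative only, not the PR's actual code):

```python
import torch
import torch.nn as nn

def fuse_conv_bn(conv: nn.Conv2d, bn: nn.BatchNorm2d) -> nn.Conv2d:
    """Fold eval-mode BatchNorm statistics into the conv's weight and bias."""
    fused = nn.Conv2d(
        conv.in_channels, conv.out_channels, conv.kernel_size,
        stride=conv.stride, padding=conv.padding, bias=True,
    )
    with torch.no_grad():
        # BN(y) = gamma * (y - mean) / sqrt(var + eps) + beta
        scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)
        fused.weight.copy_(conv.weight * scale.reshape(-1, 1, 1, 1))
        conv_bias = conv.bias if conv.bias is not None else torch.zeros_like(bn.running_mean)
        fused.bias.copy_((conv_bias - bn.running_mean) * scale + bn.bias)
    return fused

conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)
bn = nn.BatchNorm2d(8).eval()  # eval mode: use running statistics
x = torch.randn(2, 3, 16, 16)
with torch.no_grad():
    reference = bn(conv(x))
    fused_out = fuse_conv_bn(conv, bn)(x)
```

Since the two paths are numerically equivalent, keeping only the plain conv+BN path in the modeling code costs nothing in correctness.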
@amyeroberts I have addressed almost all of the feedback, but I have questions about a few of the points. Could you give me some more details on them?
Thanks for iterating!
Main comment is about how info is stored in the config. Let me know if any of it isn't clear
@amyeroberts The feedback on configuration refactoring has been addressed.
Thanks again for iterating!
Just a few small comments now. Once resolved we'll be good to merge! Main comment is about the logic in the backbone for selecting the returned feature maps
```python
content = response.text
namespace = {}

exec(content, namespace)
```
Let's not do this - dynamic execution of code is dangerous
Agreed, but this config URL is a Python file (example). Is there a safer way to do this?
@amyeroberts Gentle reminder.
Yes, there's a safer way of doing this: you could upload the config files to the hub and use hf_hub_download.
For instance: https://huggingface.co/docs/huggingface_hub/guides/download#download-a-single-file
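For what it's worth, a minimal sketch of that pattern (the repo id and filename here are placeholders, not the actual checkpoint location):

```python
from huggingface_hub import hf_hub_download

def load_remote_config(repo_id: str, filename: str) -> str:
    """Fetch a single file from a hub repo and return its contents.

    hf_hub_download downloads the file into the local cache and returns
    the local path, so subsequent calls hit the cache instead of the network.
    """
    path = hf_hub_download(repo_id=repo_id, filename=filename)
    with open(path) as f:
        return f.read()

# Hypothetical usage -- substitute the real repo once the configs are uploaded:
# config_text = load_remote_config("some-org/textnet-configs", "textnet_base.json")
```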
I have made this change.
It doesn't seem to have been pushed?
What I have done is push the config file (a Python file) to the hub and download it using hf_hub_download. My understanding is that when the file is downloaded from the hub, it is safer to exec it. Let me know if my understanding is wrong and this has to be fixed some other way.
@amyeroberts I have removed the dynamic execution: I have preprocessed all the Python config files into JSON files. I hope this approach is okay.
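One way such preprocessing can be done without ever exec-ing the upstream files is to walk the AST and extract only literal values. A sketch under the assumption that the configs follow the common `name = dict(...)` style (the inline `config_source` string is a stand-in for a downloaded file):

```python
import ast
import json

# Stand-in for one of the upstream Python config files.
config_source = """
model = dict(
    backbone="textnet",
    image_size=(640, 640),
    num_stages=4,
)
"""

def python_config_to_dict(source: str) -> dict:
    """Extract literal keyword arguments of `dict(...)` assignments, no exec()."""
    tree = ast.parse(source)
    result = {}
    for node in ast.walk(tree):
        if isinstance(node, ast.Assign) and isinstance(node.value, ast.Call):
            call = node.value
            if isinstance(call.func, ast.Name) and call.func.id == "dict":
                name = node.targets[0].id
                # literal_eval only accepts constants, so arbitrary code
                # in the config raises instead of executing
                result[name] = {
                    kw.arg: ast.literal_eval(kw.value) for kw in call.keywords
                }
    return result

config = python_config_to_dict(config_source)
json_text = json.dumps(config)  # tuples become JSON lists
```

This only covers literal-valued configs, which is typically all a conversion script needs; anything fancier in the source file fails loudly rather than silently running.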
Also, the original_pixel_values have been prepared and stored on the hub, to avoid performing preprocessing in the conversion script.
Force-pushed from aa51ff1 to 1f23b19 (compare)
Force-pushed from c433099 to a3427d7 (compare)
@amyeroberts All feedback has been incorporated. Please have a look.
@raghavanone Please make sure to look at the diff before asking for review, to catch any obvious things that should be updated. Opening the files tab, the first thing one sees is READMEs that need to be filled in.
Oh, the rebase onto main must have undone some of these changes. Let me fix and review them again.
@amyeroberts I have fixed the issues coming from the rebase onto main and also validated that all the feedback has been incorporated. Please have a look.
Thanks for iterating and adding this!
Just a few nits to address before merging. The checkpoint paths will also need to be updated to go under the organisation.
```python
TextNetBackbone,
TextNetForImageClassification,
TextNetModel,
is_torch_available,
```
No need to import this here - it's already included in this module (it's used just a few lines above)
```python
@require_torch
@require_vision
class TextNetModelIntegrationTest(unittest.TestCase):
    # @slow
```
Suggested change:
```diff
-    # @slow
+    @slow
```
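For context, `@slow` in `transformers.testing_utils` skips a test unless the `RUN_SLOW` environment flag is set, so commenting it out makes an expensive integration test run in every CI job. A self-contained stdlib approximation of that decorator (the class and test names below are illustrative):

```python
import os
import unittest

def slow(test_case):
    """Rough equivalent of transformers' @slow: run only when RUN_SLOW is truthy."""
    return unittest.skipUnless(
        os.environ.get("RUN_SLOW", "0").lower() in ("1", "true", "yes"),
        "test is slow; set RUN_SLOW=1 to run it",
    )(test_case)

class ExampleIntegrationTest(unittest.TestCase):
    @slow
    def test_heavy_forward_pass(self):
        # placeholder for an expensive checkpoint-loading check
        self.assertTrue(True)
```

With the decorator restored, the integration test still runs on the scheduled slow-CI jobs where `RUN_SLOW=1` is exported.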
```python
    The num of channels in out for the initial convolution layer.
stem_act_func (`str`, *optional*, defaults to `"relu"`):
    The activation function for the initial convolution layer.
image_size (`int`, *optional*, defaults to `[640, 640]`):
```
Types here don't match.
Suggested change:
```diff
-image_size (`int`, *optional*, defaults to `[640, 640]`):
+image_size (`int` or `Tuple[int, int]`, *optional*, defaults to `[640, 640]`):
```
@amyeroberts Let me know how to move the checkpoints under the organisation.
Hi @raghavan, nice work on getting TextNet across the line! For adding the checkpoints to the hub, I’d suggest reaching out to the lead paper author and asking them if they would be happy to host the model weights on the hub under their profile. As the model appears to have been a collab between universities, there isn’t an obvious org for it to go under. @NielsRogge has more experience here, so will have recommendations for everything that needs to be done.
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored.
This is needed for merging the Fast model (#26501).