Fix bugs in cutout training #233

ravinkohli · 2021-05-21T13:52:21Z

This PR fixes the following bugs:

The indices were not being sampled with replacement.
The size was the minimum of the batch size and the number of features, which is not necessary.
Numerical features were being set to -1, whereas 0 makes more sense.
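
The three points above can be pictured with a minimal NumPy sketch. It is not the code from this PR: `row_cutout`, its parameters, and the `patch_ratio` name are hypothetical, keeping -1 as the marker for categorical columns is an assumption, and the sampling mode is left to the caller through `replace` because the description does not spell out the final call.

```python
import numpy as np


def row_cutout(X, numerical_columns, patch_ratio, rng, replace):
    """Mask a random subset of feature columns in a batch (hypothetical helper)."""
    X = X.copy()
    size = X.shape[1]  # number of features, not min(batch size, features)
    n_masked = max(1, int(size * patch_ratio))
    indices = rng.choice(size, n_masked, replace=replace)

    numerical = set(numerical_columns)
    num_idx = np.array([i for i in indices if i in numerical], dtype=int)
    cat_idx = np.array([i for i in indices if i not in numerical], dtype=int)

    X[:, cat_idx] = -1  # assumption: masked categorical columns keep the -1 marker
    X[:, num_idx] = 0   # masked numerical columns are zeroed instead of set to -1
    return X, indices


rng = np.random.default_rng(0)
X = np.ones((4, 6))
masked, indices = row_cutout(
    X, numerical_columns=[0, 1, 2], patch_ratio=0.5, rng=rng, replace=False
)
```

Here the batch has fewer rows than features, so every one of the six columns is still a candidate for masking.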

ArlindKadra · 2021-05-21T14:18:28Z

autoPyTorch/pipeline/components/training/trainer/RowCutMixTrainer.py

@@ -39,7 +39,8 @@ def data_preparation(self, X: np.ndarray, y: np.ndarray,
        # It is unlikely that the batch size is lower than the number of features, but
        # be safe
        size = min(X.shape[0], X.shape[1])


This should also be changed to size = X.shape[1], right?

True, I missed that
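
A small sketch of why the `min` matters, assuming a hypothetical batch that has fewer rows than features. With the old expression the candidate range shrinks to the batch size, so the later columns could never be selected.

```python
import numpy as np

X = np.zeros((4, 10))  # hypothetical batch: 4 rows, 10 features

size_old = min(X.shape[0], X.shape[1])  # limited by the batch size
size_new = X.shape[1]                   # always the number of features

reachable_old = list(range(size_old))   # columns that could be sampled before
print(size_old, size_new)               # 4 10
```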

ArlindKadra · 2021-05-21T15:46:58Z

autoPyTorch/pipeline/components/training/trainer/RowCutOutTrainer.py


-        # We use an ordinal encoder on the tabular data
+        if not isinstance(self.numerical_columns, typing.Iterable):


What if the numerical columns are None? Shouldn't we still continue with only categorical imputing in that case?

Also, if there are only numerical columns, there should be no conversion for categorical ones.

Actually, when there are no numerical columns, the value is not None but an empty list. Indexing with an empty list does not affect the tensor, so this should work.

Auto-PyTorch/autoPyTorch/pipeline/components/training/trainer/base_trainer_choice.py

Lines 340 to 341 in 8b71ee2

        numerical_columns=X['dataset_properties']['numerical_columns'] if 'numerical_columns' in X[
            'dataset_properties'] else None
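
The empty-list argument can be checked with a short sketch. It uses NumPy rather than a torch tensor, and the array contents are made up for illustration. The explicit `dtype=int` matters because an empty array defaults to a float dtype, which is not valid as an index.

```python
import numpy as np

X = np.arange(12, dtype=float).reshape(3, 4)
before = X.copy()

numerical_columns = []                       # dataset with no numerical features
idx = np.array(numerical_columns, dtype=int)  # empty integer index array
X[:, idx] = 0                                 # selects zero columns, so nothing is written

unchanged = np.array_equal(X, before)
print(unchanged)  # True
```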

Is numerical_columns always in dataset_properties?

When it's tabular data, then yes.
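
For reference, the quoted lookup falls back to None when the key is absent, and None is the one case an Iterable check would reject, while an empty list passes. A sketch with hypothetical `dataset_properties` dicts, using `collections.abc.Iterable` in place of the `typing.Iterable` seen in the diff:

```python
from collections.abc import Iterable


def lookup(dataset_properties):
    # Same result as:
    #   d['numerical_columns'] if 'numerical_columns' in d else None
    return dataset_properties.get('numerical_columns')


tabular = {'numerical_columns': []}  # tabular data without numerical features
other = {}                           # key missing entirely

print(isinstance(lookup(tabular), Iterable))  # True
print(isinstance(lookup(other), Iterable))    # False
```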

Fix bugs in cutout training

9278b53

ArlindKadra self-requested a review May 21, 2021 14:17

ArlindKadra reviewed May 21, 2021

Address comments from arlind

8b71ee2

ArlindKadra reviewed May 21, 2021

ArlindKadra merged commit 463c166 into refactor_development_regularization_cocktails May 21, 2021

github-actions bot pushed a commit that referenced this pull request May 21, 2021

Ravin Kohli: Fix bugs in cutout training (#233)

061fb0a

ravinkohli mentioned this pull request Jun 17, 2021

Fix Possible Bug #226

Closed

ravinkohli deleted the fix_cutTrainer branch October 22, 2021 09:36

ravinkohli added a commit that referenced this pull request Dec 8, 2021

Fix bugs in cutout training (#233)

a41a2a3

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Dec 8, 2021

Fix bugs in cutout training (#233)

d0f2875

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Dec 21, 2021

Fix bugs in cutout training (#233)

8b8ba42

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Jan 24, 2022

Fix bugs in cutout training (#233)

f026103

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Jan 28, 2022

Fix bugs in cutout training (#233)

d140442

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Feb 28, 2022

Fix bugs in cutout training (#233)

a831767

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Feb 28, 2022

Fix bugs in cutout training (#233)

e86fbcf

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Mar 9, 2022

Fix bugs in cutout training (#233)

17d18d8

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit to ravinkohli/Auto-PyTorch that referenced this pull request Apr 12, 2022

Fix bugs in cutout training (automl#233)

64b1397

* Fix bugs in cutout training * Address comments from arlind

ravinkohli added a commit that referenced this pull request Jul 26, 2022

Fix bugs in cutout training (#233)

c4b7729

* Fix bugs in cutout training * Address comments from arlind
