Concatenate torchvision.datasets.FakeData with another dataset -> cannot load it #3517

zhifengkong · 2021-03-06T10:06:48Z

🐛 Bug

If you concatenate a dataset such as CIFAR10 with FakeData and iterate over the result with a DataLoader, you get the following error:

AttributeError: 'int' object has no attribute 'numel'

To Reproduce

Steps to reproduce the behavior:

cifar_dataset = torchvision.datasets.CIFAR10(...)
fake_dataset = torchvision.datasets.FakeData(...)
train_data = torch.utils.data.ConcatDataset([cifar_dataset, fake_dataset])
train_loader = torch.utils.data.DataLoader(train_data, ...)
for data in train_loader:  # raises the AttributeError shown above
    ...

Additional context

This happens because the labels in CIFAR10 are Python ints while the labels in FakeData are tensors. When samples from both datasets are collated into one batch, the batch labels look like [0,1,2,3,tensor(0),3,4,5,6,tensor(2)...].
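The mismatch described above can be reproduced without either dataset. The following is a minimal sketch, assuming only that PyTorch is installed; the label values are illustrative stand-ins, not taken from CIFAR10 or FakeData. It calls default_collate, the function a DataLoader uses by default to build a batch. The exact exception depends on the PyTorch version and on whether collation runs inside a worker process, so the sketch catches several types rather than asserting one message.

```python
import torch
from torch.utils.data.dataloader import default_collate

# Stand-ins for the two label types (values are illustrative assumptions).
int_label = 3                                      # CIFAR10-style: Python int
tensor_label = torch.tensor(2, dtype=torch.long)   # FakeData-style: 0-dim long tensor

# Batches with a single label type collate without problems.
ints_batch = default_collate([int_label, 5])
tensors_batch = default_collate([tensor_label, tensor_label])

# A mixed batch that starts with a tensor is treated as a batch of tensors,
# so the int element cannot be stacked and collation fails.
try:
    default_collate([tensor_label, int_label])
    mixed_failed = False
except (AttributeError, TypeError, RuntimeError) as exc:
    mixed_failed = True
    print(type(exc).__name__, exc)
```

Because collation dispatches on the type of the first element, whether a mixed batch fails can depend on the order in which samples are drawn, which is part of why the problem is hard to track down.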

I can work around this bug by passing target_transform=int when I construct fake_dataset. However, the error is very hard to debug. I think the default target type in the FakeData source code should be int instead of a long tensor.

The relevant line is in the __getitem__ function here:
https://pytorch.org/vision/0.8/_modules/torchvision/datasets/fakedata.html#FakeData
target = torch.randint(0, self.num_classes, size=(1,), dtype=torch.long)[0]
This is a long tensor. It should be an int.

cc @pmeier @fmassa @vfdev-5


fmassa · 2021-03-09T11:12:44Z

Yes, the target should be an int, and that was the case in the first version of torchvision, but with the introduction of scalar tensors in PyTorch the result of that snippet became a tensor.

I'm happy to accept a PR adding a .item() call to the aforementioned line to fix the issue.

target -> target.item() so it's an int instead of a long tensor
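The effect of the proposed change can be checked in isolation. This is a minimal sketch, assuming PyTorch is installed; num_classes is an illustrative local variable standing in for self.num_classes, and the snippet is not the torchvision source itself.

```python
import torch

num_classes = 10  # illustrative value, stands in for self.num_classes

# Same expression as the line quoted in the issue:
# indexing a 1-element tensor yields a 0-dim long tensor, not an int.
target = torch.randint(0, num_classes, size=(1,), dtype=torch.long)[0]

# Proposed fix: .item() unwraps the 0-dim tensor into a plain Python int.
target_int = target.item()

# The target_transform=int workaround from the issue amounts to this call.
target_via_int = int(target)

print(type(target).__name__, type(target_int).__name__)
```

Both conversions give the same Python int, so labels from FakeData then have the same type as the int labels of CIFAR10 when the two datasets are batched together.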

avijit9 · 2021-03-18T17:19:35Z

Anybody working on this? I can send a PR otherwise.

pmeier · 2021-03-18T17:44:37Z

@avijit9 Go ahead!

avijit9 · 2021-03-22T03:50:26Z

@pmeier Shouldn't this issue be closed?

pmeier · 2021-03-22T06:15:44Z

Indeed it should. If the PR contains a certain keyword together with the issue number, GitHub will close the issue automatically when the PR is merged.

You used a keyword in your PR that is not recognized by GitHub:

solves #3517

zhifengkong changed the title ~~torchvision.datasets.FakeData target type error~~ torchvision.datasets.FakeData concatenated with another dataset -> cannot load it Mar 6, 2021

zhifengkong changed the title ~~torchvision.datasets.FakeData concatenated with another dataset -> cannot load it~~ Concatenate torchvision.datasets.FakeData with another dataset -> cannot load it Mar 6, 2021

vfdev-5 transferred this issue from pytorch/pytorch Mar 6, 2021

vfdev-5 added the module: datasets label Mar 6, 2021

fmassa added the bug label Mar 9, 2021

pmeier added the help wanted label Mar 9, 2021

zhifengkong added a commit to zhifengkong/vision that referenced this issue Mar 12, 2021

Fix target data type (pytorch#3517)

cc6c855

target -> target.item() so it's an int instead of a long tensor

avijit9 mentioned this issue Mar 18, 2021

.item() added to the 'target' variable in fakedataset.py #3587

Merged

pmeier closed this as completed Mar 22, 2021
