Skip to content

Add DTD dataset #5115

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 11 commits into from
Jan 5, 2022
Merged

Add DTD dataset #5115

merged 11 commits into from
Jan 5, 2022

Conversation

pmeier
Copy link
Collaborator

@pmeier pmeier commented Dec 19, 2021

Addresses #5108 and supersedes #743. I've also added the corresponding prototype dataset.

cc @pmeier @bjuncek

@facebook-github-bot
Copy link

facebook-github-bot commented Dec 19, 2021

💊 CI failures summary and remediations

As of commit 7d21152 (more details on the Dr. CI page):



1 failure not recognized by patterns:

Job Step Action
CircleCI unittest_macos_cpu_py3.7 Run tests 🔁 rerun

🚧 4 ongoing upstream failures:

These were probably caused by upstream breakages that are not fixed yet.


This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

Copy link
Member

@NicolasHug NicolasHug left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @pmeier , this looks great. I made some minor comment, the only one I'm still not sure about is whether we should allow fold="all". LMK what you think

Comment on lines +56 to +61
if path.name == "labels_joint_anno.txt":
return 1

return 0
elif path.parents[1].name == "images":
return 2
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

instead of hardcoded 0 1 2, would a small private enum be overkill here? or maybe just hardcoded named constants? No strong opinion but this might help readability

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, that could work. If that is ok with you, I'll put it in my backlog, because we should do that for all datasets that use a Demultiplexer.

Copy link
Member

@NicolasHug NicolasHug left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @pmeier , LGTM with the potential renaming of fold to partition as we just discussed offline.

@pmeier pmeier merged commit 5c9c835 into pytorch:main Jan 5, 2022
@pmeier pmeier deleted the datasets/dtd branch January 5, 2022 14:39
facebook-github-bot pushed a commit that referenced this pull request Jan 8, 2022
Summary:
* add DTD as prototype dataset

* add old style dataset

* add test for old dataset

* fix tests for windows

* add dataset to docs

* remove properties and use pathlib

* Apply suggestions from code review

* fold -> partition

Reviewed By: sallysyw

Differential Revision: D33479261

fbshipit-source-id: d879da77bd33064e57841a5f7ffdcea62c3add36

Co-authored-by: Nicolas Hug <[email protected]>
Co-authored-by: Nicolas Hug <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants