Fix iterable dataset dict num proc shards #8062

Open
HaukurPall wants to merge 2 commits into huggingface:main from HaukurPall:fix-iterable-dataset-dict-num-proc-shards

@HaukurPall
Contributor

If the number of shards for any split is lower than num_proc, IterableDatasetDict.push_to_hub crashes.

The num_proc handling added here follows the handling in IterableDataset.push_to_hub as closely as possible.
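As an illustration of the idea only, here is a minimal sketch of capping num_proc per split so that it never exceeds that split's shard count. The helper `clamp_num_proc` and the `shards_per_split` mapping are hypothetical names made up for this example; this is not the code in the PR or in the datasets library, and the actual fix may also log a warning or differ in other details.

```python
from typing import Dict, Optional


def clamp_num_proc(num_proc: Optional[int], num_shards: int) -> Optional[int]:
    """Hypothetical helper: cap num_proc at the number of shards in a split.

    Assumption: None means "no multiprocessing" and is passed through unchanged.
    """
    if num_proc is None:
        return None
    # A worker without a shard to process has nothing to do, so never
    # request more workers than there are shards.
    return min(num_proc, num_shards)


# Hypothetical shard counts for each split of a dataset dict.
shards_per_split: Dict[str, int] = {"train": 8, "validation": 1}
requested_num_proc = 4

# Each split gets its own effective num_proc.
effective = {
    split: clamp_num_proc(requested_num_proc, n_shards)
    for split, n_shards in shards_per_split.items()
}
```

Under these assumptions, the "validation" split with a single shard would run with one process while "train" keeps the requested four.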
