Closed
Description
There are now three separate occasions of bugs in the prototype datasets that were not picked up by our CI.
- fix
fromfile
on windows #4980 - Fix DTD prototype dataset on Windows #5227
- fix eurosat prototype mock data setup #5549
We didn't notice them, because they happen on Windows on macOS, but we only run the prototype tests on Linux. Given that the prototypes work with files and the path handling is different on all platforms, this can be problematic.
The torchdata team brought the failures to our attention, since they run our dataset tests on the full matrix. I propose we also run the prototype datasets tests on Windows and macOS to avoid these post mortems. To limit the extra needed CI resources I would only run the tests only on Python3.7.
That means we are looking at 2 extra CI runs per PR with roughly 10 minutes runtime.