Skip to content

Sharding in zarr v2 #2346

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
d-v-b opened this issue Oct 12, 2024 · 1 comment
Open

Sharding in zarr v2 #2346

d-v-b opened this issue Oct 12, 2024 · 1 comment
Labels
enhancement New features or improvements

Comments

@d-v-b
Copy link
Contributor

d-v-b commented Oct 12, 2024

Since sharding is implemented as a codec, and because the zarr v2 spec takes a permissive stance to codecs, the sharding codec can be used for zarr v2 arrays, provided a program exists that can interpret that codec, such as any zarr v3 implementation. I can think of a variety of use cases for this (e.g., all use cases imagined for sharding in zarr v3), but the most simple one might be for people with v2 data who want sharding but don't want to change anything else about their workflow.

FWIW I haven't tested what the v3 branch of zarr-python would do with a v2 array with a sharding codec, but IMO failing to open it would be a bug.

@dstansby dstansby added the enhancement New features or improvements label Dec 28, 2024
@Muhammad-Rebaal
Copy link

Hi @d-v-b , @dstansby !

I hope you're doing well!

I’d like to take on this issue and could you assign this issue to me ? According to me to solve this issue , I would test whether Zarr v3 can read Zarr v2 data that uses sharding. If it doesn't work, I would look into fixing that so people can use the sharding feature in Zarr v2 without any problems. If I missed something please let me know !

Thank You !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New features or improvements
Projects
None yet
Development

No branches or pull requests

3 participants