Support partial reading of Zarr datasets #106

krisfed · 2025-06-19T15:23:18Z

This is a first draft of allowing partial reads from Zarr datasets.

The proposed interface is as follows (somewhat modeled on h5read and ncread):

>> fullData = zarrread("grp_v2/smallArr") % reading full dataset works the same

fullData =

     1     4     7    10
     2     5     8    11
     3     6     9    12

>> d = zarrread("grp_v2/smallArr", Start=[2,3]) % read starting from the 2nd row and 3rd column

d =

     8    11
     9    12

>> d = zarrread("grp_v2/smallArr", Count=[2,4]) % read 2 elements in 1st dimension and 4 in 2nd dimension

d =

     1     4     7    10
     2     5     8    11

>> d = zarrread("grp_v2/smallArr", Stride=[1,2]) % read every element in 1st dimension and every second one in 2nd dimension

d =

     1     7
     2     8
     3     9

>> d = zarrread("grp_v2/smallArr", Start=[1,2], Stride=[1,2], Count=[2,2]) % use any combination of Start/Stride/Count

d =

     4    10
     5    11

The number of elements in Start/Stride/Count must be the same as the number of dimensions:

>> d = zarrread("grp_v2/smallArr", Start=[1,2,3])
Error using Zarr.processPartialReadParams (line 86)
Number of elements in Start must be the same as the number of Zarr array dimensions.

Error in Zarr/read (line 260)
            start = Zarr.processPartialReadParams(start, info.shape,...
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Error in zarrread (line 36)
data = zarrObj.read(options.Start, options.Count, options.Stride);
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Only exception is if the dataset is a vector, in which case scalar Start/Stride/Count are allowed:

>> d = zarrread("grp_v2/vectorData")

d =

     1     2     3     4     5     6     7     8     9    10

>> d = zarrread("grp_v2/vectorData", Start=3, Stride=2)

d =

     3     5     7     9

…d reshape them into row vectors

codecov · 2025-06-19T16:08:06Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.07%. Comparing base (1ad7173) to head (7889203).
Report is 8 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #106      +/-   ##
==========================================
+ Coverage   96.66%   97.07%   +0.40%     
==========================================
  Files           8        8              
  Lines         210      239      +29     
==========================================
+ Hits          203      232      +29     
  Misses          7        7

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

PythonModule/ZarrPy.py

test/tZarrRead.m

jm9176 · 2025-06-19T18:07:55Z

@krisfed Please create a test task geck once you are done submitting all of your changes.

jhughes-mw · 2025-06-20T14:44:42Z

Zarr.m

+            arguments (Output)
+                newParams (1,:) int64
+            end


Surprised considering the name of the function, you didn't also use arguments block to validate inputs

Yeah I am doing the basic arguments block validation for Start/Stride/Count in zarrread (better error message and faster erroring out), so not much left to validate here. Not great to validate different things in different places though..

jhughes-mw

Looks Good.

krisfed added 2 commits June 19, 2025 11:20

First draft of partial read

792d144

mustBeRow not available before R2024b - for now allow other shapes an…

1bd22db

…d reshape them into row vectors

krisfed added 2 commits June 19, 2025 12:26

Adding test for scalar Start/Stride/Count

603eb9b

Included info about defaults for Start/Stride/Count in M-help

6f763b9

krisfed marked this pull request as ready for review June 19, 2025 16:54

krisfed requested review from jm9176 and jhughes-mw June 19, 2025 16:54

krisfed changed the title ~~First draft of partial read~~ Support partial reading of Zarr datasets Jun 19, 2025

jm9176 approved these changes Jun 19, 2025

View reviewed changes

PythonModule/ZarrPy.py Show resolved Hide resolved

test/tZarrRead.m Outdated Show resolved Hide resolved

jhughes-mw reviewed Jun 20, 2025

View reviewed changes

jhughes-mw approved these changes Jun 20, 2025

View reviewed changes

krisfed added 2 commits June 20, 2025 11:49

Error message update

9936f79

Address feedback and add doc

7889203

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support partial reading of Zarr datasets #106

Support partial reading of Zarr datasets #106

Uh oh!

krisfed commented Jun 19, 2025 •

edited

Loading

Uh oh!

codecov bot commented Jun 19, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

jm9176 commented Jun 19, 2025

Uh oh!

jhughes-mw Jun 20, 2025

Uh oh!

krisfed Jun 27, 2025

Uh oh!

jhughes-mw left a comment

Uh oh!

Uh oh!

Support partial reading of Zarr datasets #106

Are you sure you want to change the base?

Support partial reading of Zarr datasets #106

Uh oh!

Conversation

krisfed commented Jun 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jun 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

jm9176 commented Jun 19, 2025

Uh oh!

jhughes-mw Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

krisfed Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

jhughes-mw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

krisfed commented Jun 19, 2025 •

edited

Loading

codecov bot commented Jun 19, 2025 •

edited

Loading