1 change: 1 addition & 0 deletions docs/index.md
@@ -113,6 +113,7 @@ However, once you have too much data to fit into memory, for whatever reason, th
:maxdepth: 1

api.md
zarr-configuration.md
changelog.md
contributing.md
references.md
46 changes: 46 additions & 0 deletions docs/zarr-configuration.md
@@ -0,0 +1,46 @@
# Zarr Configuration

If you are using a local file system, use {doc}`zarrs-python <zarrs:index>`:

```python
import zarr
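# Route chunk encoding/decoding through the Rust-based zarrs codec pipeline.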
zarr.config.set({"codec_pipeline.path": "zarrs.ZarrsCodecPipeline"})
```

Otherwise, use {mod}`zarr` normally without {doc}`zarrs-python <zarrs:index>`, which does not support, for example, remote stores.

## `zarrs` Performance

Please see the {doc}`zarrs-python <zarrs:index>` docs for more information, but there are two important settings to consider:

```python
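# The two settings discussed below: the shared thread pool size and
# whether reads bypass the page cache.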
zarr.config.set({
"threading.max_workers": None,
"codec_pipeline": {
"direct_io": False
}
})
```

**Collaborator:** Can we run a quick benchmark to see to what extent this affects performance?

**Collaborator (author):** I added a gist that highlights the issue; I don't think we can do much more than make people aware of the problem. As I said, `direct_io` should not harm performance.

The `threading.max_workers` setting controls how many threads `zarrs` uses, and by extension, our data loader.
This parameter is global and controls both the Rust parallelism and the Python parallelism.
If you notice thrashing or similar thread-oversubscription behavior, please open an issue.
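
Purely as an illustration (the fraction of cores below is an assumption, not a recommendation from this guide), the shared pool could be capped explicitly if you suspect oversubscription:

```python
import os

import zarr

# Illustrative cap: use roughly half of the available cores for the shared
# zarrs/data-loader thread pool (adjust to your workload).
zarr.config.set({"threading.max_workers": max(1, (os.cpu_count() or 2) // 2)})
```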

Some **Linux** file systems' [performance may suffer][] from the high level of parallelism combined with a full page cache in RAM.
To bypass the page cache, use `direct_io`; it should not otherwise affect performance.
If this setting is enabled on a system that does not support `direct_io`, file reading falls back to normal buffered I/O.
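
For example, on a local Linux file system the two settings might be combined as follows; this is only a sketch, and whether `direct_io` helps depends on your file system:

```python
import zarr

# Illustrative settings for a local Linux file system (not a recommendation):
# keep the default thread count and bypass the page cache via direct I/O;
# reads fall back to buffered I/O if the file system does not support it.
zarr.config.set({
    "threading.max_workers": None,
    "codec_pipeline": {
        "path": "zarrs.ZarrsCodecPipeline",
        "direct_io": True,
    },
})
```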

## `zarr-python` Performance

In this case, the store of interest is likely in the cloud.
Please see zarr-python's {doc}`zarr:user-guide/config` for more information; aside from the above-mentioned `threading.max_workers`, the setting most likely to be of interest is `async.concurrency` (shown in the snippet below).

Comment on lines +35 to +36

**Collaborator:** We should quickly check the performance without zarrs. Last time I checked, you need much bigger chunk sizes without zarrs. This will probably be the case as well if you work with a store in the cloud (much higher latency, so a bigger package size could be beneficial). Just as some guidelines for the user.

**Collaborator (author):** How do you want to proceed here? Create new benchmarks? Can we make an issue for this? I don't think this guide is making any recommendations; it's more just so users have the information.

```python
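# Maximum number of concurrent async operations; 64 is the default.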
zarr.config.set({"async.concurrency": 64})
```

It is 64 by default.
See the [zarr page on concurrency][] for more information.
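
Purely as an illustrative sketch (the value 128 is an assumption, not a benchmarked recommendation), the concurrency could be raised when reading from a high-latency remote store:

```python
import zarr

# Illustrative value only: allow more in-flight requests against a
# high-latency remote store; the default is 64.
zarr.config.set({"async.concurrency": 128})
```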

[performance may suffer]: https://gist.github.com/ilan-gold/705bd36329b0e19542286385b09b421b
[zarr page on concurrency]: https://zarr.readthedocs.io/en/latest/user-guide/consolidated_metadata/#synchronization-and-concurrency