**File:** docs/how-to/scientific-data/landscape-guide.md (+20/-20)
---
title: Scientific data and IPFS landscape guide
description: an overview of the problem space, available tools, and architectural patterns for publishing and working with scientific data using IPFS.
---

# Scientific data and IPFS landscape guide
Scientific data and IPFS are naturally aligned: research teams need to share large datasets across institutions, verify data integrity, and ensure resilient access. From sensor networks to global climate modeling efforts, scientific communities are using IPFS content addressing and peer-to-peer distribution to solve problems traditional infrastructure can't.
In this guide, you'll find an overview of the problem space, available tools, and architectural patterns for publishing and working with scientific data using IPFS.
## A landscape in flux
Science advances through collaboration, yet the infrastructure for sharing scientific data has historically developed in silos. Different fields adopted different formats, metadata conventions, and distribution mechanisms.
This fragmentation means there is no single "right way" to publish and share scientific data. Instead, this is an area of active innovation, with new tools and conventions emerging as communities identify common needs. Standards like [Zarr](https://zarr.dev) represent convergence points where different fields have found common ground.
This guide surveys the landscape and available tooling, but the right approach for your project depends on your specific constraints: the size and structure of your data, your collaboration patterns, your existing infrastructure, and your community's conventions. The goal is to help you understand the options so you can make informed choices.
## The nature of scientific data
Scientific data originates from a variety of sources. In the geospatial field, data is collected by sensors, measuring instruments, camera systems, and satellites. This data is commonly structured as multidimensional arrays (tensors), representing measurements across dimensions like time, latitude, longitude, and altitude.
Key characteristics of scientific data include:
- **Metadata-rich**: Extensive contextual information accompanies the raw measurements
- **Collaborative**: Research often involves multiple institutions and scientists sharing and building upon datasets
## The importance of open data access
As hinted above, open access to scientific data accelerates research, enables reproducibility, and maximizes the return on public investment in science. Organizations worldwide have recognized this, leading to mandates for open data sharing in publicly funded research.
These criteria are by no means exhaustive; initiatives like FAIR, for example, define further principles for open data sharing.
With that in mind, the next section will look at how these ideas come together with IPFS.
## The benefits of IPFS for scientific data
IPFS addresses several pain points in scientific data distribution:
To get a better sense of how these ideas, central to IPFS's design, are applied by the scientific community, it's worth looking at the [ORCESTRA campaign case study](../../case-studies/orcestra.md), which uses IPFS to reap these benefits.
## Architectural patterns
### CID-centric verifiable data management
Ultimately, the choice between these approaches to content-addressed data management depends on questions like:
- How important is it to maintain a copy of the data in a content-addressed format? If no public publishing is expected and you only need integrity checks, you may choose not to store a full content-addressed replica and instead compute hashes on demand.
- What libraries and which programming languages will you use to interact with the data? For example, Python’s xarray library, via fsspec, can read directly from a local IPFS gateway using [`ipfsspec`](https://github.com/fsspec/ipfsspec).
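On the first question, note that Kubo can compute CIDs without keeping a content-addressed copy. A sketch (the directory name is a placeholder, not from this guide):

```shell
# Compute the root CID of a dataset for integrity checks only:
# --only-hash hashes the content but writes no blocks to the datastore.
ipfs add --recursive --only-hash --quieter my-dataset.zarr
```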
### Single publisher
A single institution runs Kubo nodes to publish and provide data. Users retrieve via gateways or their own nodes.
### Collaborative publishing
Multiple institutions coordinate to provide the same datasets:
- Permissionless: a single writer with multiple follower providers
- Coordination can happen out of band, for example via a shared pinset on GitHub. The original publisher must ensure their data is provided, but once it's added to the pinset, others can replicate it.
### Connecting to existing infrastructure
IPFS can complement existing data infrastructure:
- STAC catalogs can include IPFS CIDs alongside traditional URLs
- Data portals can offer IPFS as an alternative retrieval method
- CI/CD pipelines can automatically add new data to IPFS nodes
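As an illustration of the first bullet, a STAC asset could carry an IPFS URI next to its HTTPS one. The field layout below is a sketch: the `alternate` entry and both hrefs are illustrative assumptions, not prescribed by this guide or the STAC core spec.

```json
{
  "assets": {
    "data": {
      "href": "https://example.org/observations.zarr",
      "alternate": {
        "ipfs": { "href": "ipfs://<dataset-cid>" }
      }
    }
  }
}
```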
## Geospatial format evolution: from NetCDF to Zarr
The scientific community has long relied on formats like NetCDF, HDF5, and GeoTIFF for storing multidimensional array data (also referred to as tensors). While these formats served research well, they were designed for local filesystems and face challenges in the cloud and distributed environments that have become the norm over the last decades, a trend driven both by growing dataset sizes and by cloud and distributed systems enabling the storage and processing of larger volumes of data.
### Limitations of traditional formats
NetCDF and HDF5 interleave metadata with data, requiring large sequential reads to access metadata before reaching the data itself. This creates performance bottlenecks when accessing data over networks, whether that's cloud storage or a peer-to-peer network.
### The rise of Zarr
[Zarr](https://zarr.dev/) has emerged as a cloud-native format optimized for distributed storage:
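The bullets that follow this sentence are elided in this excerpt. At a high level, a Zarr store is just a directory tree of small chunk files plus JSON metadata, which is what makes it amenable to per-chunk addressing and retrieval. An illustrative layout (names are assumptions, following Zarr v3 conventions):

```text
my-dataset.zarr/
    zarr.json            # store-level metadata (Zarr v3)
    temperature/
        zarr.json        # array metadata: shape, chunk shape, dtype
        c/0/0/0          # one chunk stored as one small object/file
```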
Metadata in scientific datasets serves to make the data self-describing.
[**GeoZarr**](https://github.com/zarr-developers/geozarr-spec) is a specification for storing geospatial raster/grid data in the Zarr format. It defines conventions for how to encode coordinate reference systems, spatial dimensions, and other geospatial metadata within Zarr stores. It's conceptually downstream of the ideas in CF CDM (from the [netCDF ecosystem](https://docs.unidata.ucar.edu/netcdf-java/5.2/userguide/common_data_model_overview.html)), but designed for the Zarr ecosystem.
## Ecosystem tooling
### Organizing content-addressed data
#### UnixFS and CAR files
UnixFS is the default format for representing files and directories in IPFS. It chunks large files for incremental verification and parallel retrieval.
To learn more about how to use MFS to organize your data, check out the MFS guide.
[IPFS Cluster](https://ipfscluster.io/) is built on top of Kubo for multi-node deployments: it coordinates pinning across a set of Kubo nodes, ensuring data redundancy and availability, and supports the [Pinning Service API spec](https://ipfs.github.io/pinning-services-api-spec/).
#### Pinning services
Third-party pinning services provide managed infrastructure for persistent storage, useful when you don't want to run your own nodes.
TODO: link to pinning services list in docs
```python
# Sketch: the arguments to open_dataset were elided in this excerpt; the URL
# and engine below are illustrative assumptions (ipfsspec resolves ipfs:// URLs).
import xarray as xr

ds = xr.open_dataset(
    "ipfs://<dataset-cid>",  # placeholder CID
    engine="zarr",
)
```
### Discovery, metadata, and data portals: from discovery all the way to retrieval
TODO: add an intro in the form of a user journey of a scientist looking for data, all the way to retrieving it.
Content discovery is a loaded term that can mean related, albeit distinct, concepts:
- Human-centric
- **Content discovery**: also commonly known as **content routing**, this refers to finding providers (nodes serving the data) for a given CID, including their network addresses. By default, IPFS supports several content routing systems: the Amino DHT, IPNI, and Delegated Routing over HTTP as a common interface for interoperability.
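For example, provider records for a CID can be fetched from a Delegated Routing V1 endpoint with plain HTTP. The host below is a public instance and the CID a placeholder; treat the exact URL as an assumption:

```shell
# Ask a Delegated Routing V1 HTTP endpoint who provides a given CID.
curl "https://delegated-ipfs.dev/routing/v1/providers/<cid>"
```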
### CID discovery
When using content-addressed systems like IPFS, a new challenge emerges: how do users discover the Content Identifiers (CIDs) for datasets they want to access?
STAC has a web browser, making navigation discovery https://github.com/radiantea
-->
## Next steps
- [Publishing Zarr datasets with IPFS](./publish-geospatial-zarr-data.md): a hands-on guide to publishing your first dataset
**File:** docs/how-to/scientific-data/publish-geospatial-zarr-data.md (+26/-26)
---
title: Publish geospatial Zarr data with IPFS
description: Learn how to publish geospatial datasets using IPFS and Zarr for decentralized distribution, data integrity, and open access.
---
# Publish geospatial Zarr data with IPFS
In this guide, you will learn how to publish public geospatial data sets using IPFS, with a focus on the [Zarr](https://zarr.dev/) format. You'll learn how to leverage decentralized distribution with IPFS for better collaboration, data integrity, and open access.
If you are interested in a real-world example following the patterns in this guide, see the ORCESTRA campaign case study.
- [Why IPFS for geospatial data?](#why-ipfs-for-geospatial-data)
- [Prerequisites](#prerequisites)
- [Step 1: Prepare your Zarr data set](#step-1-prepare-your-zarr-data-set)
- [Step 2: Add your data set to IPFS](#step-2-add-your-data-set-to-ipfs)
- [Step 3: Organizing your data](#step-3-organizing-your-data)
- [Step 4: Verify providing status](#step-4-verify-providing-status)
- [Step 5: Content discovery](#step-5-content-discovery)
- [Option A: Share the CID directly](#option-a-share-the-cid-directly)
- [Option B: Use IPNS for updatable references](#option-b-use-ipns-for-updatable-references)
- [Option C: Use DNSLink for human-readable URLs](#option-c-use-dnslink-for-human-readable-urls)
- [Accessing published data](#accessing-published-data)
- [Choosing your approach](#choosing-your-approach)
- [Reference](#reference)
## Why IPFS for geospatial data?
Geospatial data sets such as weather observations, satellite imagery, and sensor readings are typically stored as multidimensional arrays, also commonly known as tensors.
Before starting, ensure you have:
- A Zarr data set ready for publishing
- Basic familiarity with the command line
- [Kubo](../../install/command-line.md) or [IPFS Desktop](../../install/ipfs-desktop.md) installed on a machine.
:::callout
See the [NAT and port forwarding guide](../nat-configuration.md) for more information on how to configure port forwarding so that your IPFS node is publicly reachable, thus allowing reliable retrievability of data by other nodes.
:::
## Step 1: Prepare your Zarr data set
When preparing your Zarr data set for IPFS, aim for approximately 1 MiB chunks to align with IPFS's 1 MiB maximum block size. While this is not a strict requirement, using larger Zarr chunks will cause IPFS to split them into multiple blocks, potentially increasing retrieval latency.
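As a quick sanity check on chunk sizing, you can compute a chunk's byte footprint from its shape and element size. The 64 x 64 x 64 float32 chunk below is an illustrative assumption, not a recommendation from this guide:

```shell
# Bytes per chunk = elements per chunk * bytes per element.
# A 64 x 64 x 64 chunk of float32 (4-byte) values:
echo $((64 * 64 * 64 * 4))  # prints 1048576, i.e. exactly 1 MiB
```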
:::callout
Chunking in Zarr is a nuanced topic beyond the scope of this guide.
:::
## Step 2: Add your data set to IPFS
Add your Zarr folder to IPFS using the `ipfs add` command:
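The command block itself is elided in this excerpt; based on the surrounding text (which mentions `--quieter`), it likely resembles this sketch, with a placeholder directory name:

```shell
# Recursively add the Zarr directory; --quieter prints only the root CID.
ipfs add --recursive --quieter my-dataset.zarr
```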
The `--quieter` flag outputs only the root CID, which identifies the complete dataset.
> **Note:** Check out the [lifecycle of data in IPFS](../../concepts/lifecycle.md) to learn more about how merkleizing, pinning, and providing work under the hood.
## Step 3: Organizing your data
Two options help manage multiple datasets on your node:
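The details of the two options are elided in this excerpt; one of them is MFS (the Mutable File System). A sketch of linking an already-added dataset under an MFS path, with a placeholder CID:

```shell
# Create an MFS directory and link an existing dataset into it by CID.
# MFS references the existing blocks; nothing is copied.
ipfs files mkdir -p /datasets
ipfs files cp /ipfs/<dataset-cid> /datasets/halo
```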
```shell
ipfs files stat --hash /datasets/halo
```
`bafybeihqixf5ew7mfr74bzb74qiw2mgtnytabnpzjnf5xeejzq4p2ocygu` is a new CID representing the combined dataset containing all three HALO flight datasets. The original CIDs are referenced, not copied, so no data is duplicated.
## Step 4: Verify providing status
After adding, Kubo continuously announces your content to the network. Check the status:
```shell
ipfs provide stat
```
For detailed diagnostics, see the [provide system documentation](https://github.com/ipfs/kubo/blob/master/docs/provide-stats.md).
## Step 5: Content discovery
Now that your data is available on the public network, the next step is making it discoverable to others. Choose a sharing approach based on your needs:
If you want to share a stable identifier but be able to update the underlying dataset, create an [IPNS](https://docs.ipfs.tech/concepts/ipns/) identifier and share that instead. This is useful for datasets that get updated regularly — users can bookmark your IPNS name and always retrieve the latest version.
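A sketch of that flow with Kubo (the key name and CID are placeholders):

```shell
# Generate a dedicated key for the dataset, then publish the CID under it.
ipfs key gen halo-dataset
ipfs name publish --key=halo-dataset /ipfs/<dataset-cid>
```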
```shell
ipfs name publish /ipfs/<new-dataset-cid>
```
IPNS is supported by all the retrieval methods in the [Accessing published data](#accessing-published-data) section below. Keep in mind that IPNS name resolution adds latency to the retrieval process.
### Option C: Use DNSLink for human-readable URLs
Link a DNS name to your CID by adding a TXT record:
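The record itself is elided here; by the DNSLink convention, it is a TXT record on the `_dnslink` subdomain. The domain matches the examples that follow, and the CID is a placeholder:

```
_dnslink.data.example.org. IN TXT "dnslink=/ipfs/<dataset-cid>"
```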
Users can then access your data using one of the following methods:
- With Kubo: `ipfs cat /ipns/data.example.org/zarr.json`
- Using ipfsspec in Python as detailed below in [Python with ipfsspec](#python-with-ipfsspec), which also supports IPNS names, so you can use `ipns://data.example.org/zarr.json` directly.
## Accessing published data
Once published, users can access your Zarr datasets through multiple methods:
### IPFS HTTP gateways
See the [retrieval guide](../../quickstart/retrieve.md).
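As a quick illustration (the gateway host and CID are placeholders; see the linked guide for details), fetching the store's root metadata through a gateway looks like:

```shell
# Fetch the Zarr store's top-level metadata file via a public HTTP gateway.
curl "https://ipfs.io/ipfs/<dataset-cid>/zarr.json"
```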
```javascript
import { verifiedFetch } from '@helia/verified-fetch'
```
**File:** docs/quickstart/retrieve.md (+1/-1)
To fetch the CID using an IPFS gateway is as simple as loading one of the following URLs:
In this quickstart guide, you learned the different approaches to retrieving CIDs from the IPFS network and how to pick the most appropriate method for your specific needs.
You then fetched the image that was pinned in the [publishing with a pinning service quickstart guide](./pin.md) using an IPFS Kubo node and an IPFS Gateway.