python-sdk/examples/notebook.py at main · aignostics/python-sdk · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
# /// script
# requires-python = ">=3.13"
# dependencies = [
#     "marimo",
#     "aignostics==1.3.0",
# ]
# ///

import marimo

__generated_with = "0.16.5"
app = marimo.App(width="full")


@app.cell
def _():
    import marimo as mo
    return (mo,)


@app.cell
def _(mo):
    mo.md(
        r"""
    # Initialize the Client

    As a first step, you need to initialize the client to interact with the Aignostics Platform. This will execute an OAuth flow depending on the environment you run:
    - In case you have a browser available, an interactive login flow in your browser is started.
    - In case there is no browser available, a device flow is started.

    **NOTE:** By default, the client caches the access token in your operation systems application cache folder. If you do not want to store the access token, set cache_token to False.

    ```python
    import aignostics.client as platform
    # initialize the client
    client = platform.Client(cache_token=True)
    ```
    """
    )
    return


@app.cell
def _():
    import pandas as pd
    from pydantic import BaseModel

    # the following function is used for visualizing the results nicely in this notebook
    def show(models: BaseModel | list[BaseModel]) -> pd.DataFrame:
        if isinstance(models, BaseModel):
            items = [models.model_dump()]
        else:
            items = (a.model_dump() for a in models)
        return pd.DataFrame(items)
    return (show,)


@app.cell
def _():
    from aignostics import platform
    # initialize the client
    client = platform.Client(cache_token=True)
    return client, platform


@app.cell
def _(mo):
    mo.md(
        r"""
    # List our available applications

    Next, let us list the applications that are available in your organization:
    """
    )
    return


@app.cell
def _(client, show):
    applications = client.applications.list()
    # visualize
    show(applications)
    return


@app.cell
def _(mo):
    mo.md(
        r"""
    # List all available versions of an application

    Now that we know the applications that are available, we can list all the versions of a specific application. In this case, we will use the `test-app` as an example. Using the `application_id`, we can list all the versions of the application:
    """
    )
    return


@app.cell
def _(client, show):
    application_versions = client.applications.versions.list(application="test-app")
    # visualize
    show(application_versions)
    return


@app.cell
def _(mo):
    mo.md(
        r"""
    # Inspect the application version details

    Now that we have the list of versions, we can inspect the details of a specific version. While we could directly use the list of application version returned by the `list` method, we want to directly query details for a specific application version. In this case, we will use the `test-app` application and version `0.0.4`:
    """
    )
    return


@app.cell
def _(client):
    test_app_version = client.applications.versions.details(application_id="test-app",application_version="0.0.4")

    # view the `input_artifacts` to get insights in the required fields of the input expected by this application version.
    test_app_version.input_artifacts
    return


@app.cell
def _(mo):
    mo.md(
        r"""
    # Submit an application run

    Now, let's submit an application run. We will use the application ID retrieved in the previous steps. We will not specify the version, which automatically uses the latest version. To submit an application run, we need to provide a payload that consists of 1 or more items. We provide the Pydantic model `InputItem` an item and the data that comes with it:
    ```python
    platform.InputItem(
        external_id="<a unique identifier to associate outputs to this input item>",
        input_artifacts=[platform.InputArtifact]
    )
    ```
    The `InputArtifact` defines the actual data that you provide aka. in this case the image that you want to be processed. The expected values are defined by the application version and have to align with the `input_artifacts` schema of the application version. In the case of this application, we only require a single artifact per item, which is the image to process on. The artifact name is defined as `whole_slide_image`. The `download_url` is a signed URL that allows the Aignostics Platform to download the image data later during processing. In addition to the image data itself, you have to provide the metadata defined in the input artifact schema, i.e., `checksum_base64_crc32c`, `resolution_mpp`, `width_px`, and `height_px`. The metadata is used to validate the input data and is required for the processing of the image. The following example shows how to create an item with a single input artifact:

    ```python
    platform.InputArtifact(
        name="whole_slide_image", # as defined by the application version input_artifact schema
        download_url="<a signed url to download the data>",
        metadata={
            "checksum_base64_crc32c": "<checksum, base64 encoded, crc32c hashed>",
            "resolution_mpp": "<mpp of base layer>",
            "width_px": "<width in pixels>",
            "height_px": "<height in pixels>"
        }
    )
    ```
    """
    )
    return


@app.cell
def _(client, platform):
    application_run = client.runs.submit(
        application_id="test-app",
        items=[
            platform.InputItem(
                external_id="wsi-1",
                input_artifacts=[
                    platform.InputArtifact(
                        name="user_slide",
                        download_url=platform.generate_signed_url("<signed-url>"),
                        metadata={
                            "checksum_base64_crc32c": "AAAAAA==",
                            "resolution_mpp": 0.25,
                            "width_px": 10000,
                            "height_px": 10000,
                        },
                    )
                ],
            ),
        ],
    )
    print(application_run)
    return (application_run,)


@app.cell
def _(mo):
    mo.md(
        r"""
    # Observe the status of the application run and download

    While you can observe the status of an application run directly via the `status()` method and also retrieve the results via the `results()` method, you can also download the results directly to a folder of your choice. The `download_to_folder()` method will download all the results to the specified folder. The method will automatically create a sub-folder in the specified folder with the name of the application run. The results for each individual input item will be stored in a separate folder named after the `external_id` you defined in the `InputItem`.

    The method downloads the results for a slide as soon as they are available. There is no need to keep the method running until all results are available. The method will automatically check for the status of the application run and download the results as soon as they are available. If you invoke the method on a run you already downloaded some results before, it will only download the missing artifacts.
    """
    )
    return


@app.cell
def _(application_run):
    download_folder = "/tmp/"
    application_run.download_to_folder(download_folder)
    return


@app.cell
def _(mo):
    mo.md(
        r"""
    # Continue to retrieve results for an application run

    In case you just submitted an application run and want to check on the results later or you had a connection loss, you can simply initialize an application run object via its `run_id`. If you do not have the `run_id` anymore, you can simply list all currently running application versions via the `client.runs.list()` method. The `run_id` is part of the `ApplicationRun` object returned by the `list()` method. You can then use the `download_to_folder()` method to continue downloading the results.
    """
    )
    return


@app.cell
def _(client):
    # list currently running applications
    application_runs = client.runs.list()
    for run in application_runs:
        print(run)
    return


@app.cell
def _(mo):
    mo.md(
        r"""
    from aignostics.platform.resources.runs import ApplicationRun
    application_run = ApplicationRun.for_run_id("<run_id>")
    # download
    download_folder = "/tmp/"
    application_run.download_to_folder(download_folder)
    """
    )
    return


if __name__ == "__main__":
    app.run()