When to validate variable values? #32

benbovy · 2018-03-06T10:57:58Z

Validation should be done on each value given as a model input, either when creating the input xarray.Dataset or when we add the value to the key-value store (see #31). The second option might be better if we eventually want to use the modelling framework with an interface other than xarray.Dataset.

For performance reasons, it may be wise not to systematically validate the value of a variable when it is set/updated by a process during a simulation. Validation may be needed in some cases, though (e.g., to ensure that values computed in a process do not fall outside of an acceptable range). This would require a convenient way to call the validators of a (possibly foreign) variable from within a process.


benbovy · 2019-12-12T12:02:36Z

Ideally, input values should be validated as early as possible. However, validating them when creating the input dataset (using create_setup) is a bad idea, because input datasets may be created by other means (e.g., by loading a netCDF file or after some pre-processing). It would also complicate the validation itself, since it would have to account for additional dimension(s) of the input variables, such as the master clock for time-varying values or a batch dimension for running batches of simulations.

So the best place, common to all cases, is to validate just before setting inputs into the simulation data store.

All kinds of validation (even for model inputs) should be optional, as validation may impact performance. It could be controlled through a new parameter of Dataset.xsimlab.run(), e.g.:

Dataset.xsimlab.run(validate='nothing'): no validation is performed.
Dataset.xsimlab.run(validate='inputs'): validate only input values.
Dataset.xsimlab.run(validate='all'): validate both input values and values set by foreign variables in process classes.

The last option may be costly, but it is useful for debugging.
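As a rough illustration of the three proposed modes, here is a minimal, framework-free sketch. It is not the xarray-simlab implementation: the `Store` class, its `set_input` / `set_from_process` methods and the `in_range` helper are all hypothetical names, and the toy dictionary only stands in for the simulation data store.

```python
def in_range(low, high):
    """Build a hypothetical validator that rejects values outside [low, high]."""
    def check(name, value):
        if not low <= value <= high:
            raise ValueError(f"{name}={value!r} is outside [{low}, {high}]")
    return check


class Store:
    """Toy key-value store standing in for the simulation data store."""

    MODES = ("nothing", "inputs", "all")

    def __init__(self, validators, validate="inputs"):
        if validate not in self.MODES:
            raise ValueError(f"validate must be one of {self.MODES}")
        self.validators = validators  # maps variable name -> validator
        self.validate = validate
        self.data = {}

    def _check(self, key, value):
        validator = self.validators.get(key)
        if validator is not None:
            validator(key, value)

    def set_input(self, key, value):
        # Model inputs are checked just before they enter the store.
        if self.validate in ("inputs", "all"):
            self._check(key, value)
        self.data[key] = value

    def set_from_process(self, key, value):
        # Values written by processes are only checked in 'all' mode.
        if self.validate == "all":
            self._check(key, value)
        self.data[key] = value
```

With `validate="inputs"`, an out-of-range input raises `ValueError` while the same value written by a process passes unchecked; with `validate="all"` both paths are checked, which is where the extra cost at every update comes from.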

> This would require a convenient way to call the validators of a (possibly foreign) variable from within a process.

This is not possible, as validation must be performed at the level of a process class (e.g., when validation involves checking the values of multiple variables declared in the process class).

benbovy · 2019-12-12T13:26:37Z

> So the best place, common to all cases, is to validate just before setting inputs into the simulation data store.

Actually, using attr.validate, validation would rather be performed just after updating the simulation store. I think it doesn't really matter.
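To make the "validate after updating" behaviour concrete, here is a small sketch using the attrs library directly. The `Bounds` class and both validator functions are hypothetical and are not taken from xarray-simlab. With a class declared via `attr.s`, validators run at construction time but not on plain attribute assignment, so `attr.validate(instance)` is what re-checks the values once they have been updated. The second validator also reads another attribute of the instance, which is the kind of multi-variable check that has to be run at the level of the whole class.

```python
import attr


def positive(instance, attribute, value):
    """Hypothetical validator: the value must be strictly positive."""
    if value <= 0:
        raise ValueError(f"{attribute.name} must be > 0, got {value!r}")


def above_lower(instance, attribute, value):
    """Hypothetical cross-variable validator: compares against another attribute."""
    if value <= instance.lower:
        raise ValueError(
            f"{attribute.name}={value!r} must be greater than lower={instance.lower!r}"
        )


@attr.s
class Bounds:
    # Illustrative stand-in for a class declaring several variables.
    lower = attr.ib(validator=positive)
    upper = attr.ib(validator=above_lower)


bounds = Bounds(lower=1.0, upper=2.0)  # validators run during __init__

# Plain assignment on an attr.s class does not trigger the validators ...
bounds.upper = 0.5

# ... so the values are checked afterwards, on the whole instance.
try:
    attr.validate(bounds)
except ValueError as err:
    print(f"validation failed: {err}")
```

Because `attr.validate` runs every validator of the instance, the invalid value is already stored by the time the error is raised, which matches the "just after updating" ordering described above.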

benbovy added this to the 0.2 milestone Mar 6, 2018

benbovy removed this from the 0.2 milestone Apr 16, 2018

benbovy added this to the 0.3 milestone Aug 29, 2018

benbovy modified the milestones: 0.3, 0.4 Sep 26, 2019

benbovy mentioned this issue Dec 12, 2019

Validate values given as input or set in foreign processes #74

Merged

4 tasks

benbovy closed this as completed in #74 Dec 13, 2019
