Rename and refactor start dict in sampling #5027

michaelosthege · 2021-09-26T19:17:29Z

This PR is in preparation of switching to the new initval framework.

To make debugging and reviewing a little easier, I added type hints and moved the start-related code closer together.

API/Behavior changes:

A few lines related to the now unsupported use of length-zero traces as pm.sample inputs were removed.
The result from init_nuts is now always used as the initial/starting point, whereas before it was only used unless a start dict was manually specified. init_nuts itself combines the automatically determined initial point with the user-provided initvals such that initvals take priority.
The pm.sample(start=...) kwarg was renamed to initvals, to reflect that it takes the same keys/values/signature as model.initial_values or the corresponding Distribution.__new__(initval=...) kwarg.
start kwargs of lower-level sampling functions are now required to be numeric & complete. model.update_start_vals is no longer applied by lower-level functions.
Moving forward only pm.sample(initvals=...) and init_nuts(initvals=...) take the fully-flexibly initval-style dictionary of potentially incomplete (and soon also symbolic, "prior", or "moment" valued) initval strategies.
Checks of non-inf/nan initial points and corresponding shapes now run under all circumstances.

codecov · 2021-09-26T19:32:19Z

Codecov Report

Merging #5027 (6156cbb) into main (641b278) will increase coverage by 0.01%.
The diff coverage is 83.33%.

❗ Current head 6156cbb differs from pull request most recent head 75f4c46. Consider uploading reports for the commit 75f4c46 to get more accurate results

@@            Coverage Diff             @@
##             main    #5027      +/-   ##
==========================================
+ Coverage   77.82%   77.83%   +0.01%     
==========================================
  Files         128      128              
  Lines       24380    24384       +4     
==========================================
+ Hits        18973    18980       +7     
+ Misses       5407     5404       -3

Impacted Files	Coverage Δ
pymc/sampling.py	`87.06% <81.81%> (+0.04%)`	⬆️
pymc/parallel_sampling.py	`87.33% <100.00%> (+1.04%)`	⬆️

michaelosthege · 2021-09-27T13:32:12Z

@aseyboldt the only remaining test failure is limited to float32. The test was written by you 4 years ago and I can't tell why it failed:
https://github.com/pymc-devs/pymc3/pull/5027/checks?check_run_id=3720168022#step:7:2486

I was able to establish that the initial point that failes the _check_start_shape() was {'a': array([0., 0.], dtype=float32), 'c': array([0., 0.], dtype=float32)}.

michaelosthege · 2021-09-27T17:44:39Z

I XFAILed the test, since the error appears in a branch (start shape checking) that was not triggered before.

ricardoV94

Needs a release note

pymc/sampling.py

michaelosthege · 2021-09-27T21:47:45Z

I'm collecting all release notes in the Hackmd document. Also there will be another update on this API with the next PR.

If that's fine with you, I will update the release notes in the next PR?

pymc/sampling.py

aloctavodia · 2021-09-28T05:32:46Z

pymc/sampling.py

    try:
        step = CompoundStep(step)
    except TypeError:
        pass

-    point = Point(start, model=model, filter_model_vars=True)
+    point = start


better to simply use "start" instead of renaming the variable.

I decided against that because the "point" variable is overwritten all over again while iterating the draws. So it's no longer the "start" after the first iteration and I wanted to avoid confusion because of lines like strace.record(start, stats).

Take out leftover start-from-trace support. And rearrange some code blocks for easier refactoring later.

Co-authored-by: Osvaldo Martin <[email protected]>

The initial point is now determined exactly once in the control flow: + By `init_nuts` (initvals replace init results). + In `sample`, if the above does not apply or fails. Lower-level sampling functions now require the `start` kwarg to be a complete dictionary of numeric initial values for all free variables. The initial points for _each_ chain is checked for shape and logp inf/nan once in `sample`, even if they may be identical for all chains. Co-authored-by: Osvaldo Martin <[email protected]>

michaelosthege

Thanks @aloctavodia ! With the exception of one item I implemented your suggestions :)

Please let me know if we can move forward with this PR.

michaelosthege · 2021-09-28T07:59:08Z

pymc/sampling.py

    try:
        step = CompoundStep(step)
    except TypeError:
        pass

-    point = Point(start, model=model, filter_model_vars=True)
+    point = start


I decided against that because the "point" variable is overwritten all over again while iterating the draws. So it's no longer the "start" after the first iteration and I wanted to avoid confusion because of lines like strace.record(start, stats).

michaelosthege added enhancements maintenance labels Sep 26, 2021

michaelosthege added this to the vNext (4.0.0) milestone Sep 26, 2021

michaelosthege force-pushed the rename-start branch from 6156cbb to 81183a8 Compare September 27, 2021 12:19

michaelosthege requested a review from aseyboldt September 27, 2021 13:32

michaelosthege force-pushed the rename-start branch from 81183a8 to 4e4f1c8 Compare September 27, 2021 17:43

michaelosthege force-pushed the rename-start branch from 4e4f1c8 to 10b71d9 Compare September 27, 2021 17:48

michaelosthege marked this pull request as ready for review September 27, 2021 17:49

michaelosthege assigned ricardoV94 and unassigned ricardoV94 Sep 27, 2021

michaelosthege requested a review from ricardoV94 September 27, 2021 17:49

ricardoV94 reviewed Sep 27, 2021

View reviewed changes

pymc/sampling.py Outdated Show resolved Hide resolved

michaelosthege mentioned this pull request Sep 27, 2021

Refactor _check_start_shape to evaluate shapes without needing to draw samples #5031

Closed

aloctavodia reviewed Sep 28, 2021

View reviewed changes

michaelosthege and others added 2 commits September 28, 2021 09:51

Define types of start kwargs

ec7e249

Take out leftover start-from-trace support. And rearrange some code blocks for easier refactoring later.

Rename start kwarg to initvals

0c0372f

Co-authored-by: Osvaldo Martin <[email protected]>

michaelosthege force-pushed the rename-start branch from 10b71d9 to 62f6408 Compare September 28, 2021 08:04

michaelosthege force-pushed the rename-start branch from 62f6408 to 75f4c46 Compare September 28, 2021 08:09

michaelosthege commented Sep 28, 2021

View reviewed changes

aloctavodia approved these changes Sep 28, 2021

View reviewed changes

michaelosthege merged commit 00e6eb9 into pymc-devs:main Sep 28, 2021

michaelosthege deleted the rename-start branch September 28, 2021 14:15

michaelosthege mentioned this pull request Sep 30, 2021

MLDA needs refactoring to become operable again #5021

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Rename and refactor start dict in sampling #5027

Rename and refactor start dict in sampling #5027

Uh oh!

michaelosthege commented Sep 26, 2021 •

edited

Loading

Uh oh!

codecov bot commented Sep 26, 2021 •

edited

Loading

Uh oh!

michaelosthege commented Sep 27, 2021

Uh oh!

michaelosthege commented Sep 27, 2021

Uh oh!

ricardoV94 left a comment

Uh oh!

Uh oh!

michaelosthege commented Sep 27, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aloctavodia Sep 28, 2021

Uh oh!

michaelosthege Sep 28, 2021

Uh oh!

michaelosthege left a comment

Uh oh!

michaelosthege Sep 28, 2021

Uh oh!

Uh oh!

Uh oh!

Rename and refactor start dict in sampling #5027

Rename and refactor start dict in sampling #5027

Uh oh!

Conversation

michaelosthege commented Sep 26, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

API/Behavior changes:

Uh oh!

codecov bot commented Sep 26, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

michaelosthege commented Sep 27, 2021

Uh oh!

michaelosthege commented Sep 27, 2021

Uh oh!

ricardoV94 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

michaelosthege commented Sep 27, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aloctavodia Sep 28, 2021

Choose a reason for hiding this comment

Uh oh!

michaelosthege Sep 28, 2021

Choose a reason for hiding this comment

Uh oh!

michaelosthege left a comment

Choose a reason for hiding this comment

Uh oh!

michaelosthege Sep 28, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

michaelosthege commented Sep 26, 2021 •

edited

Loading

codecov bot commented Sep 26, 2021 •

edited

Loading