
Add support for symbolic initval using a singledispatch approach #4912


Merged: 1 commit merged into pymc-devs:main on Sep 2, 2021

Conversation

@kc611 (Contributor) commented on Aug 8, 2021:

This PR was built upon #4867 as a simpler singledispatch-based alternative.

Also Fixes: #4911

cc @ricardoV94 @michaelosthege
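
For readers new to the thread, here is a minimal sketch of the pattern under discussion (illustrative only, not the PR's actual code; the function name and the choice of returning the mean are assumptions): initial-value logic is registered per RandomVariable Op type with functools.singledispatch rather than living on a Distribution classmethod.

```python
from functools import singledispatch

from aesara.tensor.random.basic import NormalRV


@singledispatch
def _initval(op, rv, *dist_params):
    # Fallback when no Op-specific rule has been registered.
    raise NotImplementedError(f"No initial value rule for {op}")


@_initval.register(NormalRV)
def _initval_normal(op, rv, mu, sigma):
    # A stable default starting point for a normal RV: its mean.
    return mu
```

Calling `_initval(rv.owner.op, rv, *params)` then picks the rule based on the Op found in the variable's graph.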

@michaelosthege (Member):

I appreciate you looking into this too, @kc611, but I see several disadvantages of the singledispatch approach:

  1. It's not simpler than the classmethod route:
    The implementation of the singledispatch strategy is much more spread all over the place. For example, there is some stuff inside an if clause in Distribution.__new__, and two functions outside of the Distribution class where previously one classmethod was enough.
  2. The singledispatch based initval picking acts on TensorVariables:
    For logp/logcdf I can follow that argument, because those relate to statistical properties, but initial values for the MCMCs are a PyMC3 Model thing.
    Those TensorVariables can come from the Distribution.dist() API (as you did in the test) which doesn't even support initial values.

Unrelated to the concerns above, I will now cherry-pick some changes from #4867 into a separate PR so these diffs become easier to read.

@ricardoV94 (Member) commented on Aug 8, 2021:

It's not simpler than the classmethod route

I agree. There is one advantage, however: the method can be (re)used in other places. For example, I think the old pm.Mixture needed to access the initval of its component distributions in order to set its own initval.

@ricardoV94 ricardoV94 marked this pull request as ready for review August 10, 2021 17:51
@ricardoV94 ricardoV94 marked this pull request as draft August 10, 2021 18:50
@brandonwillard (Contributor):

  1. It's not simpler than the classmethod route:
    The implementation of the singledispatch strategy is much more spread all over the place.

What is this measure of "spread all over the place" and what makes it relevant?

From what I can tell, this is putting the initial value computation logic within the types/classes to which it corresponds. Would you rather have all this logic implemented as a bunch of fixed if-elif statements within Model or Distribution?

If you add methods to Distribution that compute/determine initial values, then you're doing effectively the same thing as this PR—except with far less run-time configurability (short of monkey-patching).

If you're not aware, dispatching/generic functions like these are another approach to the same functionality that class methods provide—albeit with different forms of flexibility.
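
To illustrate that equivalence with a toy sketch (not code from either PR): the same default choice can live on the class as a classmethod or in a registry keyed by type, and only the latter can be reconfigured at run-time without monkey-patching the class.

```python
from functools import singledispatch


class Distribution:
    @classmethod
    def default_initval(cls, *params):
        raise NotImplementedError


class Normal(Distribution):
    @classmethod
    def default_initval(cls, mu, sigma):
        return mu  # fixed on the class; changing it means monkey-patching


@singledispatch
def default_initval(dist, *params):
    raise NotImplementedError


@default_initval.register(Normal)
def _(dist, mu, sigma):
    return mu  # can be re-registered at run-time to change the default
```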

For example, there is some stuff inside an if clause in Distribution.__new__, and two functions outside of the Distribution class where previously one classmethod was enough.

Is this "one" class method approach able to accomplish exactly the same things? It looks like these changes allow one to set the default initial values for Distributions quite flexibly and succinctly, and make those defaults easily configurable at run-time.

2. The singledispatch based initval picking acts on TensorVariables:
For logp/logcdf I can follow that argument, because those relate to statistical properties, but initial values for the MCMCs are a PyMC3 Model thing.

In what sense does it "act on TensorVariables"? The dispatches of this feature and logp/logcdf are not on TensorVariables.

If you're referring to how default_initval takes TensorVariable arguments, then you're not seeing that this is simply a convenience. Distributions are currently one-to-one with RandomVariable types, and this means that no matter how you want to implement this you're necessarily using a RandomVariable type to determine how an initial value is computed. Whether you get that type from a TensorVariable's Op or type(self) is immaterial.
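
Concretely (a short illustrative snippet): the RandomVariable type is always recoverable from a variable's graph, so dispatching on the Op is equivalent to dispatching on the corresponding Distribution class.

```python
import aesara.tensor as at

rv = at.random.normal(0, 1)
# The Op carried by the graph identifies the distribution:
print(type(rv.owner.op))  # <class 'aesara.tensor.random.basic.NormalRV'>
```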

Those `TensorVariable`s can come from the `Distribution.dist()` API (as you did in the test) which doesn't even support initial values.

We're trying to assign initial values to random variable/distribution types—both at a conceptual level and at the type-level; that's the objective, and that's what this approach is doing. How are you proposing it be done without associating initial value computations/choices with Distributions/RandomVariable types?

All these arguments are phrased as "cons", but I'm not seeing any real implementation or design issues being raised. If you don't like the approach used in this PR, that's fine, but stating so in this way isn't very helpful. If you don't understand what these changes are doing or why, asking questions is better.

@ricardoV94 (Member) commented on Aug 11, 2021:

Just a note that I refactored (and broke) the code quite a lot after @michaelosthege's comment above, so the discussion could be a bit misplaced.

@brandonwillard (Contributor):

Just a note that I refactored (and broke) the code quite a lot after @michaelosthege's comment above, so the discussion could be a bit misplaced.

It's not misplaced, since all the relevant comments are basically only about using a dispatch-based approach, and that hasn't changed.

The codecov bot commented on Aug 11, 2021:

Codecov Report

Merging #4912 (46cb30a) into main (389f818) will increase coverage by 0.00%.
The diff coverage is 93.75%.


```
@@           Coverage Diff           @@
##             main    #4912   +/-   ##
=======================================
  Coverage   74.13%   74.13%
=======================================
  Files          86       86
  Lines       13882    13898     +16
=======================================
+ Hits        10291    10303     +12
- Misses       3591     3595      +4

Impacted Files                        Coverage Δ
pymc3/distributions/distribution.py   82.66% <91.66%> (+0.78%) ⬆️
pymc3/distributions/continuous.py     96.26% <100.00%> (+0.01%) ⬆️
pymc3/backends/report.py              89.51% <0.00%> (-2.10%) ⬇️
```

@kc611 (Contributor, Author) commented on Aug 14, 2021:

Re-running tests to check if the failure is flaky.

@kc611 kc611 closed this Aug 14, 2021
@kc611 kc611 reopened this Aug 14, 2021
@kc611 kc611 force-pushed the initval_dispatch branch from f6cb942 to ba93578 Compare August 14, 2021 12:13
@michaelosthege (Member):

@kc611, just a heads-up: yesterday @ricardoV94, @twiecki and I discussed the ongoing initval efforts, and we found a way to disentangle and resolve some of the implementation difficulties we're having with the different approaches.

We came up with the idea of treating a singledispatch-based re-implementation as something independent from initial values.
Thomas proposed naming that dispatched function something like "representative_point" and implementing it without touching anything initial-value related.
Independently, we can then work on refactoring how/when initial values are numerified, and in a third step we can change the Distribution.__new__(initval=...) signature to allow selecting between "random" and "moment".

@ricardoV94 @twiecki please comment/edit if I mixed up something.
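
To make the proposal concrete, here is a hypothetical sketch of such a dispatched function (the name follows the proposal above; the signature is an assumption):

```python
from functools import singledispatch


@singledispatch
def _representative_point(op, rv, *dist_params):
    # Fallback when no distribution-specific rule has been registered;
    # implementations are added per RandomVariable Op type via .register().
    raise NotImplementedError(f"No representative point defined for {op}")
```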

@twiecki (Member) commented on Aug 18, 2021:

Is this still required with #4942? If it is, we should probably change the naming here a bit like @michaelosthege advised above.

@ricardoV94 (Member) commented on Aug 18, 2021:

Is this still required with #4942? If it is, we should probably change the naming here a bit like @michaelosthege advised above.

Yeah, you still need this (i.e. the "moment" thing, not the symbolic initval). For transformed unconstrained variables we can do as Stan does (uniform on [-1, 1]), but unlike Stan we also sample constrained variables, in which case we still need a way to specify a stable starting point for those distributions.

The diff under review:

```python
    Parameters are the same as for the `.dist()` method.
    """

    return _representative_point(rv.owner.op, rv, *rv.owner.inputs[2:])
```
@ricardoV94 (Member) commented on this diff on Aug 19, 2021:

Actually, size is `inputs[1]` and `inputs[2]` is the dtype.
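
For context, Aesara's RandomVariable nodes follow the input convention `(rng, size, dtype, *dist_params)`, so slicing from index 2 picks up the dtype as well:

```python
import aesara.tensor as at

rv = at.random.normal(0, 1)
# Standard RandomVariable input layout:
rng, size, dtype, *dist_params = rv.owner.inputs
# `rv.owner.inputs[2:]` therefore includes the dtype alongside the
# distribution parameters; the parameters alone start at index 3.
```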

@kc611 kc611 force-pushed the initval_dispatch branch from cfdfc5e to e4f03e5 Compare August 20, 2021 17:26
The test under review:

```python
class TestMoment:
    def test_basic(self):
        rv = pm.Flat.dist()
        assert get_moment(rv).eval() == np.zeros(1)
```
@michaelosthege (Member) commented on this diff:
A random draw from a scalar RV has a shape == (). Shouldn't the moment have the same scalar shape?

@twiecki (Member) commented on Aug 30, 2021:

@kc611 do you know what's up with the unit tests here?

@kc611 (Contributor, Author) commented on Aug 30, 2021:

Ah, these failures are due to #4961 (comment); rebasing should fix them. In fact, this PR should be marked ready to merge (if the initval work is being done separately somewhere else).

@twiecki twiecki marked this pull request as ready for review August 30, 2021 15:44
@twiecki (Member) commented on Aug 30, 2021:

@kc611 want to rebase then?

@kc611 kc611 force-pushed the initval_dispatch branch 2 times, most recently from fee76c1 to e89684f Compare August 30, 2021 15:55
@michaelosthege (Member):

Asking the unpleasant question: Which moment will LogNormal get? Mean or mode?

(Or will there be a mechanism where this can be configured in some way?)

Also please check my comment from above - I don't know if you fixed it, but IIRC I saw something that looked like a shape problem around the corner.

@kc611 (Contributor, Author) commented on Aug 30, 2021:

Yeah, there will be some other mechanism according to #4912 (comment); this PR just implements the get_moment part of it.

Also please check my comment from above

Just saw it. The shape problem seems to be a limitation caused by TensorVariable/TensorConstant-type shapes being passed to aesara.tensor.zeros. Just fixed it and addressed your comment too. (Not sure if this .data approach will work every time though.)

@kc611 kc611 force-pushed the initval_dispatch branch from e89684f to 65b0680 Compare August 30, 2021 16:45
@kc611 (Contributor, Author) commented on Aug 30, 2021:

Not sure if this .data approach will work every-time though

Turns out it doesn't :-P

Looks like the .data approach won't work on Windows systems. Plus it'll cause problems with TensorVariable-type shapes.

And I was wondering why I (mentally) put a green check on this in the first place. It turns out the newly added tests pass without the .data on aesara==2.2.0, so I think we'd want to update that before this goes in. Otherwise we'll need to somehow change how shapes are being passed to those aet.zeros/aet.ones calls. However, sooner or later we'd like to switch to this particular approach.

Meanwhile we can use this PR to plan and implement the new functionality that was being planned in #4942
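
To make the shape discussion concrete, a small illustration (the workaround and the version note come from the thread above):

```python
import aesara.tensor as at

# `.data` exists only on TensorConstants, where it holds the numpy value:
const_shape = at.constant([3])
print(const_shape.data)  # -> array([3])

# A symbolic shape has no `.data`; passing it to `at.zeros` directly is the
# more general route, which per the note above works on aesara >= 2.2.0:
x = at.vector("x")
zeros = at.zeros(x.shape)
```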

@ricardoV94 (Member) commented on Aug 30, 2021:

Asking the unpleasant question: Which moment will LogNormal get? Mean or mode?

(Or will there be a mechanism where this can be configured in some way?)

I could not quite figure out what the point of multiple moments was. I think it was more a question of having at least one good moment for each distribution, and perhaps a discrete alternative for continuous distributions. For the latter, it should be enough to just round the output if a discrete initval from a continuous variable is needed.

@michaelosthege (Member):

Asking the unpleasant question: Which moment will LogNormal get? Mean or mode?
(Or will there be a mechanism where this can be configured in some way?)

I could not quite figure out what the point of multiple moments was. I think it was more a question of having at least one good moment for each distribution, and perhaps a discrete alternative for continuous distributions. For the latter, it should be enough to just round the output if a discrete initval from a continuous variable is needed.

My point is kinda that I don't want to make that decision. In v3 the user did that.

If the get_moment method could take a moment: str kwarg (in addition to *rv_inputs), the implementation of LogNormal.get_moment, for example, could be parametrized.
Not sure if that's a good design though.
Just continuing that thought process: if moment: str is a standard kwarg for that method, we could make the initval kwarg also optionally be a string such as random | mean | mode | median and just forward it through get_moment.
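
A hypothetical sketch of that parametrized design (the dispatch signature is an assumption; the mean/mode formulas are the standard lognormal ones):

```python
from functools import singledispatch

import aesara.tensor as at
from aesara.tensor.random.basic import LogNormalRV


@singledispatch
def get_moment(op, rv, *dist_params, moment: str = "mean"):
    raise NotImplementedError(f"No moment rule for {op}")


@get_moment.register(LogNormalRV)
def _(op, rv, mu, sigma, moment: str = "mean"):
    if moment == "mean":
        return at.exp(mu + 0.5 * sigma**2)  # lognormal mean
    if moment == "mode":
        return at.exp(mu - sigma**2)  # lognormal mode
    raise ValueError(f"Unsupported moment: {moment}")
```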

@ricardoV94 (Member):

I don't think the multiple moments were there for the user's convenience. They were not even advertised in the docs.

@twiecki twiecki merged commit 9e15b20 into pymc-devs:main Sep 2, 2021
@twiecki (Member) commented on Sep 2, 2021:

Thanks @kc611!

The merged code under discussion:

```python
def get_moment(rv: TensorVariable) -> TensorVariable:
    """Fallback method for creating an initial value for a random variable.

    Parameters are the same as for the `.dist()` method.
```
A Member commented on this diff:

I know this was merged already, but this part of the docstrings is wrong

Another Member replied:

OK, we should fix that then. CC @kc611

@kc611 (Contributor, Author) replied on Sep 2, 2021:

Ah yes, I missed that; those docstrings were supposed to be removed.

I'm not sure what (docstring) will go in its place though. Maybe I should just remove them for now? We can add a proper explanation when we give get_moment a proper entry point in the initval framework (if that's being planned).

A Member replied:

Yes, then just remove them for now.

@kc611 (Contributor, Author) replied:

Did it in #4979


Merging this pull request may close: Can't set symbolic initval.