ENH - Mixture density class #1401

Closed
AustinRochford opened this issue Sep 27, 2016 · 6 comments

@AustinRochford
Member

I often find myself marginalizing over discrete variables and using hacks like

# Imports assumed for this snippet (they are not shown in the original comment):
import numpy as np
import scipy as sp
import scipy.stats  # make sp.stats available
import theano.tensor as tt
import pymc3 as pm
from pymc3.distributions.dist_math import bound
from pymc3.distributions.distribution import draw_values
from pymc3.math import logsumexp  # assumed source of logsumexp


def normal_mixture_rvs(*args, **kwargs):
    w = kwargs['w']
    mu = kwargs['mu']
    tau = kwargs['tau']

    size = kwargs['size']

    # Draw a component index for each row of the (row-normalized) weight matrix.
    component = np.array([np.random.choice(w.shape[1], size=size, p=w_ / w_.sum())
                          for w_ in w])

    return sp.stats.norm.rvs(mu[np.arange(w.shape[0]), component], tau[component]**-0.5)


class NormalMixture(pm.distributions.Continuous):
    def __init__(self, w, mu, tau, *args, **kwargs):
        super(NormalMixture, self).__init__(*args, **kwargs)

        self.w = w
        self.mu = mu
        self.tau = tau

        self.mean = (w * mu).sum()

    def random(self, point=None, size=None, repeat=None):
        w, mu, tau = draw_values([self.w, self.mu, self.tau], point=point)

        return normal_mixture_rvs(w=w, mu=mu, tau=tau, size=size)

    def logp(self, value):
        w = self.w
        mu = self.mu
        tau = self.tau

        # Marginalize the latent component indicator: log-sum-exp over components
        # of log(w_k) plus the normal log-density with precision tau_k.
        return bound(logsumexp(tt.log(w) + (-tau * (value - mu)**2 + tt.log(tau / np.pi / 2.)) / 2.,
                               axis=1).sum(),
                     tau >= 0, w >= 0, w <= 1)

to speed up convergence. It seems to me that we could add a nice built-in MixtureDensity class to make this easier and more flexible.
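
For concreteness, here is a rough sketch of how the hack above might be used in a model. This is only illustrative: the data y, the number of components K, and the priors are assumptions, and y is passed as a column so that (value - mu) broadcasts to an (n, K) array to match the axis=1 log-sum-exp in logp.

# Hypothetical usage of the NormalMixture hack above; the data, number of
# components, and priors below are illustrative assumptions.
import numpy as np
import pymc3 as pm

K = 3
y = np.random.randn(1000)  # placeholder data

with pm.Model():
    w = pm.Dirichlet('w', a=np.ones(K))
    mu = pm.Normal('mu', mu=0., sd=10., shape=K)
    tau = pm.Gamma('tau', alpha=1., beta=1., shape=K)

    # Pass y as a column so (value - mu) broadcasts to shape (n, K),
    # matching the axis=1 log-sum-exp in NormalMixture.logp.
    obs = NormalMixture('obs', w, mu, tau, observed=y[:, np.newaxis])

    trace = pm.sample(2000)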

Looking for feedback before starting to write this.

@kyleabeauchamp
Contributor

Do you recall a good citation that discusses this? Is it in the Stan paper?


@twiecki
Member

twiecki commented Sep 27, 2016

This would be great. I knew the Stan guys use this to pretty good effect, but didn't know how to do it properly.

@AustinRochford
Member Author

@kyleabeauchamp Yes, I think it's Chapter 11, "Latent Discrete Parameters," in the Stan reference manual v2.8.0.
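
For reference, the marginalization that chapter describes (and that the logp above implements) can be written out for K normal components with weights w_k as follows; this is just standard notation, nothing PyMC3-specific:

p(y_i \mid w, \mu, \tau) = \sum_{k=1}^{K} w_k \,\mathcal{N}\!\left(y_i \mid \mu_k, \tau_k^{-1}\right)

\log p(y_i \mid w, \mu, \tau) = \log \sum_{k=1}^{K} \exp\!\left[\log w_k + \tfrac{1}{2}\left(\log\frac{\tau_k}{2\pi} - \tau_k (y_i - \mu_k)^2\right)\right]

with the inner sum computed stably via log-sum-exp, so the discrete component indicator never has to be sampled.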

@AustinRochford
Member Author

@twiecki it definitely speeds up mixing drastically for some models. I will start working on this when I get the chance!

@fonnesbeck
Member

This is essentially what we do in our ZeroInflated* models to eliminate the use of latent indicators. It would be handy to have mixtures of often-used distributions like normals.
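
For anyone unfamiliar with that trick, here is a minimal NumPy/SciPy sketch of the zero-inflated marginalization; it is not the actual PyMC3 ZeroInflated* code, and the function and argument names are placeholders.

# Illustrative sketch of the ZeroInflated-style marginalization mentioned
# above; not the actual PyMC3 code.  The latent "structural zero" indicator
# is summed out of the likelihood instead of being sampled.
import numpy as np
from scipy.special import gammaln

def zero_inflated_poisson_logp(value, psi, theta):
    """log p(value) with the Bernoulli indicator marginalized out.

    psi   -- probability that an observation comes from the Poisson component
    theta -- Poisson rate
    """
    poisson_logp = -theta + value * np.log(theta) - gammaln(value + 1)
    # Zeros can come from either the point mass at zero or Poisson(theta);
    # nonzero counts can only come from the Poisson component.
    zero_case = np.logaddexp(np.log1p(-psi), np.log(psi) - theta)
    return np.where(value == 0, zero_case, np.log(psi) + poisson_logp)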

@AustinRochford
Member Author

AustinRochford commented Sep 29, 2016

@fonnesbeck I was envisioning a base class that could take arbitrary distributions for the mixture components, with subclasses/encapsulating classes for specific common use cases (zero inflated, normal mixtures, etc.).
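
To make that concrete, here is a purely hypothetical sketch of the kind of interface being described; the class names, argument names, and signatures are placeholders, not a committed design.

# Purely hypothetical interface sketch; names and signatures are placeholders.
import numpy as np
import pymc3 as pm

y = np.random.randn(1000)  # placeholder data

with pm.Model():
    w = pm.Dirichlet('w', a=np.ones(3))
    mu = pm.Normal('mu', mu=0., sd=10., shape=3)
    tau = pm.Gamma('tau', alpha=1., beta=1., shape=3)

    # A generic base class could mix arbitrary component densities...
    # obs = MixtureDensity('obs', w=w,
    #                      comp_dists=pm.Normal.dist(mu=mu, tau=tau, shape=3),
    #                      observed=y)

    # ...with convenience subclasses for common cases (normal mixtures,
    # zero-inflated counts, etc.):
    # obs = NormalMixture('obs', w=w, mu=mu, tau=tau, observed=y)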
