Is `calculate_impact` correct?

When I was looking at the multi-cell geolift notebook I started wondering if the `calculate_impact` method of the `PyMCModel` class was correct. In the post intervention period in the top plot you can see that the posterior expectation has narrow HDI's and the data points are far away. Yet when we look in the causal impact plot which simply looks at the difference between the data and the posterior mu distribution, we see larger HDI's which sometimes overlap with zero.

<img width="593" alt="Image" src="https://github.com/user-attachments/assets/04eda21f-5019-4ef0-9bfc-8a8bd1887c56" />

This was confirmed, the current implementation calculates the causal impact as the difference between the data and the posterior predictive distribution. 

https://github.com/pymc-labs/CausalPy/blob/714c48b467fba140c74bdd18c16d430ae48cd581/causalpy/pymc_models.py#L169-L173

I think this should instead be a comparison between the data and the posterior expectation. So instead of `y_pred["posterior_predictive"]["y_hat"]` we should have `y_pred["posterior_predictive"]["mu"]`?

If so, then the implications are that our estimates of the causal impact will _increase_ in precision. If the current code is in error, then we are not getting biased estimates, we are just getting less precise estimates than we should be getting out.

Making this change results in this...

<img width="253" alt="Image" src="https://github.com/user-attachments/assets/cf40b2ee-a861-4392-9561-cf5653ef649f" />

	def calculate_impact(
	self, y_true: xr.DataArray, y_pred: az.InferenceData
	) -> xr.DataArray:
	impact = y_true - y_pred["posterior_predictive"]["y_hat"]
	return impact.transpose(..., "obs_ind")

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Is `calculate_impact` correct? #496

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Is calculate_impact correct? #496

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Is `calculate_impact` correct? #496