Use Blockwise for matmul #452
Conversation
Codecov Report
Additional details and impacted files

@@            Coverage Diff             @@
##             main     #452      +/-   ##
==========================================
- Coverage   80.76%   80.75%   -0.02%
==========================================
  Files         159      159
  Lines       45869    45849      -20
  Branches    11238    11234       -4
==========================================
- Hits        37048    37026      -22
- Misses       6593     6595       +2
  Partials     2228     2228
Looks great, I played around with it and everything seems to work as expected. Currently you can't compile graphs with blockwise matmul into jax/numba; is that beyond the scope of this PR? I thought there was a `jax.vectorize`-type function that would make that trivial (for the jax case at least)?
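For reference, a minimal sketch of that idea (not this PR's implementation, and assuming nothing about PyTensor's JAX dispatch): `jax.numpy.vectorize` with a core signature broadcasts a plain 2D matmul over arbitrary leading batch dimensions, which is roughly what a JAX lowering of a Blockwise'd Dot would need to do.

```python
import jax.numpy as jnp

# Hedged sketch: broadcast a 2D core matmul over leading batch dimensions,
# roughly what a JAX lowering of Blockwise(Dot) would have to do.
core_matmul = jnp.vectorize(
    lambda a, b: a @ b,
    signature="(m,k),(k,n)->(m,n)",
)

a = jnp.ones((5, 3, 2, 4))  # batch dims (5, 3), core shape (2, 4)
b = jnp.ones((5, 3, 4, 6))  # batch dims (5, 3), core shape (4, 6)
print(core_matmul(a, b).shape)  # (5, 3, 2, 6)
```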
elif x1.type.ndim == 1:
    out = _matrix_matrix_matmul(x1[None], x2).squeeze(-2)
elif x2.type.ndim == 1:
    out = _matrix_matrix_matmul(x1, x2[:, None]).squeeze(-1)
Is all this better than a separate `_matrix_vector_matmul` function? I only ask because BLAS makes the distinction.
This should be fine. Once we go into optimizing this further in the jax/numba backends, we should be able to tell which case is which by inspecting the static types of the inputs.

Yes, it should be pretty simple. It's in the todo list: #430
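For illustration only, here is a NumPy analogue of the dispatch shown in the diff above (the PR itself inspects the static `x1.type.ndim` / `x2.type.ndim` of the symbolic inputs, not runtime arrays): 1D operands are promoted to matrices so a single matrix-matrix core handles every case, and the dummy axis is squeezed away afterwards.

```python
import numpy as np

def matmul_via_matrix_core(x1, x2):
    # Hedged NumPy analogue of the dispatch above, not PyTensor code:
    # promote vectors to matrices, apply the matrix-matrix core, then squeeze.
    if x1.ndim == 1 and x2.ndim == 1:
        return (x1[None] @ x2[:, None]).squeeze()  # vector @ vector -> scalar
    elif x1.ndim == 1:
        return (x1[None] @ x2).squeeze(-2)         # prepend dummy row axis, drop it
    elif x2.ndim == 1:
        return (x1 @ x2[:, None]).squeeze(-1)      # append dummy column axis, drop it
    return x1 @ x2                                 # already matrix @ matrix

A = np.arange(6.0).reshape(2, 3)
v = np.arange(3.0)
np.testing.assert_allclose(matmul_via_matrix_core(A, v), A @ v)
np.testing.assert_allclose(matmul_via_matrix_core(v, A.T), v @ A.T)
```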
We use a Blockwised Dot for matmul so that we get gradients for free (a short usage sketch follows at the end of this description).
C performance won't be great for the Blockwised Dot, since Blockwise doesn't have a C implementation.
We could Blockwise the more specialized Dot22 / GEMM Ops, but that code is a bit of a mess at the moment and not useful long term as we deprecate the C backend.
Alternatively we probably could use `tensor_dot` / `batched_dot`? They are fundamentally different though (or it would be rather inefficient to convert one to the other).

Closes #451
Needed for pymc-devs/pymc#6897
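A minimal usage sketch of what the description above is getting at (the `pt.matmul` entry point is assumed from the PR title and not verified against the final API): matmul built on a Blockwise'd Dot broadcasts over leading batch dimensions, and its gradient falls out of Dot's existing gradient.

```python
import numpy as np
import pytensor
import pytensor.tensor as pt

x = pt.tensor3("x")  # batch of matrices, shape (batch, m, k)
w = pt.matrix("w")   # weight matrix, shape (k, n)

y = pt.matmul(x, w)                # assumed entry point: Blockwise'd Dot under the hood
g = pytensor.grad(y.sum(), wrt=x)  # gradients "for free" via Dot's grad

f = pytensor.function([x, w], [y, g])
out, grad_x = f(np.ones((5, 2, 3)), np.ones((3, 4)))
print(out.shape, grad_x.shape)  # (5, 2, 4) (5, 2, 3)
```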