
Broadcast matrix inputs to Gemm #986


Merged (4 commits, Jun 13, 2022)

Conversation

@ricardoV94 (Contributor) commented Jun 8, 2022

The Gemm Op is only applicable when there is no broadcasting between Z and dot(A, B) in the expression Z + dot(A, B). This PR broadcasts the matrix inputs when the Op was inserted for a mix of matrices that weren't known to be row / column matrices until runtime.

Closes #984
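The failure mode can be sketched in plain NumPy (an illustration of the idea, not Aesara's actual implementation): a fused GEMM kernel computes `alpha * dot(A, B) + beta * Z` into Z's storage, so Z must already have the full output shape; unlike the elementwise `+`, it cannot broadcast a row or column matrix on the fly. Broadcasting Z up front restores the expected result:

```python
import numpy as np

# Z turns out to be a row matrix at runtime: a strict GEMM that accumulates
# into Z's storage cannot handle this, because Z lacks the full output shape.
A = np.random.rand(3, 4)
B = np.random.rand(4, 5)
Z = np.random.rand(1, 5)

# The fix sketched here: materialize the broadcast before the fused op, so
# the GEMM-style computation sees operands of matching shape.
out_shape = (A.shape[0], B.shape[1])
Z_full = np.ascontiguousarray(np.broadcast_to(Z, out_shape))

result = 1.0 * (A @ B) + 1.0 * Z_full
assert np.allclose(result, A @ B + Z)  # matches the broadcasting "+"
```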

@ricardoV94 (Contributor, Author)

Converting to draft because of the performance concerns that this fix entails

@brandonwillard (Member)

Perhaps it makes sense to allow this type of constraint at the type level, more in line with the old broadcastable flag? Type shapes would then not only be limited to (None, int), but also allow for a special flag -1 or "not1" to indicate this dimension can be anything other than 1.
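As a rough illustration of that proposal (the `NOT1` marker and the `shape_compatible` helper below are hypothetical names invented for this sketch, not part of Aesara's type system), a type shape could mix concrete sizes, unknowns, and an "anything but 1" flag:

```python
# Hypothetical sketch of the proposed type-shape flag: alongside concrete
# ints and None ("unknown"), a NOT1 marker would assert "this dimension has
# any size except 1", ruling out broadcasting along it.
NOT1 = "not1"

def shape_compatible(type_shape, runtime_shape):
    """Check a concrete runtime shape against a (None | int | NOT1) pattern."""
    if len(type_shape) != len(runtime_shape):
        return False
    for spec, size in zip(type_shape, runtime_shape):
        if spec is None:
            continue                  # unknown dimension: anything goes
        if spec == NOT1:
            if size == 1:             # broadcastable dims are excluded
                return False
        elif spec != size:            # concrete size must match exactly
            return False
    return True

assert shape_compatible((NOT1, 5), (3, 5))      # 3 != 1: allowed
assert not shape_compatible((NOT1, 5), (1, 5))  # row matrix: rejected
```

Under such a flag, an Op like Gemm could declare at the type level that its matrix inputs never broadcast, instead of discovering the mismatch at runtime.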

The old TensorType.broadcastable is still present in exactly the same form as it was. The only differences might be in how we want to use and interpret it.

I made a remark about this (i.e. the "old") interpretation of TensorType.broadcastable recently here—among other places/times throughout our work. The problem with the "old"/strict interpretation is that it puts extra pressure on Op.make_node implementations to both infer and be accurate about the broadcast patterns/static shape information in the TensorTypes it creates. We've been dealing with the issues and limitations that arise from this interpretation all throughout this work.

The type constraints you mention are viable, but also really do require a much more clearly defined and implemented type system, and, ultimately, some basic constraint logic. My push for the broad use of miniKanren is—in part—motivated by the availability (and compartmentalization) of such features.

Regardless, why can't we broadcast all the inputs to the GEMM Op in the rewrite (or even in the Op.make_node or Op.perform/Op.c_code methods)?

@ricardoV94 (Contributor, Author)

> Regardless, why can't we broadcast all the inputs to the GEMM Op in the rewrite (or even in the Op.make_node or Op.perform/Op.c_code methods)?

I'll explore that. I didn't plan to mess with BLAS-related Ops, but here we are ^^

In any case, I feel that supporting (and enforcing) non-size-1 type shapes might come in handy in a couple of places.

@ricardoV94 (Contributor, Author)

Broadcasting the matrix inputs was not so hard in the end. Doing that now.

@codecov bot commented Jun 10, 2022

Codecov Report

Merging #986 (47cc1f0) into main (064e72f) will increase coverage by 0.00%.
The diff coverage is 100.00%.


@@           Coverage Diff           @@
##             main     #986   +/-   ##
=======================================
  Coverage   79.26%   79.26%           
=======================================
  Files         152      152           
  Lines       47927    47932    +5     
  Branches    10912    10913    +1     
=======================================
+ Hits        37990    37995    +5     
  Misses       7429     7429           
  Partials     2508     2508           
Impacted Files | Coverage Δ
aesara/tensor/blas.py | 79.71% <100.00%> (+0.09%) ⬆️

@ricardoV94 (Contributor, Author)

Tests are passing

@brandonwillard (Member) previously approved these changes on Jun 10, 2022, leaving a comment:


This looks great.

@ricardoV94 (Contributor, Author) commented Jun 13, 2022

I did some sanity checks and I am more confident that I didn't screw up anything :)

Successfully merging this pull request may close these issues.

Gemm fails with simple broadcasting case