Linear algebra namespace overview #165

kgryte · 2021-04-15T16:48:29Z

In NumPy (and other array libraries), linear algebra APIs can be found in both the top-level namespace and a separate linear algebra (linalg) sub-namespace.

The purpose of this issue is to provide an overview of how array libraries (notably NumPy and TensorFlow) currently delineate linear algebra APIs and to provide a place for discussing whether the specification should adopt a similar division and what that division should be.

As discussed in gh-147 and in meetings, whether the specification should adopt a similar division is a matter of some debate.

Notably, some array libraries, such as Dask, may have difficulty implementing some linear algebra APIs, and, thus, the question is raised as to whether we should include "essential", relatively straightforward linear algebra APIs in the top-level namespace (i.e., those APIs whose implementations do not require factorization (of the BLAS variety, rather than of the LAPACK variety)) and place the remaining APIs in a sub-namespace which could be an optional extension.

Some individuals have proposed that we include all linear algebra APIs (mirroring NumPy) in a sub-namespace linalg, but also mirror a select subset of APIs in the top-level namespace (e.g., matmul and linalg.matmul). The benefit of this is that (1) linear algebra APIs in the top-level namespace would be guaranteed and provide a minimal subset of linear algebra APIs in all standard-conforming array libraries and (2) for consistency, a user could simply request the linalg sub-namespace and get all linear algebra APIs (something which is not currently possible in, e.g., NumPy today).

What follows is an overview of NumPy's linear algebra API division which is adopted by several other array libraries (e.g., Dask, CuPy, JAX, etc).

NumPy

Top-Level Namespace

cross
diagonal
einsum
matmul
outer
tensordot
trace
transpose
vecdot

Note: essentially sum-products and basic array manipulation (transpose, diagonal).

Linalg Namespace

cholesky
det
eig
eigh
eigvals
eigvalsh
inv
lstsq
matrix_power
matrix_rank
norm
pinv
qr
slogdet
solve
svd
svdvals

TensorFlow

TensorFlow adopts a similar division, but places the following APIs in a linalg sub-namespace, rather than than the top-level namespace...

Top-Level

einsum
tensordot
transpose

Linalg Namespace

...
cross
diag (diagonal)
matmul
trace

Paths Forward

Everything in the top-level namespace.
- Advantages:
  1. A single namespace affords the simplest import strategy and mental model. No need to import a separate module. No need for users to recall that a separate module exists.
- Disadvantages:
  1. No "grouping" of related APIs.
  2. Requires that all specification-conforming array libraries implement all linear algebra APIs. (note: depending on one's views, this may be an advantage.)
Everything in a linear algebra sub-namespace.
- Advantages:
  1. Related APIs are grouped together.
  2. Possible to make linear algrebra APIs an "extension" (i.e., optional).
- Disadvantages:
  1. Requires that a specification-conforming array library implement all linear algebra APIs in the sub-namespace.
  2. Slightly less convenient, as need to import sub-package.
Split linear algebra across the top-level and a sub-namespace.
- Advantages:
  1. Allows delineating "universal" linear algebra APIs which every specification-conforming array library should implement from those which require factorizations and may be more difficult to implement.
- Disadvantages:
  1. Imposes a more complicated mental model. From a user's POV, not clear why one linear algebra API exists in the top-level namespace, while another lives in a sub-namespace. A division based on factorization (BLAS vs LAPACK) leaks implementation details of which a user is unlikely to be aware.
Split across top-level and sub-namespace and mirror top-level APIs in the sub-namespace. (e.g., matul and linalg.matmul)
- Advantages:
  1. Allows for certain basic linear algebra APIs to be mandatory, while others could be optional (i.e., implemented as part of an extension).
  2. Allows users wanting all linear algebra APIs to just import the sub-package and not have to remember which APIs are implemented in the top-level vs sub-namespace.
- Disadvantages:
  1. Duplicated APIs.

The text was updated successfully, but these errors were encountered:

kgryte · 2021-04-26T09:46:51Z

Based on offline discussions and the meeting discussion on (03/06/2021), option (4) seems most amenable and, while not a perfect solution, assuages competing concerns (mandatory vs optional) at the expense of a minimal number of duplicated APIs.

rgommers · 2021-06-30T18:49:14Z

This was implemented in gh-182, closing.

rgommers mentioned this issue Apr 21, 2021

torch.cross is divergent from np.cross pytorch/pytorch#50273

Open

kgryte mentioned this issue May 16, 2021

Move linear algebra APIs to an extension #182

Merged

rgommers added the topic: Linear Algebra Linear algebra. label Jun 30, 2021

rgommers closed this as completed Jun 30, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Linear algebra namespace overview #165

Linear algebra namespace overview #165

kgryte commented Apr 15, 2021 •

edited

Loading

kgryte commented Apr 26, 2021 •

edited

Loading

rgommers commented Jun 30, 2021

Linear algebra namespace overview #165

Linear algebra namespace overview #165

Comments

kgryte commented Apr 15, 2021 • edited Loading

NumPy

Top-Level Namespace

Linalg Namespace

TensorFlow

Top-Level

Linalg Namespace

Paths Forward

kgryte commented Apr 26, 2021 • edited Loading

rgommers commented Jun 30, 2021

kgryte commented Apr 15, 2021 •

edited

Loading

kgryte commented Apr 26, 2021 •

edited

Loading