Commit d0682f9

Used byte addresses in the definitions of tile load/store
1 parent 85bc8ec commit d0682f9

1 file changed: +4 -4 lines changed


src/integrated-matrix.adoc

Lines changed: 4 additions & 4 deletions
@@ -1140,7 +1140,7 @@ Loads a 2D matrix tile from memory into the vector register group starting at `v
 Let _linesize_ = λ × LMUL.
 For each element index `i` in the body `[vstart, VL)` where the mask is enabled:

-VD[i] = M[rs1 + (i / linesize) × LD + (i % linesize)]
+VD[i] = M[rs1 + SEW × ((i / linesize) × LD + (i % linesize))]

 This instruction is the correct choice when A is stored in row-major order or when B is
 stored in column-major order: in both cases the memory layout consists of _linesize_-element
@@ -1156,7 +1156,7 @@ Stores the 2D matrix tile held in the vector register group starting at `vs3` to
 Let _linesize_ = λ × LMUL.
 For each element index `i` in the body `[vstart, VL)` where the mask is enabled:

-M[rs1 + (i / linesize) × LD + (i % linesize)] = VS[i]
+M[rs1 + SEW × ((i / linesize) × LD + (i % linesize))] = VS[i]

 ===== `vmttl.v` — Transposing Tile Load
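Review note on the two non-transposing hunks: the change multiplies the element index by the element size, so the offset added to `rs1` becomes a byte offset. A minimal Python sketch of that address computation follows. It assumes that `SEW` in the formula means the element width in bytes (if it is the bit width from `vtype`, the factor would be `SEW / 8`), that `LD` is counted in elements, and that memory is little-endian. Masking is ignored, and the function names are hypothetical, not part of the specification.

```python
def tile_byte_offset(i, linesize, ld, sew_bytes):
    """Byte offset from rs1 for element i, per the updated non-transposing formula.

    Assumption: sew_bytes is the element width in bytes and ld is in elements.
    """
    row, col = divmod(i, linesize)      # i / linesize and i % linesize
    return sew_bytes * (row * ld + col)


def tile_load(mem, rs1, vl, linesize, ld, sew_bytes, vstart=0):
    """Model VD[i] = M[rs1 + offset(i)] for i in [vstart, vl), all lanes unmasked."""
    vd = []
    for i in range(vstart, vl):
        addr = rs1 + tile_byte_offset(i, linesize, ld, sew_bytes)
        vd.append(int.from_bytes(mem[addr:addr + sew_bytes], "little"))
    return vd


# Sixteen 16-bit values 0..15 stored contiguously; LD = 6 elements per memory row.
mem = b"".join(v.to_bytes(2, "little") for v in range(16))
print(tile_load(mem, rs1=0, vl=8, linesize=4, ld=6, sew_bytes=2))
# prints [0, 1, 2, 3, 6, 7, 8, 9]
```

For `i = 5` the pre-change expression evaluates to 7, while the updated one gives 14 for two-byte elements. The two agree only when elements are one byte wide.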

@@ -1170,7 +1170,7 @@ This instruction is used when a B tile is stored in row-major order, or when an
 Let _linesize_ = λ × LMUL.
 For each element index `i` in the body `[vstart, VL)` where the mask is enabled:

-VD[i] = M[rs1 + (i % linesize) × LD + (i / linesize)]
+VD[i] = M[rs1 + SEW × ((i % linesize) × LD + (i / linesize))]

 ===== `vmtts.v` — Transposing Tile Store

@@ -1183,7 +1183,7 @@ Stores a 2D matrix tile from vector registers to memory, applying the inverse tr
 Let _linesize_ = λ × LMUL.
 For each element index `i` in the body `[vstart, VL)` where the mask is enabled:

-M[rs1 + (i % linesize) × LD + (i / linesize)] = VS[i]
+M[rs1 + SEW × ((i % linesize) × LD + (i / linesize))] = VS[i]

 <<<
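Review note on the transposing hunks: here `i % linesize` selects the memory row and `i / linesize` the position within it, so consecutive register elements walk down a column in memory. The Python sketch below models the updated `vmttl.v` and `vmtts.v` formulas and checks that a store undoes a load. It assumes `SEW` is the element width in bytes, `LD` is in elements, memory is little-endian, `vstart` is 0, and no lanes are masked. All function names are hypothetical.

```python
def transposed_byte_offset(i, linesize, ld, sew_bytes):
    """Byte offset from rs1 for element i, per the updated transposing formula."""
    col, row = divmod(i, linesize)      # i / linesize is the column, i % linesize the row
    return sew_bytes * (row * ld + col)


def transposing_tile_load(mem, rs1, vl, linesize, ld, sew_bytes):
    """Model VD[i] = M[rs1 + offset(i)] for i in [0, vl)."""
    vd = []
    for i in range(vl):
        addr = rs1 + transposed_byte_offset(i, linesize, ld, sew_bytes)
        vd.append(int.from_bytes(mem[addr:addr + sew_bytes], "little"))
    return vd


def transposing_tile_store(mem, rs1, vs, linesize, ld, sew_bytes):
    """Model M[rs1 + offset(i)] = VS[i]; mem must be a mutable bytearray."""
    for i, value in enumerate(vs):
        addr = rs1 + transposed_byte_offset(i, linesize, ld, sew_bytes)
        mem[addr:addr + sew_bytes] = value.to_bytes(sew_bytes, "little")


# A 3x4 row-major matrix of 16-bit values 0..11, so LD = 4 elements.
src = b"".join(v.to_bytes(2, "little") for v in range(12))
vd = transposing_tile_load(src, 0, 12, linesize=3, ld=4, sew_bytes=2)
print(vd)  # prints [0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11]

# Storing the loaded elements with the same parameters restores the original bytes.
dst = bytearray(len(src))
transposing_tile_store(dst, 0, vd, linesize=3, ld=4, sew_bytes=2)
print(bytes(dst) == src)  # prints True
```

The loaded sequence lists the matrix column by column, which is the transpose of the row-major layout in memory.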
