mirror of
https://github.com/NVIDIA/cutlass.git
synced 2026-04-19 22:38:56 +00:00
Fix SFB Layout scale granularity representation (#2924)
This commit is contained in:
@@ -19,7 +19,7 @@ and SFB. This leads to a GEMM $D = \alpha \text{SFA} * A \text{ SFB} * B + \beta
|
||||
|
||||
These can be represented in CuTe as:
|
||||
- *SFA Layout*: $((\text{scale granularity M}, M / \text{scale granularity M}), (\text{scale granularity K}, K / \text{scale granularity K})) : ((0, int), (0, int))$
|
||||
- *SFB Layout*: $((\text{scale granularity N}, M / \text{scale granularity M}), (\text{scale granularity K}, K / \text{scale granularity K})) : ((0, int), (0, int))$
|
||||
- *SFB Layout*: $((\text{scale granularity N}, N / \text{scale granularity N}), (\text{scale granularity K}, K / \text{scale granularity K})) : ((0, int), (0, int))$
|
||||
|
||||
The 0 element stride ensures the same group of coordinates to map to the same element in the scale factors.
|
||||
|
||||
|
||||
Reference in New Issue
Block a user