Add grouped conv bwd wei merged grouped instance for larger filter (#1984)

* Add grouped conv bwd wei merged grouped instance for larger filter

* Update readme
Bartłomiej Kocot
2025-03-18 16:16:24 +01:00
committed by GitHub
parent 1342ecf7fb
commit fdaff5603e
2 changed files with 5 additions and 3 deletions

@@ -36,10 +36,10 @@ Table of supported cases by instance factory with XDL instruction:
 | |NHWGC/GKYXC/NHWGK|NGCHW/GKYXC/NGKHW|GNHWC/GKYXC/GNHWK|
 |-------|---|---|---|
-|bf16|2D, 3D|✗|✗|
+|bf16|2D, 3D|2D, 3D|✗|
 |bf16(fp32 for weight)|2D, 3D|✗|1D, 2D, 3D|
-|fp16 |2D, 3D|✗|1D, 2D, 3D|
-|fp32 |2D, 3D|✗|1D, 2D, 3D|
+|fp16 |2D, 3D|2D, 3D|1D, 2D, 3D|
+|fp32 |2D, 3D|2D, 3D|1D, 2D, 3D|
 Table of supported cases by instance factory with WMMA instruction: