Bartłomiej Kocot
cbc8335964
Improve XDL to WMMA porting for grouped conv fwd (#3456)
Refactors how the number of XDL (matrix multiply-accumulate) instructions per wave is calculated and used in the grouped convolution forward implementations, mainly to better support WMMA (Wave Matrix Multiply-Accumulate) instructions and 16x16 tiles.
The changes use MXdlPerWave instead of NXdlPerWave to increase the number of waves along the M dimension.
2025-12-19 15:58:51 -07:00