Haocong WANG
|
ec7e5b1331
|
[GEMM] Optimization for MI200/300. (#1135)
* Optimize GEMM on MI200/300:
1. Add new blockwise gemm pipeline
2. Add irregular splitk intances
* clang format + typo fix
* Fix a bug
[ROCm/composable_kernel commit: bb63b9732c]
|
2024-01-19 07:02:22 -06:00 |
|