zjing14 8a43beac2e Split k f16 (#97)
* init for splitk f16

* a working prototype

* debug

* perf debug

* update example

* instances for mk kn

* add instances for all layers

* clean

* clean

* add tuning

* format

* add mn_padding into irregular tile

* clean

Co-authored-by: Chao Liu <chao.liu2@amd.com>

[ROCm/composable_kernel commit: e221d11e51]
2022-02-25 01:19:37 -06:00
2022-02-18 21:44:11 -06:00
2022-02-25 01:19:37 -06:00
2022-02-25 01:19:37 -06:00
2022-02-23 17:23:49 -06:00
2022-02-22 22:45:28 -06:00
2022-02-25 01:19:37 -06:00
2022-02-06 22:32:47 -06:00
2022-02-24 20:11:36 -06:00
2018-10-08 22:49:58 -05:00
2021-08-08 17:41:54 +00:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
Description
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
Readme MIT 234 MiB
Languages
C++ 93.1%
Python 4.5%
CMake 1.5%
Shell 0.5%
Pawn 0.2%