zjing14
71758e1559
Grouped GEMM for fp16 (#126)
* init of grouped_gemm
* 2 gemm test
* perf test
* clean
* wrap desc into a struct
* test cast static_arr to pointer
* add ptr to GemmDesc
* add grouped gemm profiler
* fixed mem issue with unique_ptr
* clean
* clean
* finished ckprofiler
* Update README.md
* readme
* fixed readme
* add example
* improve code
* fixed comments: reserve, separate ptr and gemm_shapes
* merge group and non-group
* fixed comments: replace push_back with emplace_back to avoid copy constructor
* fixed comments: unified blk2ctile; add test
* ci fix
* fixed ci
* fixed ci
* fixed ci
[ROCm/composable_kernel commit: 716f1c7fb1]
2022-03-22 18:18:18 -05:00