Jianfeng Yan
cb97ce68d8
Batched gemm and reduction (#156)
...
* adding batched_gemm_and_reduction
* batched_gemm_reduce works with batch_count=1
* fix a bug in grid_size; batched_gemm_reduce works for batch_count > 1
* adding profiler for batched_gemm_fp16
* fixed a bug in declaration of d1 and d0; both example and profiler work
* clang-format
* cleanup
* batched_gemm_reduce: add test
* minor change
* fixed some typos in function names
[ROCm/composable_kernel commit: 34c661e71c]
2022-03-30 11:21:18 -05:00