Chao Liu
22d438ae9e
Add gridwise GEMM pipeline (#89)
* clean up
* add multiple thread scratch to ThreadwiseTensorSliceTransfer_v3r1
* add 2 stage prefetch
* add more sanity checks to transform_tensor_descriptor
* tweak
* enabling 2 stage prefetch to existing gridwise gemm; tweak
* enabling 2 stage prefetch to existing gridwise gemm
* move gridwise gemm pipeline into class; clean up
* add some irregular tile sizes
* update CalculateHasMainK0BlockLoop for multi-stage-prefetch
* refactor gridwise gemm pipeline class
2022-02-23 17:23:49 -06:00