Anthony Chang
7db48f9008
Tune & add conflict-free LDS gemm kernels (#159)
* retune & add conflict-free bf16/fp16 c-shuffle gemm instances
amend wrong K1 value in some fp16/bf16 kernel instances
* make gemm cshuffle's timing behavior consistent with all other functions
* clang-format
* retune & add conflict-free fp32 c-shuffle gemm instances
* retune & add conflict-free int8 c-shuffle gemm instances
* update the underlying gridwise gemm of all c-shuffle gemm kernels
* typo
2022-03-31 12:58:41 -05:00
..
2022-03-29 10:52:25 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-08 21:46:36 -06:00
2022-03-08 21:46:36 -06:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-08 21:46:36 -06:00
2022-03-31 12:33:34 -05:00
2022-03-08 21:46:36 -06:00
2022-03-08 21:46:36 -06:00
2022-03-08 21:46:36 -06:00
2022-03-08 21:46:36 -06:00
2022-03-08 21:46:36 -06:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-08 21:46:36 -06:00
2022-03-08 21:46:36 -06:00
2022-03-23 22:18:42 -05:00
2022-03-31 12:33:34 -05:00
2022-03-30 11:21:18 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:58:41 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-30 21:32:49 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-22 14:35:14 -05:00
2022-03-22 14:35:14 -05:00
2022-03-22 14:35:14 -05:00
2022-03-22 14:35:14 -05:00
2022-03-22 14:35:14 -05:00
2022-03-22 14:35:14 -05:00
2022-03-22 14:35:14 -05:00
2022-03-31 12:33:34 -05:00
2022-03-31 12:33:34 -05:00
2022-03-30 21:32:49 -05:00