Haocong WANG
be31f1ddf3
[GEMM] F8 GEMM, performance optimized. (#1384)
* add ab_scale init support
* enabled interwave
* add scale type; update isSupport
* adjust example
* clean
* enable f8 pure gemm rcr ckprofiler
* Add gemm_multiply_multiply instances
* clang format
* Optimize for ScaleBlockMNK=128
* enable abscale f8 gemm ck profiler
* Add pure f8 gemm test suite
* Reverting to the state of project at f60fd77
* update copyright
* clang format
* update copyright
---------
Co-authored-by: root <jizhan@amd.com>
[ROCm/composable_kernel commit: 8c90f25be3]
2024-07-19 22:06:52 +08:00