Haocong WANG
be31f1ddf3
[GEMM] F8 GEMM, performance optimized. (#1384)
* add ab_scale init support
* enabled interwave
* add scale type; update isSupport
* adjust example
* clean
* enable f8 pure gemm rcr ckprofiler
* Add gemm_multiply_multiply instances
* clang format
* Optimize for ScaleBlockMNK=128
* enable abscale f8 gemm ck profiler
* Add pure f8 gemm test suite
* Reverting to the state of project at f60fd77
* update copyright
* clang format
* update copyright
---------
Co-authored-by: root <jizhan@amd.com>
[ROCm/composable_kernel commit: 8c90f25be3]
2024-07-19 22:06:52 +08:00
..
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2023-09-20 22:15:56 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2023-09-26 08:39:11 -07:00
2023-07-26 14:18:15 -05:00
2023-05-31 18:46:57 -05:00
2024-06-18 10:26:49 +02:00
2024-01-24 13:47:48 -08:00
2023-05-31 18:46:57 -05:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2023-10-21 22:19:43 +02:00
2023-09-20 22:15:56 -07:00
2023-09-20 22:15:56 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-07-19 22:06:52 +08:00
2024-07-03 23:34:38 -07:00
2024-07-03 23:34:38 -07:00
2024-07-12 11:08:42 -07:00
2024-05-15 10:03:39 +02:00
2023-05-31 18:46:57 -05:00
2024-04-25 15:12:53 -05:00
2024-04-25 15:12:53 -05:00
2023-12-19 04:23:11 +08:00
2024-04-02 09:42:17 -07:00
2023-08-31 21:01:50 +08:00
2024-05-28 11:13:21 +08:00
2023-09-20 22:15:56 -07:00
2023-05-31 18:46:57 -05:00
2024-06-27 11:30:32 +02:00
2023-08-23 11:36:17 -07:00
2023-05-31 18:46:57 -05:00
2024-04-02 09:42:17 -07:00
2024-07-03 23:34:38 -07:00
2024-04-02 09:42:17 -07:00
2024-07-08 21:21:16 -07:00