Haocong WANG
3049b5467c
[GEMM] gemm_universal related optimization (#1453)
* replace buffer_atomic with global_atomic
* fixed global_atomic_add
* added bf16 atomic_add
* format
* clang-format-12
* clean
* clean
* add guards
* Update gtest.cmake
* enabled splitk_gemm_multi_d
* format
* add ckProfiler
* format
* fixed naming
* format
* clean
* clean
* add guards
* fix clang format
* format
* add kbatch printout
* clean
* Add rocm6.2 related gemm optimization
* Limit bf16 atomic usage
* remove redundant RCR gemm_universal instance
* Add RRR fp8 gemm universal instance
* Bug fix
* Add GPU_TARGET guard to FP8/BF8 target
* bug fix
* update cmake
* remove all fp8/bf8 example if arch not support
* Enable fp8 RRR support in ckProfiler
* limit greedy-reverse flag to gemm_universal in ckProfiler
---------
Co-authored-by: Jing Zhang <jizhan@fb.com>
Co-authored-by: Jing Zhang <jizhan@meta.com>
Co-authored-by: zjing14 <zhangjing14@gmail.com>
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
Co-authored-by: illsilin <Illia.Silin@amd.com>
2024-08-14 10:42:30 +08:00
..
2024-08-06 10:06:10 +02:00
2023-08-10 12:04:35 +08:00
2024-06-27 00:33:34 -07:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2024-06-27 00:33:34 -07:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-06-27 00:33:34 -07:00
2024-05-17 10:42:51 -07:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2023-11-30 15:09:27 -06:00
2023-11-30 15:09:27 -06:00
2023-11-30 15:09:27 -06:00
2024-04-19 13:31:17 +02:00
2024-08-06 10:06:10 +02:00
2024-06-18 10:26:49 +02:00
2024-06-27 00:33:34 -07:00
2024-06-18 10:26:49 +02:00
2023-08-18 11:14:59 +08:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2023-05-31 18:46:57 -05:00
2024-05-17 10:42:51 -07:00
2024-06-27 00:33:34 -07:00
2024-05-17 10:42:51 -07:00
2024-04-19 13:31:17 +02:00
2023-05-31 18:46:57 -05:00
2024-04-19 13:31:17 +02:00
2024-06-27 00:33:34 -07:00
2023-07-26 07:19:55 -07:00
2024-06-27 00:33:34 -07:00
2024-05-10 09:41:39 -07:00
2024-04-26 07:26:30 -05:00
2024-06-27 00:33:34 -07:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2024-06-27 00:33:34 -07:00
2023-11-25 13:35:22 +01:00
2024-07-19 22:06:52 +08:00
2024-08-14 10:42:30 +08:00
2024-03-05 17:08:43 -08:00
2024-05-17 10:42:51 -07:00
2024-06-27 00:33:34 -07:00
2023-12-03 23:08:47 +01:00
2024-07-05 21:40:30 -07:00
2024-01-19 07:02:22 -06:00
2024-08-14 10:42:30 +08:00
2024-07-19 22:01:22 +08:00
2023-11-07 09:09:58 -06:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-02-07 01:08:34 +01:00
2024-02-12 09:45:42 -08:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2024-02-02 11:35:26 -08:00
2024-06-27 00:33:34 -07:00
2024-06-18 15:05:30 -05:00
2024-06-27 00:33:34 -07:00
2024-06-18 15:05:30 -05:00
2024-07-12 11:08:42 -07:00
2024-06-27 00:33:34 -07:00
2024-06-14 09:53:03 -05:00
2024-08-06 10:06:10 +02:00
2024-08-06 10:06:10 +02:00
2024-08-06 10:06:10 +02:00
2024-08-10 13:12:05 +02:00
2024-08-06 10:06:10 +02:00
2023-05-31 18:46:57 -05:00
2024-08-06 10:06:10 +02:00
2024-04-03 09:08:08 -05:00
2024-08-06 10:06:10 +02:00
2024-07-12 11:08:42 -07:00
2024-05-20 08:34:45 -07:00
2024-06-27 00:33:34 -07:00
2024-05-17 10:42:51 -07:00
2024-06-18 09:01:49 -05:00
2024-02-02 11:35:26 -08:00
2024-04-02 11:02:52 -05:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-06-27 00:33:34 -07:00
2024-08-06 10:06:10 +02:00
2024-04-19 13:31:17 +02:00
2024-06-27 00:33:34 -07:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2023-05-31 18:46:57 -05:00
2023-08-15 02:25:28 +08:00
2023-08-15 02:25:28 +08:00
2023-06-19 09:44:22 -05:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2023-10-11 14:27:29 -05:00
2023-05-31 18:46:57 -05:00
2024-02-02 11:35:26 -08:00