Haocong WANG
020148d0f7
[BlockScale GEMM] FP8 Blockscale GEMM optimization and ckProfiler ( #1913 )
...
* Added two kernel for M=32 problem
* Comment the first one
* Enable multiply_multiply for Scale_Block_M = 1 for deepseek
* Modify the a_thread offset since the A data load is different from B.
* edit fp8 ab scale for Scale_Block_M=1
* edit GemmSpec to MNKPadding
* enable blockwise pipelie v1 and v2. v1 is work for small K.
* add instance for gemm_ab_scale
* fix cmakelist of ckProfiler
* optimize blockscale gemm. todo: reduce vgpr usage
* fix a correctness bug
* sanity checked
* revert ckprofiler cmake changes
* clang format
* revert unnecessary changes.
* remove commented codes.
---------
Co-authored-by: mtgu0705 <mtgu@amd.com >
Co-authored-by: chenjun <junchen2@amd.com >
2025-02-25 15:42:20 +08:00
..
2025-02-07 15:05:05 -07:00
2024-09-12 11:47:52 +02:00
2023-08-10 12:04:35 +08:00
2024-06-27 00:33:34 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2024-06-27 00:33:34 -07:00
2025-02-07 15:05:05 -07:00
2024-12-13 21:08:35 +01:00
2025-02-07 15:05:05 -07:00
2024-06-27 00:33:34 -07:00
2025-02-07 15:05:05 -07:00
2025-02-20 18:58:14 -08:00
2025-02-10 11:17:02 +08:00
2025-02-07 15:05:05 -07:00
2023-11-30 15:09:27 -06:00
2023-11-30 15:09:27 -06:00
2023-11-30 15:09:27 -06:00
2024-10-12 14:05:11 +08:00
2025-01-31 09:48:39 -08:00
2025-02-07 15:05:05 -07:00
2024-06-27 00:33:34 -07:00
2024-06-18 10:26:49 +02:00
2023-08-18 11:14:59 +08:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2023-05-31 18:46:57 -05:00
2025-02-07 15:05:05 -07:00
2024-06-27 00:33:34 -07:00
2024-05-17 10:42:51 -07:00
2024-04-19 13:31:17 +02:00
2023-05-31 18:46:57 -05:00
2024-04-19 13:31:17 +02:00
2024-06-27 00:33:34 -07:00
2023-07-26 07:19:55 -07:00
2024-06-27 00:33:34 -07:00
2024-05-10 09:41:39 -07:00
2024-04-26 07:26:30 -05:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2024-06-27 00:33:34 -07:00
2023-11-25 13:35:22 +01:00
2025-02-25 15:42:20 +08:00
2025-02-20 14:00:27 -08:00
2025-02-20 14:00:27 -08:00
2025-02-20 18:58:14 -08:00
2024-05-17 10:42:51 -07:00
2024-06-27 00:33:34 -07:00
2023-12-03 23:08:47 +01:00
2025-01-02 10:30:04 -08:00
2024-01-19 07:02:22 -06:00
2025-01-03 18:35:21 +08:00
2025-02-20 14:00:27 -08:00
2024-07-19 22:01:22 +08:00
2023-11-07 09:09:58 -06:00
2024-05-17 10:42:51 -07:00
2024-05-17 10:42:51 -07:00
2025-01-31 09:48:39 -08:00
2025-01-31 09:48:39 -08:00
2024-02-02 11:35:26 -08:00
2025-02-07 15:05:05 -07:00
2024-02-02 11:35:26 -08:00
2025-02-07 15:05:05 -07:00
2024-12-06 10:55:23 +01:00
2025-02-07 15:05:05 -07:00
2024-09-03 10:52:03 +02:00
2025-02-07 15:05:05 -07:00
2025-02-11 17:25:00 -07:00
2024-09-03 10:52:03 +02:00
2025-02-19 13:47:39 -08:00
2025-02-20 10:02:08 +01:00
2024-11-05 09:59:08 -08:00
2024-08-06 10:06:10 +02:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2023-05-31 18:46:57 -05:00
2024-08-06 10:06:10 +02:00
2024-04-03 09:08:08 -05:00
2025-02-07 15:05:05 -07:00
2024-09-20 10:45:46 +02:00
2024-10-04 17:32:43 +02:00
2024-12-02 09:13:56 +01:00
2024-12-02 09:13:56 +01:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2025-02-07 15:05:05 -07:00
2024-06-27 00:33:34 -07:00
2025-01-31 09:48:39 -08:00
2024-04-19 13:31:17 +02:00
2024-06-27 00:33:34 -07:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2023-05-31 18:46:57 -05:00
2024-09-11 15:21:00 +02:00
2023-08-15 02:25:28 +08:00
2023-06-19 09:44:22 -05:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2023-10-11 14:27:29 -05:00
2023-05-31 18:46:57 -05:00
2025-02-07 15:05:05 -07:00