Enrico Degregori
9575bcd099
Fix splitk preshuffle (#3137)
* Fix splitK multiply_multiply_wp
* Add tests for gemm_multiply_multiply_wp
* Add tests for gemm_universal_preshuffle (KBatch = 1)
* Add tests for gemm_blockscale_wp
* Fix splitk gemm universal preshuffle
* Run new tests on arch supporting fp8
* Restore example
* Fix strides profiler
* Fix tests
* Fix clang format
* Finalize profiler preshuffle with tolerances
* Minor improvements to splitk related changes
* Address review comments: clang format and ckProfiler typo
* Remove b_k_split_offset from SplitKBatchOffset struct
[ROCm/composable_kernel commit: 507d81c3af]
2025-11-03 11:59:01 -08:00