linqunAMD
1e9b1826b5
[CK_TILE][REGRESSION] Correct blockSize in Generic2dBlockShape (c254f… ( #2837 )
...
* [CK_TILE][REGRESSION] Correct blockSize in Generic2dBlockShape (5b17f135b7 )
WarpPerBlock_M * WarpPerBlock_N are not equal with ThreadPerBlock_M * ThreadPerBlock_N /warpSize. we should calculate BlockSize from WarpPerBlock_M * WarpPerBlock_N
To compatible with wave32, function GetBlockSize is added to calculate correct size in host side.
* fix blocksize for all kernel related with generic2dblockshap
* remove constexpr for blocks
[ROCm/composable_kernel commit: b7a806f244 ]
2025-09-16 08:47:55 -07:00
..
2025-06-24 07:28:13 -07:00
2025-09-12 08:17:07 -07:00
2023-09-20 22:15:56 -07:00
2024-04-02 09:42:17 -07:00
2025-09-12 08:17:07 -07:00
2025-09-12 08:17:07 -07:00
2023-09-26 08:39:11 -07:00
2025-07-22 10:52:10 -07:00
2023-05-31 18:46:57 -05:00
2025-09-16 08:47:55 -07:00
2025-09-12 08:17:07 -07:00
2024-01-24 13:47:48 -08:00
2024-08-06 10:06:10 +02:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2025-07-28 11:34:07 -07:00
2023-09-20 22:15:56 -07:00
2023-09-20 22:15:56 -07:00
2025-09-05 16:31:08 +02:00
2025-07-24 18:49:58 -07:00
2025-09-12 08:17:07 -07:00
2025-07-11 15:32:12 -06:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2025-06-17 14:29:45 -07:00
2025-09-12 21:36:43 +02:00
2025-06-17 14:29:45 -07:00
2025-09-15 10:59:25 -07:00
2025-09-15 10:59:25 -07:00
2025-09-15 10:59:25 -07:00
2025-09-15 10:59:25 -07:00
2025-09-12 08:17:07 -07:00
2023-05-31 18:46:57 -05:00
2025-08-04 11:43:47 -07:00
2024-04-25 15:12:53 -05:00
2024-04-25 15:12:53 -05:00
2023-12-19 04:23:11 +08:00
2024-04-02 09:42:17 -07:00
2025-07-28 11:34:07 -07:00
2024-05-28 11:13:21 +08:00
2024-08-12 16:28:10 +02:00
2025-07-28 11:34:07 -07:00
2025-07-16 07:58:23 -07:00
2025-02-07 15:05:05 -07:00
2023-08-23 11:36:17 -07:00
2023-05-31 18:46:57 -05:00
2024-04-02 09:42:17 -07:00
2024-07-03 23:34:38 -07:00
2025-09-12 08:17:07 -07:00
2025-09-15 10:59:25 -07:00