jian.wu
2bbff45dcb
[CK_TILE][FMHA][Feature] Add support for large hdim
...
* root cause: fhma_bwd not support if hdim > 256 due to the use of LDS goes beyond the hardware limitations.
* solution: 1. split dqdkdv kernel into 2 kernels.
* 1) QGrad
* 2) KGrad & VGrad
* 2. reuse LDS memory.
* 1). K and K^T use same LDS memory in dq kernel
* 2). OGrad and OGrad^T use same LDS memory in dq kernel
* 3. to avoid or reduce the number of VGPR spills, the calculation order has been readjusted, and prefetch has been disabled.
2025-08-15 16:23:32 +08:00
..
2025-07-28 11:34:07 -07:00
2024-07-03 23:34:38 -07:00
2024-04-02 09:42:17 -07:00
2025-06-05 13:54:15 -06:00
2025-03-17 18:08:53 -07:00
2024-11-14 08:40:50 -08:00
2025-07-28 11:34:07 -07:00
2023-09-20 22:15:56 -07:00
2024-04-02 09:42:17 -07:00
2025-05-19 14:25:50 -07:00
2025-07-28 11:34:07 -07:00
2024-08-06 10:06:10 +02:00
2025-07-28 11:34:07 -07:00
2024-04-19 13:31:17 +02:00
2025-02-18 10:10:22 +01:00
2024-12-03 08:42:55 -08:00
2023-10-18 11:14:14 -05:00
2023-05-31 18:46:57 -05:00
2025-06-05 13:54:15 -06:00
2023-09-20 22:15:56 -07:00
2024-05-10 09:41:39 -07:00
2025-07-28 11:34:07 -07:00
2023-09-20 22:15:56 -07:00
2024-06-27 00:33:34 -07:00
2025-03-04 10:32:25 -08:00
2025-02-07 15:05:05 -07:00
2025-07-28 11:34:07 -07:00
2023-05-31 18:46:57 -05:00
2025-07-16 07:58:23 -07:00
2025-06-05 13:54:15 -06:00
2025-07-28 11:34:07 -07:00
2024-12-03 08:42:55 -08:00
2025-07-17 08:19:57 +08:00
2025-07-28 11:34:07 -07:00
2024-04-02 09:42:17 -07:00
2025-02-07 15:05:05 -07:00
2025-07-28 11:34:07 -07:00
2023-09-20 22:15:56 -07:00
2025-07-28 11:34:07 -07:00
2023-10-19 07:36:29 +08:00
2024-05-10 09:41:39 -07:00
2024-12-03 08:42:55 -08:00
2024-01-24 13:47:48 -08:00
2023-09-20 22:15:56 -07:00
2023-09-20 22:15:56 -07:00
2024-01-24 13:47:48 -08:00
2024-04-02 09:42:17 -07:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2024-12-03 08:42:55 -08:00
2024-04-26 07:26:30 -05:00
2025-07-28 11:34:07 -07:00
2025-07-28 11:34:07 -07:00
2025-07-28 11:34:07 -07:00
2024-04-02 09:42:17 -07:00
2025-08-06 00:34:39 -07:00
2025-06-10 09:13:59 -07:00
2025-08-06 00:34:39 -07:00
2025-08-15 16:23:32 +08:00
2025-07-24 17:38:14 +08:00
2024-12-04 00:46:47 +01:00