Po Yen Chen
b0cfd7f12e
[CK_TILE] Fix incorrect computation of group mode PagedAttention (#1688)
* Allow getting batch size from splitkv tile partitioner
* Fix wrong paged-kvcache impl for group mode
* Fix wrong example code for page-kvcache
* Undo changes in fmha_fwd.cpp
* Always use 2D block table
* Add is_gappy kernel argument for paged-kvcache
The is_gappy argument is used for differentiating seqstart_k_ptr usage
in flash-attention & xformers
* Remove out-of-date comments
* Remove no-longer used method
* Fix wrong # page-block calculation
* Fix wrong comment
---------
Co-authored-by: Qianfeng <qianfeng.zhang@amd.com>
[ROCm/composable_kernel commit: cf2d635ea2]
2024-11-26 20:37:54 +08:00
..
2024-11-26 08:45:14 +01:00
2024-07-03 23:34:38 -07:00
2024-04-02 09:42:17 -07:00
2024-06-27 00:33:34 -07:00
2024-05-10 22:48:28 -07:00
2024-11-14 08:40:50 -08:00
2024-08-13 16:15:47 +02:00
2023-09-20 22:15:56 -07:00
2024-04-02 09:42:17 -07:00
2024-06-18 09:01:49 -05:00
2024-11-14 08:40:50 -08:00
2024-08-06 10:06:10 +02:00
2024-01-24 13:47:48 -08:00
2024-04-19 13:31:17 +02:00
2024-08-14 10:42:30 +08:00
2024-04-19 13:31:17 +02:00
2023-10-18 11:14:14 -05:00
2023-05-31 18:46:57 -05:00
2024-11-18 14:03:45 +01:00
2023-09-20 22:15:56 -07:00
2024-05-10 09:41:39 -07:00
2023-11-28 11:17:37 -08:00
2023-09-20 22:15:56 -07:00
2024-06-27 00:33:34 -07:00
2024-07-03 23:34:38 -07:00
2024-04-02 09:42:17 -07:00
2024-07-03 23:34:38 -07:00
2023-05-31 18:46:57 -05:00
2024-04-19 13:31:17 +02:00
2024-07-19 22:01:22 +08:00
2023-05-31 18:46:57 -05:00
2024-10-26 15:22:37 +02:00
2024-07-03 23:34:38 -07:00
2023-10-18 11:14:14 -05:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2023-11-28 11:17:37 -08:00
2023-09-20 22:15:56 -07:00
2024-11-05 13:58:29 -08:00
2023-10-19 07:36:29 +08:00
2024-05-10 09:41:39 -07:00
2024-04-02 09:42:17 -07:00
2024-01-24 13:47:48 -08:00
2023-09-20 22:15:56 -07:00
2023-09-20 22:15:56 -07:00
2024-01-24 13:47:48 -08:00
2024-04-02 09:42:17 -07:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2024-04-30 08:28:19 -07:00
2024-04-26 07:26:30 -05:00
2024-07-11 18:08:07 -07:00
2024-10-26 15:22:37 +02:00
2023-11-28 11:17:37 -08:00
2024-04-02 09:42:17 -07:00
2024-10-26 16:39:34 +08:00
2024-10-22 09:26:18 +08:00
2024-11-26 20:37:54 +08:00
2024-11-07 14:14:42 -08:00