Po Yen Chen
2920604786
[CK_TILE] Add logits soft-capping & customization support to the FMHA forward kernel/pipelines (#2163)
* hack for cap logits
* fix bug
* Re-format files
* Allow specifying logits_soft_cap through APIs
* Support turn on/off logits_soft_cap in async pipeline
* Do not generate non-verified kernels
* Align receipt used in Aiter
* Sync logits soft-capping across pipelines
* Re-enable some hdim pipelines
* fix perf
* Add attention variant for logits_soft_cap
* Add newline at end-of-file
* Fix performance
* Add comment to explain logits_soft_cap pre-processing
* Unify code
* Unify floating-point literal style
* Use class data member to slience the compilation error
* [CK_TILE] Update attention customizaton interface: add LogitsMask() (#2133)
* Send 'mask' along with variant params to the LogitsMask()
* Send block indices to the variant
* Add indices parameters in variant interface
* Fix fmha bwd codegen error
* Allow switch logits_soft_cap impl
* Eliminate register spills
* Fix compilation errors
* Fix wrong LSE
* Fix LSE for splitkv kernel
* Sync splitkv pipeline changes
* Add batch_prefill kernel/pipeline
* Fix codegen error
* Undo changes in CMakeLists.txt
* Merge pipeline filtering check
* Use different code path if kHasLogitsSoftCap=false
* Remove [[maybe_unused]] attribute
* Use pre-existing compile-time flag to instantiate templates
* Sync pipeline changes
* Update CHANGELOG.md
---------
Co-authored-by: Bernard <bernaliu@amd.com>
Co-authored-by: coderfeli <coderfeli@163.com>
2025-05-13 12:19:25 +08:00
..
2025-04-14 16:58:57 +08:00
2024-07-03 23:34:38 -07:00
2024-04-02 09:42:17 -07:00
2025-03-05 11:07:33 -08:00
2025-03-17 18:08:53 -07:00
2024-11-14 08:40:50 -08:00
2024-08-13 16:15:47 +02:00
2023-09-20 22:15:56 -07:00
2024-04-02 09:42:17 -07:00
2025-04-03 13:35:43 +02:00
2024-11-14 08:40:50 -08:00
2024-08-06 10:06:10 +02:00
2025-03-05 11:07:33 -08:00
2024-04-19 13:31:17 +02:00
2025-02-18 10:10:22 +01:00
2024-12-03 08:42:55 -08:00
2023-10-18 11:14:14 -05:00
2023-05-31 18:46:57 -05:00
2025-04-14 16:58:57 +08:00
2023-09-20 22:15:56 -07:00
2024-05-10 09:41:39 -07:00
2023-11-28 11:17:37 -08:00
2023-09-20 22:15:56 -07:00
2024-06-27 00:33:34 -07:00
2025-03-04 10:32:25 -08:00
2025-02-07 15:05:05 -07:00
2024-12-03 08:42:55 -08:00
2023-05-31 18:46:57 -05:00
2024-04-19 13:31:17 +02:00
2024-12-03 08:42:55 -08:00
2023-05-31 18:46:57 -05:00
2024-12-03 08:42:55 -08:00
2024-12-03 08:42:55 -08:00
2023-10-18 11:14:14 -05:00
2024-04-02 09:42:17 -07:00
2025-02-07 15:05:05 -07:00
2023-11-28 11:17:37 -08:00
2023-09-20 22:15:56 -07:00
2024-11-05 13:58:29 -08:00
2023-10-19 07:36:29 +08:00
2024-05-10 09:41:39 -07:00
2024-12-03 08:42:55 -08:00
2024-01-24 13:47:48 -08:00
2023-09-20 22:15:56 -07:00
2023-09-20 22:15:56 -07:00
2024-01-24 13:47:48 -08:00
2024-04-02 09:42:17 -07:00
2023-12-19 04:23:11 +08:00
2023-12-19 04:23:11 +08:00
2024-12-03 08:42:55 -08:00
2024-04-26 07:26:30 -05:00
2024-07-11 18:08:07 -07:00
2025-03-05 11:07:33 -08:00
2023-11-28 11:17:37 -08:00
2024-04-02 09:42:17 -07:00
2025-05-12 09:52:58 -07:00
2024-10-22 09:26:18 +08:00
2025-05-01 11:55:48 -06:00
2025-05-13 12:19:25 +08:00
2025-04-30 10:20:16 -07:00
2024-12-04 00:46:47 +01:00