Po Yen Chen
7fbc9d6c97
[CK_TILE] FMHA FAv3 scheduling fine-tuning for performance (#2833)
* Re-mapping thread block indices for causal=True kernels
* Use more intuitive remap_opt value
* Fall back to original remapping if seqlen_q >= 64K
* Use GenericAttentionMask to reduce mask computation
* Avoid unnecessary boundary check for IsMasking=false case
* Fix wrong kernel entry specifier
* Add s_nop to avoid delaying wave0-3
* Refine scheduling
* Remove unnecessary sched_group_barrier()
* Move sched_group_barrier() call to scheduler
* Replace inline asm s_setprio with intrinsics
* Rephrase comments
* Expand some o_acc rescaling insts to avoid SIMD idle
* Fix block idx special mapping logic
* Tune block index mapping for causal=False cases
* Tune block index mapping for causal=True cases
* Fix wrong vmcnt()
* Remove parameter name
* Use boolean option to turn causal mask on/off
* Update benchmark_fwd_v3.sh option usages
* Add option if compiler supports it
2025-09-16 11:32:38 +08:00