Po Yen Chen
7fcd094b8c
[CK_TILE] Add FAv3 fwd pipeline (#2731)
* Add FAv3 fwd pipeline
* Unpack v_pk_mul to hide v_mov
* Avoid compiler moving l compute across phase
* Sync sched_group_barrier() setting for masking cases
[ROCm/composable_kernel commit: d876e87fe4]
2025-09-01 09:16:45 +08:00