composable_kernel/example
Ding, Yi 4195052efa Time launcher construction and prepare_workspace in benchmark output
Time the fmha_bwd_launcher constructor with std::chrono and prepare_workspace
with ck_tile::gpu_timer, and append "init:Xms, prws:Yms" to the benchmark
header line. Also move launcher construction to after device buffer
allocation so that its timing is isolated.
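
The timing pattern described above can be sketched roughly as follows. This is a minimal host-only illustration, not the code from the commit: DummyLauncher is a hypothetical stand-in for fmha_bwd_launcher, both phases are timed with std::chrono (the commit uses ck_tile::gpu_timer for prepare_workspace, which is not reproduced here), and the helper names and the three-decimal precision are assumptions.

```cpp
#include <chrono>
#include <cstdio>
#include <string>
#include <thread>

// Hypothetical stand-in for fmha_bwd_launcher; the sleeps simulate work.
struct DummyLauncher
{
    DummyLauncher() { std::this_thread::sleep_for(std::chrono::milliseconds(2)); }
    void prepare_workspace() { std::this_thread::sleep_for(std::chrono::milliseconds(1)); }
};

// Convert a steady_clock interval to fractional milliseconds.
double elapsed_ms(std::chrono::steady_clock::time_point t0,
                  std::chrono::steady_clock::time_point t1)
{
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}

// Build the suffix appended to the benchmark header line.
// The precision (3 decimals) is an assumption; the source only shows "init:Xms, prws:Yms".
std::string format_timing(double init_ms, double prws_ms)
{
    char buf[64];
    std::snprintf(buf, sizeof(buf), "init:%.3fms, prws:%.3fms", init_ms, prws_ms);
    return std::string(buf);
}

std::string timed_header_suffix()
{
    // Device buffer allocation would happen before this point, so that its
    // cost is not counted in the launcher construction time.
    const auto t0 = std::chrono::steady_clock::now();
    DummyLauncher launcher; // construction is the only work between t0 and t1
    const auto t1 = std::chrono::steady_clock::now();

    // Host-side timing used here as a simplification; the commit times this
    // step with ck_tile::gpu_timer instead.
    launcher.prepare_workspace();
    const auto t2 = std::chrono::steady_clock::now();

    return format_timing(elapsed_ms(t0, t1), elapsed_ms(t1, t2));
}
```

Calling timed_header_suffix() returns a string of the form "init:Xms, prws:Yms" that a benchmark could append to its header line. Constructing the launcher only after the buffers exist keeps allocation cost out of the init figure.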
2026-04-22 02:07:55 -05:00