Sami Remes
919096fde8
[rocm-libraries] ROCm/rocm-libraries#7935 (commit 5c96097)
[CK] Allow skipping split-K C-buffer zero-init in
xdl_cshuffle blockscale GEMM (#7935)
Add a `skip_zero_init` flag (default false) to the Problem/Argument of
the xdl_cshuffle block-scale GEMM device ops (multiple_d ab_scale and
blockscale b-preshuffle). When the flag is set, the device invoker skips
the internal hipMemsetAsync that zeroes p_c_grid before the KBatch > 1
split-K atomic-accumulation path. The flag is declared on the gridwise
Problem struct (inherited by Argument), so it is visible on both the
rotating-cache (arg_) and the normal (arg) launch paths in each device
op.
Why: callers that already pre-zero the output buffer otherwise pay for a
redundant device-wide memset before split-K atomic accumulation. Gating
the memset behind an opt-in flag lets such callers avoid the duplicate
work. Because the flag defaults to false, every existing call site is
unaffected and the observable behavior is unchanged.
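The gating described above can be sketched as follows. This is a minimal host-side illustration, not the actual CK code: the `Argument` struct here is a hypothetical stand-in that only borrows the field names mentioned in this message (`p_c_grid`, `KBatch`, `skip_zero_init`), `c_elems` and `zero_c_if_needed` are made-up names, and `std::memset` stands in for the `hipMemsetAsync` call so the sketch runs without a GPU.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical stand-in for the Problem/Argument fields named in the commit
// message. The real structs carry many more members.
struct Argument
{
    float* p_c_grid      = nullptr; // output buffer C
    std::size_t c_elems  = 0;       // element count of C (illustrative only)
    int KBatch           = 1;       // split-K factor
    bool skip_zero_init  = false;   // opt-in: caller has already zeroed C
};

// Zero the C buffer only when split-K accumulation is active and the caller
// has not opted out. Returns true if the buffer was zeroed here.
// std::memset is a host-side placeholder for the device memset.
inline bool zero_c_if_needed(const Argument& arg)
{
    if(arg.KBatch > 1 && !arg.skip_zero_init)
    {
        assert(arg.p_c_grid != nullptr);
        std::memset(arg.p_c_grid, 0, arg.c_elems * sizeof(float));
        return true;
    }
    return false;
}
```

With the default `skip_zero_init = false` and `KBatch > 1`, the buffer is zeroed exactly as before. Setting the flag leaves the buffer untouched, so the caller is then responsible for having zeroed it before the atomic accumulation starts.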
Co-authored-by: Cursor <cursoragent@cursor.com>
2026-06-02 13:08:46 +00:00