composable_kernel/include/ck_tile/ops at aghamari/ua-on-develop - composable_kernel - Public git mirror

ROCm/composable_kernel

mirror of https://github.com/ROCm/composable_kernel.git synced 2026-06-29 19:28:33 +00:00

Files

History

Amir Ghamarian 4af7e472a3 Add unified attention kernel on top of CK develop

Cherry-picked all unified attention files from aghamari/unified-attention-decode-opt
onto CK develop (046d3ac27). Includes:
- Unified attention pipeline, kernel, and block masking
- All kernel tiers: large (8-warp), medium (4-warp), small (2-warp), tiny (1-warp)
- block_size=32 support with bs32 narrow tier (2-warp 16x16 MFMA kBlockM=32)
- int32 overflow fix (long_index_t for KV cache strides)
- BlockSize_ template parameter for flexible page block sizes
- Example binary and 40 instance files

Made-with: Cursor

2026-03-30 17:37:12 +00:00

..

add_rmsnorm2d_rdquant

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

batched_contraction

[CK Tile] batched contraction kernel generalizing (#3126 )

2025-12-02 13:30:27 +01:00

batched_transpose

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

[rocm-libraries] ROCm/rocm-libraries#5237 (commit ef10dc6)

2026-03-13 01:21:08 +00:00

[rocm-libraries] ROCm/rocm-libraries#4302 (commit e62bd8a)

2026-03-19 09:19:06 +00:00

[rocm-libraries] ROCm/rocm-libraries#5045 (commit 64a5502)

2026-03-03 21:55:14 +00:00

[rocm-libraries] ROCm/rocm-libraries#4302 (commit e62bd8a)

2026-03-19 09:19:06 +00:00

[rocm-libraries] ROCm/rocm-libraries#4819 (commit b995a0b)

2026-02-25 16:13:13 +00:00

[rocm-libraries] ROCm/rocm-libraries#5789 (commit 6654ca6)

2026-03-26 01:41:35 +00:00

[rocm-libraries] ROCm/rocm-libraries#5095 (commit 7e55766)

2026-03-20 01:08:52 +00:00

[rocm-libraries] ROCm/rocm-libraries#5323 (commit 5454e9e)

2026-03-17 18:58:56 +00:00

grouped_convolution

[rocm-libraries] ROCm/rocm-libraries#5516 (commit ff3afda)

2026-03-25 14:36:11 +00:00

image_to_column

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

Shuffle fix for gfx950 (#3491 )

2026-01-13 09:21:29 -08:00

[CK Tile] multi reduce improvements (#3607 )

2026-01-27 12:56:09 -08:00

Fix redundant cast in model sensitive rmsnorm (#3681 )

2026-01-30 10:52:19 +08:00

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

[rocm-libraries] ROCm/rocm-libraries#4274 (commit 7c380df)

2026-02-11 05:52:42 +00:00

[CK_TILE][FMHA] Add sparse attention VSA (#3341 )

2026-01-31 00:59:47 +08:00

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

Shuffle fix for gfx950 (#3491 )

2026-01-13 09:21:29 -08:00

unified_attention

Add unified attention kernel on top of CK develop

2026-03-30 17:37:12 +00:00

add_rmsnorm2d_rdquant.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

batched_contraction.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

batched_transpose.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

common.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

elementwise.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

epilogue.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

flatmm.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

fmha.hpp

[rocm-libraries] ROCm/rocm-libraries#4368 (commit 17f7dfc)

2026-03-11 10:00:52 +00:00

fused_moe.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

gemm_mx.hpp

[rocm-libraries] ROCm/rocm-libraries#5241 (commit 43daeac)

2026-03-12 08:27:49 +00:00

gemm_quant.hpp

[rocm-libraries] ROCm/rocm-libraries#4964 (commit 3271d9a)

2026-03-16 08:31:56 +00:00

gemm.hpp

[rocm-libraries] ROCm/rocm-libraries#4964 (commit 3271d9a)

2026-03-16 08:31:56 +00:00

grouped_convolution.hpp

[rocm-libraries] ROCm/rocm-libraries#5241 (commit 43daeac)

2026-03-12 08:27:49 +00:00

image_to_column.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

layernorm2d.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

moe_flatmm.hpp

chore(copyright): update copyright header for include directory (#3293 )

2025-11-26 11:00:05 -07:00

norm_reduce.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

permute.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

pooling.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

reduce.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

rmsnorm2d.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

smoothquant.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

softmax.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

sparse_attn.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

topk_softmax.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

topk.hpp

[rocm-libraries] ROCm/rocm-libraries#4294 (commit 6601702)

2026-03-02 12:21:44 +00:00

unified_attention.hpp

Add unified attention kernel on top of CK develop

2026-03-30 17:37:12 +00:00