composable_kernel/include/ck_tile/ops at 2d3020e5b03109a56fc2498a721134e5c34ab10f - composable_kernel - Public git mirror

ROCm/composable_kernel

mirror of https://github.com/ROCm/composable_kernel.git synced 2026-04-19 22:39:03 +00:00

msaffari-amd 2d3020e5b0 [CK Tile] batched contraction kernel generalizing (#3126)

* Add help for example

* Refactor the batched contraction reference computation to handle stride-aware calculation, plus some code cleanup

* Add stride-aware reference for batched contraction with independent D tensor layouts

* Add -num_d argument for runtime D tensor count selection in batched contraction

* Add stride vector arguments in example code for testing non-contiguous batched contraction inputs

* Add descriptor-based architecture for batched contraction multi-dimensional stride support

* Add multi-dimensional non-contiguous stride support to batched contraction, num_d = 0

* Add complete multi-dimensional stride support via descriptors

* Enable vectorization in descriptor-based batched contraction. Add pad_tensor_view to local RunGemm

* Clean up batched contraction: remove old UniversalGemmKernel path

* Clean up batched contraction: remove legacy paths and finalize docs

* Optimize batched contraction example: pass dimension sizes not vectors

* Correct the reference calculation: change unsigned int to int

* Fix batched_contraction C++17 build errors for gfx90a CI

2025-12-02 13:30:27 +01:00

..

add_rmsnorm2d_rdquant

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

batched_contraction

[CK Tile] batched contraction kernel generalizing (#3126)

2025-12-02 13:30:27 +01:00

batched_transpose

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

[CK_TILE] Move DataTypeTraits into a Common File (#3146)

2025-11-27 09:09:54 -08:00

Fix and improve the gemm quant pipeline infrastructure (#3245)

2025-11-26 18:04:27 -08:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

[CK_Tile] Flatmm MX Cleanup & Explicit Offset Calculation (#3286)

2025-12-02 14:21:12 +08:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

[CK_TILE] Fix for comp pipeline v4 (#3307)

2025-12-02 11:38:06 +01:00

Make CK TILE GEMM Aquant support block tile 128x128x128 (#3325)

2025-12-01 15:04:37 -08:00

grouped_convolution

[CK_TILE] Add indexing optimizations for conv bwd data (#3309)

2025-12-02 11:37:26 +01:00

image_to_column

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

[CK Tile] enable building examples by default (#3259)

2025-11-26 16:24:44 -08:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

add_rmsnorm2d_rdquant.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

batched_contraction.hpp

Update pre-commit to fixed versions, run remod for ck_tile (#2895)

2025-10-16 15:29:17 -07:00

batched_transpose.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

common.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

elementwise.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

epilogue.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

flatmm.hpp

[CK_TILE] Add mxfp4 flatmm (#3080)

2025-10-31 11:29:05 +08:00

fmha.hpp

Support fp8 dynamic quantization for fmha (#3206)

2025-11-24 16:28:25 +08:00

fused_moe.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

gemm_quant.hpp

formatting (#3182)

2025-11-11 07:42:26 -08:00

gemm.hpp

Replace CK_TILE_PIPELINE macros with a common enum

2025-11-03 09:35:05 -07:00

grouped_convolution.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

image_to_column.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

layernorm2d.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

moe_flatmm.hpp

chore(copyright): update copyright header for include directory (#3293)

2025-11-26 11:00:05 -07:00

norm_reduce.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

permute.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

pooling.hpp

Update pre-commit to fixed versions, run remod for ck_tile (#2895)

2025-10-16 15:29:17 -07:00

reduce.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

rmsnorm2d.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

smoothquant.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

softmax.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

topk_softmax.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00

topk.hpp

Update include path to break the remod's cyclic dep issue (#2978)

2025-10-13 13:24:47 +02:00