ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-05-12 09:16:52 +00:00
bc2551ac3b27edc31f20863e3a873508fb73aad2
composable_kernel/include/ck/tensor_operation/gpu/block
Thomas Ning 1386924749 Add the instances for small sized GEMM in preshuffle and improve CMake Flag (#2212)
* Add small instance, add the bug fix, & improve the example CMake
* clang format
2025-05-20 15:05:08 -07:00
blockwise_gemm_dl_v2r3.hpp
…
blockwise_gemm_dlops_v2r2.hpp
…
blockwise_gemm_dlops_v3.hpp
…
blockwise_gemm_dpp.hpp
…
blockwise_gemm_mx_pipeline_xdlops_base.hpp
…
blockwise_gemm_pipeline_wmma_selector.hpp
DeviceGemm_Wmma_CShuffleV3 with BlockGemmPipelineVersion::v3 (#2096)
2025-04-28 10:14:21 +05:00
blockwise_gemm_pipeline_wmmaops_base.hpp
DeviceGemm_Wmma_CShuffleV3 with BlockGemmPipelineVersion::v3 (#2096)
2025-04-28 10:14:21 +05:00
blockwise_gemm_pipeline_wmmaops_v3.hpp
DeviceGemm_Wmma_CShuffleV3 with BlockGemmPipelineVersion::v3 (#2096)
2025-04-28 10:14:21 +05:00
blockwise_gemm_pipeline_wmmaops.hpp
DeviceGemm_Wmma_CShuffleV3 with BlockGemmPipelineVersion::v3 (#2096)
2025-04-28 10:14:21 +05:00
blockwise_gemm_pipeline_xdlops_ab_scale_selector.hpp
…
blockwise_gemm_pipeline_xdlops_b_preshuffle_dequant_v1.hpp
…
blockwise_gemm_pipeline_xdlops_b_preshuffle_dequant_v3.hpp
Use new mfma instructions for FP8 on gfx950 (#2202)
2025-05-19 17:29:51 -07:00
blockwise_gemm_pipeline_xdlops_b_preshuffle_gufusion_dequant_v1.hpp
Moe gemm activation (#2026)
2025-04-23 10:35:34 +08:00
blockwise_gemm_pipeline_xdlops_b_preshuffle_gufusion_v1.hpp
Moe gemm activation (#2026)
2025-04-23 10:35:34 +08:00
blockwise_gemm_pipeline_xdlops_b_preshuffle_selector.hpp
Moe gemm activation (#2026)
2025-04-23 10:35:34 +08:00
blockwise_gemm_pipeline_xdlops_b_preshuffle_v1.hpp
Improve the general performance of the Preshuffled GEMM V3 & delete the unnecessary instances (#2166)
2025-05-12 09:52:58 -07:00
blockwise_gemm_pipeline_xdlops_b_preshuffle_v2.hpp
Add the instances for small sized GEMM in preshuffle and improve CMake Flag (#2212)
2025-05-20 15:05:08 -07:00
blockwise_gemm_pipeline_xdlops_b_preshuffle_v3.hpp
Narrowing error fix for codegen compilation (#2194)
2025-05-16 11:11:54 -07:00
blockwise_gemm_pipeline_xdlops_b_scale_selector.hpp
…
blockwise_gemm_pipeline_xdlops_base.hpp
Improve the general performance of the Preshuffled GEMM V3 & delete the unnecessary instances (#2166)
2025-05-12 09:52:58 -07:00
blockwise_gemm_pipeline_xdlops_mx_selector.hpp
…
blockwise_gemm_pipeline_xdlops_selector.hpp
…
blockwise_gemm_pipeline_xdlops_v1_ab_scale.hpp
…
blockwise_gemm_pipeline_xdlops_v1_b_scale.hpp
…
blockwise_gemm_pipeline_xdlops_v1_mx.hpp
…
blockwise_gemm_pipeline_xdlops_v1.hpp
…
blockwise_gemm_pipeline_xdlops_v2_ab_scale.hpp
…
blockwise_gemm_pipeline_xdlops_v2_b_scale.hpp
…
blockwise_gemm_pipeline_xdlops_v2.hpp
…
blockwise_gemm_pipeline_xdlops_v3_ab_scale.hpp
…
blockwise_gemm_pipeline_xdlops_v3_b_scale.hpp
…
blockwise_gemm_pipeline_xdlops_v3.hpp
…
blockwise_gemm_pipeline_xdlops_v4_b_scale.hpp
…
blockwise_gemm_pipeline_xdlops_v4.hpp
…
blockwise_gemm_pipeline_xdlops_v5.hpp
…
blockwise_gemm_pipeline_xdlops.hpp
…
blockwise_gemm_smfmac_xdlops.hpp
…
blockwise_gemm_wmma.hpp
…
blockwise_gemm_xdlops_skip_b_lds.hpp
…
blockwise_gemm_xdlops.hpp
…
blockwise_softmax.hpp
…
blockwise_tensor_slice_transfer_v5r1.hpp
…
blockwise_welford.hpp
…
reduction_functions_blockwise.hpp
…
thread_group_tensor_slice_transfer_direct_load.hpp
…
thread_group_tensor_slice_transfer_v4r1_dequant.hpp
…
thread_group_tensor_slice_transfer_v4r1_gather.hpp
Moe gemm activation (#2026)
2025-04-23 10:35:34 +08:00
thread_group_tensor_slice_transfer_v4r1.hpp
…
thread_group_tensor_slice_transfer_v4r2.hpp
…
thread_group_tensor_slice_transfer_v6r1.hpp
…
thread_group_tensor_slice_transfer_v6r1r2.hpp
…
thread_group_tensor_slice_transfer_v6r2.hpp
…
thread_group_tensor_slice_transfer_v6r3.hpp
…
thread_group_tensor_slice_transfer_v7.hpp
…
thread_group_tensor_slice_transfer_v7r2.hpp
…
thread_group_tensor_slice_transfer_v7r3_scatter.hpp
Moe gemm activation (#2026)
2025-04-23 10:35:34 +08:00
thread_group_tensor_slice_transfer_v7r3.hpp
…