This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-16 19:09:59 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
9cb2dca9585e38921388ca65b9c187b64ddcea28
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
/
thread
History
Andriy Roshchenko
6660dc6b8e
Introduce MX GEMM for FP8 data type (
#2000
)
2025-03-24 15:41:07 -06:00
..
reduction_functions_threadwise.hpp
…
threadwise_contraction_dl.hpp
…
threadwise_gemm_dlops_v3.hpp
…
threadwise_tensor_slice_set.hpp
…
threadwise_tensor_slice_transfer_util.hpp
Added Multi_ABD support into Gemm and GroupedGemmFixedNK (
#978
)
2024-04-15 21:09:45 -05:00
threadwise_tensor_slice_transfer_v3r1_dequant.hpp
…
threadwise_tensor_slice_transfer_v3r1_gather.hpp
ck_moe: fix useless code and remove usless oob (
#1972
)
2025-03-12 09:22:42 -07:00
threadwise_tensor_slice_transfer_v3r1.hpp
[A8W8 GEMM] Optimized weight-preshuffled implementation & add quantization datatype for CK TILE rms_norm (
#1862
)
2025-02-20 14:00:27 -08:00
threadwise_tensor_slice_transfer_v3r2.hpp
Add elementwise with dynamic vector dim (
#1198
)
2024-03-22 10:40:43 +01:00
threadwise_tensor_slice_transfer_v4r1.hpp
…
threadwise_tensor_slice_transfer_v5r1.hpp
Moficiation to fix this issue "threadwise_tensor_slice_transfer_v5r1 issue
#1279
" (
#1492
)
2024-09-04 21:52:55 -07:00
threadwise_tensor_slice_transfer_v6r1.hpp
…
threadwise_tensor_slice_transfer_v6r1r2.hpp
…
threadwise_tensor_slice_transfer_v6r2.hpp
…
threadwise_tensor_slice_transfer_v6r3.hpp
…
threadwise_tensor_slice_transfer_v7.hpp
…
threadwise_tensor_slice_transfer_v7r2.hpp
bf16A_Int8B with fastgelu/bias (
#1264
)
2024-04-26 07:26:30 -05:00
threadwise_tensor_slice_transfer_v7r3_scatter.hpp
Ck moe hot fix (
#1979
)
2025-03-19 22:58:27 +08:00
threadwise_tensor_slice_transfer_v7r3.hpp
add f8 gemm multiD with both row/col wise scale (
#1300
)
2024-05-28 12:04:22 -05:00
threadwise_tensor_slice_transfer.hpp
Introduce MX GEMM for FP8 data type (
#2000
)
2025-03-24 15:41:07 -06:00
threadwise_welford.hpp
…