ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-06-30 11:47:48 +00:00)
Commit: 6443574bf034f2039adb2eebb8a8f8a866ea0cef
Path: composable_kernel/include/ck_tile/core/arch
TianyuanWu  6443574bf0  2025-08-08 17:37:36 +08:00  Merge remote-tracking branch 'origin/develop' into tianyuwu/ck_tile/WMMA_GEMM_F16
File                                Last modified               Last commit
amd_buffer_addressing_builtins.hpp  2025-07-28 23:56:53 -07:00  Expand the bandwidth of direct_global_to_lds for gfx950 (#2576)
amd_buffer_addressing.hpp           2025-08-07 03:53:49 +00:00  Revert "Add atomic add fallback method for gfx11"
amd_transpose_load_encoding.hpp     2025-07-16 23:56:22 -07:00  [CK_TILE] Use read_tr in universal gemm (#2436)
arch.hpp                            2025-08-08 17:37:36 +08:00  Merge remote-tracking branch 'origin/develop' into tianyuwu/ck_tile/WMMA_GEMM_F16
generic_memory_space_atomic.hpp     2025-08-07 03:58:52 +00:00  Revert "Enable CK_TILE_USE_AMD_BUFFER_ATOMIC_ADD_FLOAT for gfx12"
utility.hpp                         2025-06-17 11:54:30 -07:00  Do not use warpSize as compile time constant as it is removed (#2320)
workgroup_barrier.hpp               2025-05-06 17:32:07 +08:00  [CK_TILE] optimize moe sorting kernel, boost large context case up to 20x (#2153)