This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-04 05:31:24 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
85976b0b8724c47d3e3aee8d5e6a2156cea4f5d7
composable_kernel
/
include
/
ck_tile
/
core
/
arch
History
Feng Shijie
81899bd920
add pk_fp4_t and e8m0_t support for amd_buffer_load_impl
2025-08-20 06:40:03 +00:00
..
amd_buffer_addressing_builtins.hpp
add pk_fp4_t and e8m0_t support for amd_buffer_load_impl
2025-08-20 06:40:03 +00:00
amd_buffer_addressing.hpp
update
2025-08-11 11:24:34 +00:00
amd_transpose_load_encoding.hpp
transpose load api development (
#2177
)
2025-06-18 01:28:34 -07:00
arch.hpp
Remove usage of 'warpSize' variable as it has been deprecated (
#2295
)
2025-06-10 07:34:54 -07:00
generic_memory_space_atomic.hpp
add moe_flatmm
2025-08-06 08:33:33 +00:00
utility.hpp
Do not use warpSize as compile time constant as it is removed (
#2320
)
2025-06-17 11:54:30 -07:00
workgroup_barrier.hpp
[CK_TILE] optimize moe sorting kernel, boost large context case up to 20x (
#2153
)
2025-05-06 17:32:07 +08:00