This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-03 21:21:22 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
7ea1508b59a0e8f89540d8d5f7eb3e7da9a50a62
composable_kernel
/
include
/
ck_tile
/
core
/
arch
History
valarLip
0fdbf6bcd1
extend buffer load for fp16/bf16x16 (
#2270
)
...
* extend buffer load for fp16/bf16x16 * format
2025-06-02 10:29:54 +08:00
..
amd_buffer_addressing_builtins.hpp
fix the buffer intrinsic names for clang >=20 (
#2228
)
2025-05-23 14:58:25 -07:00
amd_buffer_addressing.hpp
extend buffer load for fp16/bf16x16 (
#2270
)
2025-06-02 10:29:54 +08:00
arch.hpp
[CK_TILE] optimize moe sorting kernel, boost large context case up to 20x (
#2153
)
2025-05-06 17:32:07 +08:00
generic_memory_space_atomic.hpp
Addressing (Post Merge) code review comments for PR 1845 (
#1883
)
2025-03-06 11:40:30 -08:00
utility.hpp
[CK_TILE] fused-moe first version (
#1634
)
2024-11-26 11:14:56 +08:00
workgroup_barrier.hpp
[CK_TILE] optimize moe sorting kernel, boost large context case up to 20x (
#2153
)
2025-05-06 17:32:07 +08:00