This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-03 21:21:22 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
5a27a97391d08652c3da0a5347209c19d3ebb03d
composable_kernel
/
include
/
ck_tile
/
core
/
arch
History
Gino Lu
fb1d090f3c
[CK_TILE] Patch for pk_fp4 ref check and buffer load. (
#3044
)
...
* Patch for pk_fp4_raw_t buffer load and ref check
2025-10-20 14:47:04 +08:00
..
amd_buffer_addressing_builtins.hpp
[CK_TILE] Patch for pk_fp4 ref check and buffer load. (
#3044
)
2025-10-20 14:47:04 +08:00
amd_buffer_addressing.hpp
Use __builtin_amdgcn_readfirstlane for buffer resource in fused_moe (
#2893
)
2025-09-30 15:12:30 -07:00
amd_transpose_load_encoding.hpp
[CK_TILE] Use read_tr in universal gemm (
#2436
)
2025-07-16 23:56:22 -07:00
arch.hpp
update s_barrier's logic in gfx12 architecture (
#3003
)
2025-10-14 08:49:34 -07:00
generic_memory_space_atomic.hpp
Extend XDL kernel to Support RDNA3/4 - Part 5 (
#2725
)
2025-09-15 10:59:25 -07:00
utility.hpp
Re-enable optimization for gfx950 fmha fwd (
#2671
)
2025-08-13 14:57:43 +08:00
workgroup_barrier.hpp
[CK_TILE] optimize moe sorting kernel, boost large context case up to 20x (
#2153
)
2025-05-06 17:32:07 +08:00