Logo
Explore Help
Register Sign In
ROCm/composable_kernel
1
0
Fork 0
You've already forked composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git synced 2026-05-03 21:21:22 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
5a27a97391d08652c3da0a5347209c19d3ebb03d
composable_kernel/include/ck_tile/core/arch
History
Gino Lu fb1d090f3c [CK_TILE] Patch for pk_fp4 ref check and buffer load. (#3044)
* Patch for pk_fp4_raw_t buffer load and ref check
2025-10-20 14:47:04 +08:00
..
amd_buffer_addressing_builtins.hpp
[CK_TILE] Patch for pk_fp4 ref check and buffer load. (#3044)
2025-10-20 14:47:04 +08:00
amd_buffer_addressing.hpp
Use __builtin_amdgcn_readfirstlane for buffer resource in fused_moe (#2893)
2025-09-30 15:12:30 -07:00
amd_transpose_load_encoding.hpp
[CK_TILE] Use read_tr in universal gemm (#2436)
2025-07-16 23:56:22 -07:00
arch.hpp
update s_barrier's logic in gfx12 architecture (#3003)
2025-10-14 08:49:34 -07:00
generic_memory_space_atomic.hpp
Extend XDL kernel to Support RDNA3/4 - Part 5 (#2725)
2025-09-15 10:59:25 -07:00
utility.hpp
Re-enable optimization for gfx950 fmha fwd (#2671)
2025-08-13 14:57:43 +08:00
workgroup_barrier.hpp
[CK_TILE] optimize moe sorting kernel, boost large context case up to 20x (#2153)
2025-05-06 17:32:07 +08:00
Powered by Gitea Version: 1.25.4 Page: 571ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API