ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-06-08 15:30:23 +00:00)
Path: composable_kernel/include/ck_tile/ops/fmha
Commit: d6e49c5fdec1eedf9c6e6dbd59e7f788c2e2fc2e
Latest commit: 4a7ecce096 by Po Yen Chen, 2025-08-22 10:13:47 +08:00
[CK_TILE][FMHA] Enable dwordx4 loading in async_load_tile_raw() (#2549)
* Support async load dwordx4
* Enlarge load size on gfx950
block/ | support y-direction step length greater than 1 for SimplifiedGenericAttentionMask (#2338) | 2025-07-09 23:18:55 +08:00
kernel/ | [CK_TILE] FMHA BWD Fix Compilation with Bias (#2682) | 2025-08-22 10:01:10 +08:00
pipeline/ | [CK_TILE][FMHA] Enable dwordx4 loading in async_load_tile_raw() (#2549) | 2025-08-22 10:13:47 +08:00