ROCm/composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-05-11 17:00:18 +00:00
7c0b149811765a7e25e38f7c00c61bba7e8b683d
composable_kernel/include/ck/tensor_operation/gpu
Anthony Chang  08a979f188  use inline asm for 4x4 int8 transposition (#187)  2022-04-22 15:47:31 -05:00
block    removed unused lds loads (#196)                   2022-04-20 22:10:35 -05:00
device   Compile CK for all targets (#188)                 2022-04-15 14:17:28 -05:00
element  Gemm+Reduce Fusion (#128)                         2022-03-23 22:18:42 -05:00
grid     Compile CK for all targets (#188)                 2022-04-15 14:17:28 -05:00
thread   use inline asm for 4x4 int8 transposition (#187)  2022-04-22 15:47:31 -05:00
warp     Compile for gfx908 and gfx90a (#130)              2022-03-31 12:33:34 -05:00