This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-13 01:36:06 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
3973caa48c132d5cbe7bbee25143b30666a1bddd
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
illsilin
3973caa48c
switch between intrinsic mfma routines on mi100/200 and mi300
2023-02-07 09:46:58 -08:00
..
block
[Navi3x-LWPCK-545] Block-wise GEMM + Real GEMM_WMMA_FP16 (
#541
)
2023-01-16 20:06:01 -06:00
device
enable gfx940
2023-01-30 21:23:23 -08:00
element
Batchnorm inference instances, external API, client examples and gtests (
#531
)
2023-01-25 17:09:04 -06:00
grid
enable gfx940
2023-01-30 21:23:23 -08:00
thread
Batchnorm-forward implemented using welford method to calculate variance (
#403
)
2022-10-27 18:52:54 -06:00
warp
switch between intrinsic mfma routines on mi100/200 and mi300
2023-02-07 09:46:58 -08:00