This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-13 01:36:06 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
1b409ffe5b4c5059f9bbd9571de5af1fabc9d972
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
Jing Zhang
1b409ffe5b
fix mfma_int8 on MI300
2023-02-28 20:20:46 +00:00
..
block
[Navi3x-LWPCK-545] Block-wise GEMM + Real GEMM_WMMA_FP16 (
#541
)
2023-01-16 20:06:01 -06:00
device
fix mfma_int8 on MI300
2023-02-28 20:20:46 +00:00
element
Batchnorm inference instances, external API, client examples and gtests (
#531
)
2023-01-25 17:09:04 -06:00
grid
fix mfma_int8 on MI300
2023-02-28 20:20:46 +00:00
thread
Batchnorm-forward implemented using welford method to calculate variance (
#403
)
2022-10-27 18:52:54 -06:00
warp
fix mfma_int8 on MI300
2023-02-28 20:20:46 +00:00