ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-06-07 08:15:04 +00:00
eaa95c20035a98598cb5d633188bc5c2c40e49fe
composable_kernel/include/ck/tensor_operation/gpu
Qianfeng Zhang
eaa95c2003
Add CountDataType as template parameter in blockwise_welford
2023-06-23 09:38:35 +00:00
block
Add CountDataType as template parameter in blockwise_welford
2023-06-23 09:38:35 +00:00
device
Use dim 0 as faster dim for writing mean/var/count workspace in batchnorm multiblock method [performance]
2023-06-22 22:40:35 +00:00
element
FP8 enablement - add a pseudorandom number generator, add conversion methods (#708)
2023-06-19 11:20:35 -05:00
grid
Use dim 0 as faster dim for writing mean/var/count workspace in batchnorm multiblock method [performance]
2023-06-22 22:40:35 +00:00
thread
update copyright headers (#726)
2023-05-31 18:46:57 -05:00
warp
update copyright headers (#726)
2023-05-31 18:46:57 -05:00