ROCm/composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-06-07 00:04:37 +00:00
44f52fd081c523363b512e2b034e80e3afe44e78
composable_kernel/include/ck/tensor_operation/gpu
Qianfeng Zhang  44f52fd081  Add utility/get_shift.hpp  2023-06-23 12:45:09 +00:00
block    Add utility/get_shift.hpp    2023-06-23 12:45:09 +00:00
device   Use dim 0 as faster dim for writing mean/var/count workspace in batchnorm multiblock method [performance]    2023-06-22 22:40:35 +00:00
element  FP8 enablement - add a pseudorandom number generator, add conversion methods (#708)    2023-06-19 11:20:35 -05:00
grid     Use dim 0 as faster dim for writing mean/var/count workspace in batchnorm multiblock method [performance]    2023-06-22 22:40:35 +00:00
thread   update copyright headers (#726)    2023-05-31 18:46:57 -05:00
warp     update copyright headers (#726)    2023-05-31 18:46:57 -05:00