This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-12 09:16:52 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
ec2bae27ff2b7ac658bfb92f533d34db15977eec
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
Bartłomiej Kocot
fd72380aeb
Optimize grouped conv bwd weight for small M and N (
#1303
)
...
* Optimize grouped conv bwd weight for small M and N * Fixes
2024-05-22 21:01:01 +02:00
..
block
remove wrong use of nonexistent class members (
#1290
)
2024-05-15 08:08:17 -07:00
device
Optimize grouped conv bwd weight for small M and N (
#1303
)
2024-05-22 21:01:01 +02:00
element
Add element op (
#1259
)
2024-04-26 12:55:45 -05:00
grid
Optimize grouped conv bwd weight for small M and N (
#1303
)
2024-05-22 21:01:01 +02:00
thread
bf16A_Int8B with fastgelu/bias (
#1264
)
2024-04-26 07:26:30 -05:00
warp
Code clean-up (
#1285
)
2024-05-10 09:41:39 -07:00