This website requires JavaScript.
Explore
Help
Register
Sign In
ROCm
/
composable_kernel
Watch
1
Star
0
Fork
0
You've already forked composable_kernel
mirror of
https://github.com/ROCm/composable_kernel.git
synced
2026-05-13 01:36:06 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
e7dce4d247d2aad9afc7695b29b4c35eaf62b9cc
composable_kernel
/
include
/
ck
/
tensor_operation
/
gpu
History
Bartłomiej Kocot
1519ce91a3
Fix and optimize dynamic unary elementwise (
#1818
)
...
* Fix and optimize dynamic unary elementwise * fix
2025-01-16 13:48:39 -08:00
..
block
Implement the fp16xint4 scale weight only kernel for Ali (
#1786
)
2025-01-03 18:35:21 +08:00
device
Fix and optimize dynamic unary elementwise (
#1818
)
2025-01-16 13:48:39 -08:00
element
Fix and optimize dynamic unary elementwise (
#1818
)
2025-01-16 13:48:39 -08:00
grid
Implement the fp16xint4 scale weight only kernel for Ali (
#1786
)
2025-01-03 18:35:21 +08:00
thread
Grouped convolution backward weight special vector size loads (
#1772
)
2025-01-10 22:02:30 +08:00
warp
terminology clean-up (
#1792
)
2025-01-03 16:38:22 -08:00