ROCm/composable_kernel
Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-06-06 15:54:31 +00:00)
e9fd122889f76ed050c80315cf73cda60f0ad248
composable_kernel/include/ck/tensor_operation/gpu
Illia Silin  06f1fc864c  2023-02-14 18:06:24 -06:00
Remove the workaround for bf16 attention tests. (#586)
* remove workaround in bf16 attention test
* clean up another workaround
..
block    [Navi3x-LWPCK-545] Block-wise GEMM + Real GEMM_WMMA_FP16 (#541)  2023-01-16 20:06:01 -06:00
device   Gemm+layernorm instance, ckProfiler, client example (#568)  2023-02-09 15:02:55 -06:00
element  Add GemmAddSoftmaxGemm support for MSFT ORT (instances and client API) (#576)  2023-02-08 14:34:45 -06:00
grid     Remove the workaround for bf16 attention tests. (#586)  2023-02-14 18:06:24 -06:00
thread   Batchnorm-forward implemented using welford method to calculate variance (#403)  2022-10-27 18:52:54 -06:00
warp     [Navi3x-LWPCK-545] Block-wise GEMM + Real GEMM_WMMA_FP16 (#541)  2023-01-16 20:06:01 -06:00