Logo
Explore Help
Register Sign In
ROCm/composable_kernel
1
0
Fork 0
You've already forked composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git synced 2026-05-05 06:01:23 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
0345963eef4f92e9c5eab608bb8557b5463a1dcb
composable_kernel/include/ck/tensor_operation/gpu
History
zjing14 0345963eef Add MNK padding, M = 0 support into grouped_gemm (#539)
* add mnk padding, support m=0

* clean code

* clean code

Co-authored-by: Rostyslav Geyyer <46627076+geyyer@users.noreply.github.com>
2022-12-15 15:07:24 -06:00
..
block
Add pipeline v1/v2 selector, add more instances (#381)
2022-11-02 16:50:48 -06:00
device
Add MNK padding, M = 0 support into grouped_gemm (#539)
2022-12-15 15:07:24 -06:00
element
Add multiple d gridwise gemm on Navi21 for ResNet50 (#517)
2022-12-02 11:42:31 -06:00
grid
Gridwise elementwise 2d (#466)
2022-12-12 09:18:10 -06:00
thread
Batchnorm-forward implemented using welford method to calculate variance (#403)
2022-10-27 18:52:54 -06:00
warp
Input/output permutation for fused attention (#460)
2022-10-27 14:58:20 -06:00
Powered by Gitea Version: 1.25.4 Page: 187ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API