Logo
Explore Help
Register Sign In
ROCm/composable_kernel
1
0
Fork 0
You've already forked composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git synced 2026-06-06 07:51:52 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
cbb6f2ab8c27046e9954db190bb671e69b3fcf12
composable_kernel/include/ck/tensor_operation/gpu
History
Bartłomiej Kocot 4a870942e6 Fix bug with n block id calculation in DeviceGroupedConvXdlCShuffle (#1457)
* Fix typo in TransformConvFwdToGemm

* Fix bug in n offset calculation
2024-08-10 13:12:05 +02:00
..
block
[GEMM] F8 GEMM, performance optimized. (#1384)
2024-07-19 22:06:52 +08:00
device
Fix bug with n block id calculation in DeviceGroupedConvXdlCShuffle (#1457)
2024-08-10 13:12:05 +02:00
element
Adding more instances of grouped convolution 3d forward for FP8 with ConvScale+Bias element-wise operation. (#1412)
2024-07-24 15:49:55 -05:00
grid
Fix for beta!=0 in reduce (#1440)
2024-08-06 09:10:39 -07:00
thread
Merging the gfx12 code into public repo. (#1362)
2024-06-27 00:33:34 -07:00
warp
Add structural sparsity xdlops (#1363)
2024-07-04 12:00:14 +02:00
Powered by Gitea Version: 1.25.4 Page: 349ms Template: 12ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API