ROCm/composable_kernel
mirror of https://github.com/ROCm/composable_kernel.git synced 2026-05-14 02:02:46 +00:00
aa4c28d53b3a773a11b89d8172835b2b3aec0577
composable_kernel/test
Jianfeng Yan aa4c28d53b refactored deviceBatchedGemm; removed GridwiseBatchedGemm; added fp32 and int8 to profiler (#120)
changed long_index_t to index_t when computing memory offset

uncomment other ops in profiler

added test for batched_gemm

[ROCm/composable_kernel commit: cb87b049de]
2022-03-21 16:45:14 -05:00
| Name | Last commit | Date |
| --- | --- | --- |
| batched_gemm | refactored deviceBatchedGemm; removed GridwiseBatchedGemm; added fp32 and int8 to profiler (#120) | 2022-03-21 16:45:14 -05:00 |
| conv2d_bwd_data | Fix conv2d bwd data bug when filter is 1x1 and stride = 2 (#132) | 2022-03-21 10:53:23 -05:00 |
| conv2d_fwd | Reorganize files, Part 1 (#119) | 2022-03-08 21:46:36 -06:00 |
| conv_util | Reorganize files, Part 1 (#119) | 2022-03-08 21:46:36 -06:00 |
| convnd_fwd | Reorganize files, Part 1 (#119) | 2022-03-08 21:46:36 -06:00 |
| gemm | Gemm_c_shuffle (4 layouts) X (fp32 bf16 int8) (#131) | 2022-03-21 15:59:51 -05:00 |
| gemm_split_k | Reorganize files, Part 1 (#119) | 2022-03-08 21:46:36 -06:00 |
| include | Gemm_c_shuffle (4 layouts) X (fp32 bf16 int8) (#131) | 2022-03-21 15:59:51 -05:00 |
| magic_number_division | Reorganize files, Part 1 (#119) | 2022-03-08 21:46:36 -06:00 |
| reference_conv_fwd | Reorganize files, Part 1 (#119) | 2022-03-08 21:46:36 -06:00 |
| space_filling_curve | Use Space Filling Curve in Threadwise Copy (#118) | 2022-03-11 00:08:47 -06:00 |
| CMakeLists.txt | refactored deviceBatchedGemm; removed GridwiseBatchedGemm; added fp32 and int8 to profiler (#120) | 2022-03-21 16:45:14 -05:00 |