Chao Liu
|
2f5ccb68f5
|
ckProfiler and device-level XDL GEMM operator (#48)
* add DeviceGemmXdl
* update script
* fix naming issue
* fix comment
* output HostTensorDescriptor
* rename
* padded GEMM for fwd v4r4r4 nhwc
* refactor
* refactor
* refactor
* adding ckProfiler
* adding ckProfiler
* refactor
* fix tuning parameter bug
* add more gemm instances
* add more fp16 GEMM instances
* fix profiler driver
* fix bug in tuning parameter
* add fp32 gemm instances
* small fix
* refactor
* rename
* refactor gemm profiler; adding DeviceConv and conv profiler
* refactor
* fix
* add conv profiler
* refactor
* adding more GEMM and Conv instance
* Create README.md
Add build instruction for ckProfiler
* Create README.md
Add Readme for gemm_xdl example
* Update README.md
Remove build instruction from top most folder
* Update README.md
* clean up
[ROCm/composable_kernel commit: e823d518cb]
|
2021-11-14 11:28:32 -06:00 |
|
Chao Liu
|
1e312fef12
|
rename
[ROCm/composable_kernel commit: c03045ce2d]
|
2021-08-10 23:45:36 +00:00 |
|
Chao Liu
|
f3b7220822
|
Update README.md
[ROCm/composable_kernel commit: 85a1429301]
|
2021-07-28 09:41:38 -05:00 |
|
Chao Liu
|
6403529fbc
|
Update README.md
[ROCm/composable_kernel commit: 56f93c6f33]
|
2021-07-28 09:40:44 -05:00 |
|
Chao Liu
|
d297f0d524
|
Create README.md (#45)
* Create README.md
[ROCm/composable_kernel commit: 4682d070a6]
|
2021-07-08 13:32:29 -05:00 |
|