JD
|
569dd9f47b
|
Add host API (#220)
* Add host API
* manually rebase on develop
* clean
* manually rebase on develop
* exclude tests from all target
* address review comments
* update client app name
* fix missing lib name
* clang-format update
* refactor
* refactor
* refactor
* refactor
* refactor
* fix test issue
* refactor
* refactor
* refactor
* update cmake and readme
Co-authored-by: Chao Liu <chao.liu2@amd.com>
[ROCm/composable_kernel commit: cec69bc3bc]
|
2022-05-12 09:21:01 -05:00 |
|
Wen-Heng (Jack) Chung
|
1dc34ba98b
|
Update README.md (#228)
[ROCm/composable_kernel commit: 968bd93285]
|
2022-05-09 15:00:04 -05:00 |
|
Chao Liu
|
3f732cceab
|
Compile for gfx908 and gfx90a (#130)
* adding compilation for multiple targets
* fix build
* clean
* update Jenkinsfile
* update readme
* update Jenkins
* use ck::half_t instead of ushort for bf16
* rename enum classes
* clean
* rename
* clean
[ROCm/composable_kernel commit: cd167e492a]
|
2022-03-31 12:33:34 -05:00 |
|
Chao Liu
|
b9f9ed96ac
|
ckProfiler and device-level XDL GEMM operator (#48)
* add DeviceGemmXdl
* update script
* fix naming issue
* fix comment
* output HostTensorDescriptor
* rename
* padded GEMM for fwd v4r4r4 nhwc
* refactor
* refactor
* refactor
* adding ckProfiler
* adding ckProfiler
* refactor
* fix tuning parameter bug
* add more gemm instances
* add more fp16 GEMM instances
* fix profiler driver
* fix bug in tuning parameter
* add fp32 gemm instances
* small fix
* refactor
* rename
* refactor gemm profiler; adding DeviceConv and conv profiler
* refactor
* fix
* add conv profiler
* refactor
* adding more GEMM and Conv instance
* Create README.md
Add build instruction for ckProfiler
* Create README.md
Add Readme for gemm_xdl example
* Update README.md
Remove build instruction from top most folder
* Update README.md
* clean up
[ROCm/composable_kernel commit: e823d518cb]
|
2021-11-14 11:28:32 -06:00 |
|
Chao Liu
|
c5a4edb9e8
|
rename
[ROCm/composable_kernel commit: c03045ce2d]
|
2021-08-10 23:45:36 +00:00 |
|
Chao Liu
|
1b8b55fc61
|
Update README.md
[ROCm/composable_kernel commit: 85a1429301]
|
2021-07-28 09:41:38 -05:00 |
|
Chao Liu
|
18bec192f5
|
Update README.md
[ROCm/composable_kernel commit: 56f93c6f33]
|
2021-07-28 09:40:44 -05:00 |
|
Chao Liu
|
2ac5994379
|
Create README.md (#45)
* Create README.md
[ROCm/composable_kernel commit: 4682d070a6]
|
2021-07-08 13:32:29 -05:00 |
|