zjing14
|
e25c18aeb7
|
Improve 4k gemm perf (#1047)
* improve 4k gemm perf
* add f8 instances
* format
---------
Co-authored-by: Jing Zhang <jizha@amd.com>
[ROCm/composable_kernel commit: e8cddfdc3b]
|
2023-11-17 07:06:24 -06:00 |
|
Illia Silin
|
d40b8d5e2c
|
update copyright headers (#726)
[ROCm/composable_kernel commit: b94fd0b227]
|
2023-05-31 18:46:57 -05:00 |
|
Chao Liu
|
181df92584
|
disable print for group conv multiple D (#421)
[ROCm/composable_kernel commit: 43c898f6ff]
|
2022-09-16 09:46:32 -05:00 |
|
Chao Liu
|
2ef299e0ad
|
add license in file (#303)
[ROCm/composable_kernel commit: d3051d7517]
|
2022-06-24 23:32:43 -05:00 |
|
JD
|
569dd9f47b
|
Add host API (#220)
* Add host API
* manually rebase on develop
* clean
* manually rebase on develop
* exclude tests from all target
* address review comments
* update client app name
* fix missing lib name
* clang-format update
* refactor
* refactor
* refactor
* refactor
* refactor
* fix test issue
* refactor
* refactor
* refactor
* upate cmake and readme
Co-authored-by: Chao Liu <chao.liu2@amd.com>
[ROCm/composable_kernel commit: cec69bc3bc]
|
2022-05-12 09:21:01 -05:00 |
|