Chao Liu
|
bb0c4bb96a
|
Improve external interface for GEMM and GEMM+add+add+fastgelu (#311)
* interface for GEMM and GEMM+add+add+fastgelu
* rename namespace
* instance factory
* fix build
* fix build; add GEMM client example
* clean
[ROCm/composable_kernel commit: 0dcb3496cf]
|
2022-06-30 22:11:00 -05:00 |
|
Chao Liu
|
31706d4896
|
add license in file (#303)
[ROCm/composable_kernel commit: d3051d7517]
|
2022-06-24 23:32:43 -05:00 |
|
Chao Liu
|
4a27f120ea
|
Absolute include path (#281)
* ad gelu and fast_gelu
* added GeLU and fast GeLU
* clean up
* add gemm+fastgelu example
* add gemm+gelu instances
* update profiler
* clean up
* clean up
* adding gemm+bias+activation
* clean
* adding bias
* clean
* adding gemm multiple d
* debugging
* add gemm bias add fastgelu
* rename, clean
* refactoring; add readme
* refactor
* refactor
* refactor
* refactor
* refactor
* refactor
* fix
* fix
* update example
* update example
* rename
* update example
* add ckProfiler
* clean
* clean
* clean
* clean
* add client app example
* update readme
* delete obselete files
* remove old client app
* delete old file
* cleaning
* clean
* remove half
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path for all examples
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* revert client app example
* clean build
* fix build
* temporary disable client test on Jenkins
* clean
* clean
* clean
[ROCm/composable_kernel commit: d1db6a0c3e]
|
2022-06-24 20:51:04 -05:00 |
|
JD
|
69d5f78b16
|
Add host API (#220)
* Add host API
* manually rebase on develop
* clean
* manually rebase on develop
* exclude tests from all target
* address review comments
* update client app name
* fix missing lib name
* clang-format update
* refactor
* refactor
* refactor
* refactor
* refactor
* fix test issue
* refactor
* refactor
* refactor
* upate cmake and readme
Co-authored-by: Chao Liu <chao.liu2@amd.com>
[ROCm/composable_kernel commit: cec69bc3bc]
|
2022-05-12 09:21:01 -05:00 |
|
ltqin
|
8db6e759dc
|
NHWC Conv2d Bwd weight fp16 ckprofiler and test (#166)
* change backward weight name
* start add bwd weight lib and profiler
* change tuning paramter
* change output info
* add bwd weight test
* change test info
* using conv_util
* change wgt to weight
* add }
* add fp32
[ROCm/composable_kernel commit: 781cacd2e6]
|
2022-04-04 20:32:00 -05:00 |
|