rocking5566
e63d071fc2
Refine kernel parameter of int8 (ScalarPerVector) ( #155 )
...
* Change int8 ScalarPerVector
* Modify vector width of C
[ROCm/composable_kernel commit: 98e1e2d0e9 ]
2022-03-29 17:36:21 -05:00
Chao Liu
8cba08d07a
Gemm+Reduce Fusion ( #128 )
...
* add gridwise gemm v4r1
* rename
* adding gemm+reduce
* adding gemm+reduce
* adding gemm+reduce
* adding gemm+reduce
* use sfc in shuffling
* remove hardcode
* remove hardcode
* refactor
* fix build
* adding gemm+reduce
* adding gemm+reduce
* adding gemm+reduce
* adding gemm+reduce
* adding gemm+reduce
* format
* clean
* adding gemm+reduce
* adding profiler for gemm+reduce
* adding gemm+reduce profiler
* fix build
* clean up
* gemm+reduce
* fix build
* update DeviceGemm_Xdl_CShuffle; update enum to enum class
* clean up
* add test for gemm+reduce
* clean up
* refactor
* fix build
* fix build
[ROCm/composable_kernel commit: f95267f166 ]
2022-03-23 22:18:42 -05:00
rocking5566
900ea4ae3e
Consider gemm requant relu requant as gemm fusuion ( #116 )
...
* [What] Separate fixpoint gemm from gemm example
[Why] let example of gemm_int8 be pure gemm.
[What]
1. Add gemm_requant_relu_requant,
2. Let CDataType be int32 in pure gemm, because no one use int8 CDataType. It is also part of gemm_requant_relu_requant
* Fix path
* Revise cmakelist due to merge develop
Co-authored-by: rocking <chunylai@amd.com >
[ROCm/composable_kernel commit: 9a17e7fbfd ]
2022-03-11 20:41:03 -06:00
Chao Liu
6203866064
Reorganize files, Part 1 ( #119 )
...
* delete obselete files
* move files
* build
* update cmake
* update cmake
* fix build
* reorg examples
* update cmake for example and test
[ROCm/composable_kernel commit: 5d37d7bff4 ]
2022-03-08 21:46:36 -06:00