Jun Liu
0c17233608
[Composable Kernel] update develop branch code to ck_upstream
...
Merge pull request #1236 from ROCmSoftwarePlatform/develop
[ROCm/composable_kernel commit: 5890e30076 ]
2021-10-25 19:49:17 -07:00
Qianfeng
d1c185cde7
[Enhancements] Several bugfixes and refactoring of dynamic generic reduction ( #1156 )
...
* Squashed 'src/composable_kernel/' content from commit aa8c98119
git-subtree-dir: src/composable_kernel
git-subtree-split: aa8c981198
* add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
* Squashed 'src/composable_kernel/' changes from aa8c98119..1d8dbe3c5
1d8dbe3c5 Update develop (#5 ) (#6 )
8ce0728ae Merge pull request #4 from ROCmSoftwarePlatform/separate_online_compile
f017e3448 refactor
9eb35eec8 refactor
041c48a06 rename
git-subtree-dir: src/composable_kernel
git-subtree-split: 1d8dbe3c57
* fix
* refactor
* remove online compilation from CK
* refactor
* fix
* add ctest
* tidy
* add tidy
* tidy
* tidy
* tidy
* tidy
* tidy
* tidy
* tidy
* tidy
* tidy
* add c-style pointer cast
* vector/scalar pointer cast use c-style pointer cast instead of reinterpret_cast
* fix clang warning suppression
* tidy
* suppress cppcheck
* fix enum issue
* revert chagnes to hip build
* fix kernel filename
* update CK build script
* rename
* rename
* make innner product compatiable on gfx900
* Update src/include/miopen/solver/ck_utility_common.hpp
Co-authored-by: JD <Jehandad.Khan@amd.com >
* compiler parameter use stream
* use int instead of index_t in kernel wrapper
* DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element
* refactor
* refactor
* change cmakelist
* change ck common utility
* fix
* Squashed 'src/composable_kernel/' changes from 1d8dbe3c5..887df7b12
887df7b12 Merge pull request #16 from ROCmSoftwarePlatform/develop
7e6b9fb7a Merge pull request #14 from ROCmSoftwarePlatform/miopen_downstream_init_integration
833701f40 Merge pull request #8 from ROCmSoftwarePlatform/miopen_downstream_init_integration
e25c4c2f1 refactor
27048b771 refactor
65e834905 DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element
b3759bf6a use int instead of index_t in kernel wrapper
04ed8ddf4 compiler parameter use stream
9f40048d1 make innner product compatiable on gfx900
f7df8c7ee rename
1e312fef1 rename
c9869a5ac update CK build script
c825eb6b1 fix kernel filename
594b1cf91 fix enum issue
286475c6b tidy
a7c943aba fix clang warning suppression
d49e0ddcb vector/scalar pointer cast use c-style pointer cast instead of reinterpret_cast
314b9d78e add c-style pointer cast
d4b35bd09 tidy
cb2edf210 tidy
4771cfa34 tidy
eb7f9f35b tidy
b14b5d337 tidy
9c589af82 tidy
e8def0e77 tidy
9e2c3c776 tidy
51ab4abaf add tidy
cba13cb6b fix
5ed1b840a remove online compilation from CK
5856acc10 refactor
7221bedc9 Merge commit '437cc595c6e206dfebb118985b5171bbc1e29eab' into composable_kernel_init_integration_v3
0bb6c85c2 Merge pull request #7 from ROCmSoftwarePlatform/master
a0b9a203a Update develop (#5 )
898807d60 add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
git-subtree-dir: src/composable_kernel
git-subtree-split: 887df7b129
* Tiny fix in using data type template parameters in blockwise and direct_threadwise kernel
* Fix with regard to implementing GetZeroVal() in both kernel and host
* Avoid convert to compType from dstDataType before writting the output value
* Add half_t support to NumericLimits and make constexpr GetZeroVal() of binary operator
* Add CONSTANT decorator for descriptor read buffer
* Use get_thread_local_1d_id() for thread local Id
* Rename GetZeroVal() to GetReductionZeroVal() in the kernels
* Remove constexpr from initialized zeroVal and tiny fix in reduction_operator.hpp
* Occasional tiny simplification and update in the kernel files
* Update in src/reducetensor.cpp for consistent IDs passing to the kernel
* Update to re-order tensor dimensions on the host, split second_call kernel wrapper files and simplify reduce_all kernel wrappers
* Update to remove OpenCL tidy checking failures
* Small updates in src/reducetensor.cpp
* Update for better readability
* Remove unused codes and not-needed template parameters in the kernel wrappers
Co-authored-by: Chao Liu <chao.liu2@amd.com >
Co-authored-by: JD <Jehandad.Khan@amd.com >
[ROCm/composable_kernel commit: dfb80c4e39 ]
2021-09-29 08:12:11 -07:00
Jun Liu
47bf3c6a3d
Merge pull request #1165 from ROCmSoftwarePlatform/develop
...
Merge develop into CK_upstream (Please don't squash when merging)
[ROCm/composable_kernel commit: 8557901d02 ]
2021-09-21 15:52:12 -07:00
Qianfeng
b315c39b11
[SWDEV-281541][MSRCHA-100] Implementation of Dynamic Generic Reduction ( #1108 )
...
* add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
* make inner product compatible on gfx900
* Update src/include/miopen/solver/ck_utility_common.hpp
* compiler parameter use stream
* use int instead of index_t in kernel wrapper
* DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element
* Add dynamic generic reduction kernel layer (kernel wrappers, kernel implementations and utilities)
* Some updates to dynamic composable kernel facility for the need of dynamic generic reduction
* Update to generic reduction C++ host interface layer to support dynamic generic reduction
* Update to remove tidy complaints in host interface layer
* Change the unary operator form from void op(T &x) to T op(T x)
* Update to pass single workspace pointer for all kernels (fix for OpenCL backend)
* Use cppcheck-suppress to prevent some strange warnings
* Re-use operator [] and () for DynamicBuffer and update to depending codes
* Remove useless codes in first call threadwise/warpwise/blockwise kernel wrappers
* [performance] Remove un-needed local buffer initialization
Co-authored-by: Chao Liu <chao.liu2@amd.com >
Co-authored-by: JD <Jehandad.Khan@amd.com >
[ROCm/composable_kernel commit: 9e80cdceb7 ]
2021-08-26 18:04:55 -07:00
Chao Liu
ee428d2d6f
Composable kernel init integration v3 ( #1097 )
...
* Squashed 'src/composable_kernel/' content from commit f6edda611
git-subtree-dir: src/composable_kernel
git-subtree-split: f6edda6119
* add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
* Squashed 'src/composable_kernel/' changes from f6edda611..5781adf5c
5781adf5c Update develop (#5 ) (#6 )
97e6d514f Merge pull request #4 from ROCmSoftwarePlatform/separate_online_compile
7b1ec41e5 refactor
49c33aaea refactor
54b3e73d1 rename
git-subtree-dir: src/composable_kernel
git-subtree-split: 5781adf5cf
* fix
* refactor
* remove online compilation from CK
* refactor
* fix
* add ctest
* add c-style pointer cast
* vector/scalar pointer cast use c-style pointer cast instead of reinterpret_cast
* fix clang warning suppression
* tidy
* suppress cppcheck
* fix enum issue
* revert chagnes to hip build
* fix kernel filename
* update CK build script
* rename
* rename
* make innner product compatiable on gfx900
* Update src/include/miopen/solver/ck_utility_common.hpp
Co-authored-by: JD <Jehandad.Khan@amd.com >
* compiler parameter use stream
* use int instead of index_t in kernel wrapper
* DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element
* refactor
* refactor
* change cmakelist
* change ck common utility
* fix
Co-authored-by: JD <Jehandad.Khan@amd.com >
[ROCm/composable_kernel commit: 6fe3627a9e ]
2021-08-19 10:55:03 -05:00
Chao Liu
887df7b129
Merge pull request #16 from ROCmSoftwarePlatform/develop
...
Merge develop into master
[ROCm/composable_kernel commit: 31b403526e ]
2021-08-18 11:22:34 -05:00
Chao Liu
7e6b9fb7a6
Merge pull request #14 from ROCmSoftwarePlatform/miopen_downstream_init_integration
...
MIOpen Downstream: Initial integration 2nd PR
[ROCm/composable_kernel commit: b62bf8c3f8 ]
2021-08-16 16:39:40 -05:00
Chao Liu
833701f408
Merge pull request #8 from ROCmSoftwarePlatform/miopen_downstream_init_integration
...
[ROCm/composable_kernel commit: ccc4a1d365 ]
2021-08-16 16:28:53 -05:00
Chao Liu
e25c4c2f15
refactor
...
[ROCm/composable_kernel commit: 67ad47e7c1 ]
2021-08-16 21:01:33 +00:00
Chao Liu
27048b7714
refactor
...
[ROCm/composable_kernel commit: 16effa767c ]
2021-08-16 20:36:47 +00:00
Chao Liu
65e8349051
DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element
...
[ROCm/composable_kernel commit: a91b68dfcb ]
2021-08-13 23:40:19 +00:00
Chao Liu
b3759bf6a2
use int instead of index_t in kernel wrapper
...
[ROCm/composable_kernel commit: 2cbabbba54 ]
2021-08-13 20:55:39 +00:00
Chao Liu
04ed8ddf43
compiler parameter use stream
...
[ROCm/composable_kernel commit: 0834bc7635 ]
2021-08-13 01:05:14 +00:00
Chao Liu
9f40048d1c
make innner product compatiable on gfx900
...
[ROCm/composable_kernel commit: f2ac7832c6 ]
2021-08-11 09:42:53 -05:00
Chao Liu
f7df8c7eea
rename
...
[ROCm/composable_kernel commit: 4e57b30a6a ]
2021-08-11 00:08:42 +00:00
Chao Liu
1e312fef12
rename
...
[ROCm/composable_kernel commit: c03045ce2d ]
2021-08-10 23:45:36 +00:00
Chao Liu
c9869a5ace
update CK build script
...
[ROCm/composable_kernel commit: b2589957f3 ]
2021-08-10 22:19:13 +00:00
Chao Liu
c825eb6b1c
fix kernel filename
...
[ROCm/composable_kernel commit: 2c48039d0e ]
2021-08-10 22:15:23 +00:00
Chao Liu
594b1cf918
fix enum issue
...
[ROCm/composable_kernel commit: d626dccc95 ]
2021-08-10 20:55:13 +00:00
Chao Liu
286475c6b6
tidy
...
[ROCm/composable_kernel commit: 643ebd4f3e ]
2021-08-10 07:07:11 +00:00
Chao Liu
a7c943abab
fix clang warning suppression
...
[ROCm/composable_kernel commit: ddd49ec9e7 ]
2021-08-10 06:20:24 +00:00
Chao Liu
d49e0ddcb2
vector/scalar pointer cast use c-style pointer cast instead of reinterpret_cast
...
[ROCm/composable_kernel commit: 4f566c6221 ]
2021-08-10 05:55:20 +00:00
Chao Liu
314b9d78ef
add c-style pointer cast
...
[ROCm/composable_kernel commit: 172036d728 ]
2021-08-10 00:01:52 -05:00
Chao Liu
d4b35bd09f
tidy
...
[ROCm/composable_kernel commit: 76f3131939 ]
2021-08-09 18:49:59 -05:00
Chao Liu
cb2edf2100
tidy
...
[ROCm/composable_kernel commit: d18428901e ]
2021-08-09 18:20:02 -05:00
Chao Liu
4771cfa340
tidy
...
[ROCm/composable_kernel commit: f885c131d8 ]
2021-08-09 22:13:47 +00:00
Chao Liu
eb7f9f35b2
tidy
...
[ROCm/composable_kernel commit: 80120f0a0c ]
2021-08-09 21:10:09 +00:00
Chao Liu
b14b5d3371
tidy
...
[ROCm/composable_kernel commit: c3efeb5e20 ]
2021-08-09 19:32:07 +00:00
Chao Liu
9c589af829
tidy
...
[ROCm/composable_kernel commit: 56fc0842b3 ]
2021-08-09 19:27:49 +00:00
Chao Liu
e8def0e778
tidy
...
[ROCm/composable_kernel commit: 54fba515b3 ]
2021-08-09 17:33:32 +00:00
Chao Liu
9e2c3c7765
tidy
...
[ROCm/composable_kernel commit: e62bae7a4a ]
2021-08-09 15:11:35 +00:00
Chao Liu
51ab4abaf4
add tidy
...
[ROCm/composable_kernel commit: 24c8728942 ]
2021-08-08 17:41:54 +00:00
Chao Liu
cba13cb6b3
fix
...
[ROCm/composable_kernel commit: 61487e0a00 ]
2021-08-07 02:31:19 +00:00
Chao Liu
5ed1b840a0
remove online compilation from CK
...
[ROCm/composable_kernel commit: ae98b52ad8 ]
2021-08-07 00:51:05 +00:00
Chao Liu
5856acc10f
refactor
...
[ROCm/composable_kernel commit: cb95421311 ]
2021-08-06 22:17:51 +00:00
Chao Liu
7221bedc99
Merge commit '437cc595c6e206dfebb118985b5171bbc1e29eab' into composable_kernel_init_integration_v3
...
[ROCm/composable_kernel commit: 73ca970155 ]
2021-08-06 21:32:27 +00:00
Chao Liu
0bb6c85c29
Merge pull request #7 from ROCmSoftwarePlatform/master
...
Master
[ROCm/composable_kernel commit: 3b86646115 ]
2021-08-06 16:29:58 -05:00
Chao Liu
1d8dbe3c57
Update develop ( #5 ) ( #6 )
...
* refactor
[ROCm/composable_kernel commit: 5781adf5cf ]
2021-08-06 16:13:07 -05:00
Chao Liu
a0b9a203a2
Update develop ( #5 )
...
* refactor
[ROCm/composable_kernel commit: d09ea4f4e5 ]
2021-08-06 16:11:15 -05:00
Chao Liu
8ce0728ae2
Merge pull request #4 from ROCmSoftwarePlatform/separate_online_compile
...
Separate online compile
[ROCm/composable_kernel commit: 97e6d514f7 ]
2021-08-06 16:09:22 -05:00
Chao Liu
f017e34483
refactor
...
[ROCm/composable_kernel commit: 7b1ec41e5b ]
2021-08-06 20:50:01 +00:00
Chao Liu
9eb35eec87
refactor
...
[ROCm/composable_kernel commit: 49c33aaea7 ]
2021-08-06 19:59:53 +00:00
Chao Liu
041c48a06c
rename
...
[ROCm/composable_kernel commit: 54b3e73d17 ]
2021-08-06 18:07:15 +00:00
Chao Liu
898807d60f
add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
...
[ROCm/composable_kernel commit: 3d32ae9404 ]
2021-07-30 17:50:17 -05:00
Chao Liu
aa8c981198
Merge pull request #3 from ROCmSoftwarePlatform/format
...
Update to clang-format-10
[ROCm/composable_kernel commit: f6edda6119 ]
2021-07-30 17:16:23 -05:00
Chao Liu
e2352d83a9
update to clang-format-10
...
[ROCm/composable_kernel commit: 82fae390fb ]
2021-07-30 16:37:00 -05:00
Chao Liu
2e187cf3ab
Merge pull request #2 from asroy/master
...
Update readme
[ROCm/composable_kernel commit: bd27ed6c38 ]
2021-07-28 09:43:56 -05:00
Chao Liu
f3b7220822
Update README.md
...
[ROCm/composable_kernel commit: 85a1429301 ]
2021-07-28 09:41:38 -05:00
Chao Liu
6403529fbc
Update README.md
...
[ROCm/composable_kernel commit: 56f93c6f33 ]
2021-07-28 09:40:44 -05:00
Chao Liu
989c345b36
Merge pull request #1 from ROCmSoftwarePlatform/some_fix_210727
...
fix building issue
[ROCm/composable_kernel commit: 594f1dbe96 ]
2021-07-27 13:19:11 -05:00