Commit Graph

7 Commits

Author SHA1 Message Date
Jianfeng Yan
e018a779fd Use Space Filling Curve in Threadwise Copy (#118)
* fixed a corner case in GetCoordinateResetStep

* clean

* rename num_accesses to num_access

Co-authored-by: Chao Liu <chao.liu2@amd.com>

[ROCm/composable_kernel commit: 9e33fe70c3]
2022-03-11 00:08:47 -06:00
Qianfeng
bc3b1d6fe7 Pr82 followup (#115)
* Use thread cluster descriptor and explicit M_K 2d descriptor to simply Blockwise Reduction

* Change by replacing ReduceDims by NumReduceDims as Device Reduce interface template parameter

* Rename the folder name for the pool2d and reduce examples

* Update to reduction test scripts

* Add Readme for pool2d_fwd and reduce_blockwise examples

* Tiny fix in reduce profiler and tiny update in reduce testing scripts

* Tiny fix in testing script profile_reduce_no_index.sh

* Tiny change in script/profile_reduce_with_index.sh

* Renaming and refining in Reduction profiler/device layer/examples

* Renaming and refining in Reduction profiler/device layer/examples

* Renaming all NumReduceDims to NumReduceDim

[ROCm/composable_kernel commit: 827301d95a]
2022-03-10 10:14:43 -06:00
Chao Liu
82ad74304e Reorganize files, Part 1 (#119)
* delete obselete files

* move files

* build

* update cmake

* update cmake

* fix build

* reorg examples

* update cmake for example and test

[ROCm/composable_kernel commit: 5d37d7bff4]
2022-03-08 21:46:36 -06:00
Chao Liu
5f217ebda5 reorginzed files
[ROCm/composable_kernel commit: 1566b31736]
2019-06-13 15:12:12 -05:00
Chao Liu
11c6b2ab9a change build
[ROCm/composable_kernel commit: c82b833d8e]
2019-06-12 10:47:25 -05:00
Chao Liu
42e9613bb8 fixed build issue
[ROCm/composable_kernel commit: f2b92ba945]
2019-06-12 09:59:28 -05:00
Chao Liu
14451df1fd reorginze files
[ROCm/composable_kernel commit: 81497a93a0]
2019-06-11 23:49:51 -05:00