Overhaul to Reducton and its dependants (#237)

mirror of https://github.com/ROCm/composable_kernel.git synced 2026-04-19 22:39:03 +00:00

* Tiny fix in dynamic_buffer.hpp to support vectorized AtomicAdd for double type

* Update to host layer and host reduction

* Merge and remove reduction kernels

* Merge and remove reduction device interfaces and update pooling device interface

* Merge and remove useless reduction device instances

* Update to reduction profiler and reduction ctests

* Update to reduction and pooling examples and add one reduction example

* Change to reduction examples to let them testable by ctest

* Add explicit pass checking for reduction and pooling examples

* Explicit assignment of tensor shapes in example reduce_blockwise_two_call

* Use atomic_add to repace atomicAdd and add atomic_add for double type

* Add reduce ctest support for double data type

* Replace to_int_vector() by using c++ std::vector::assign()

* Keep DeviceReduceThreadWise separated from DeviceReduceBlockWise

* Merge DeviceReduceBlockWise and DeviceReduceMultiBlockAtomicAdd into DeviceReduceMultiBlock

* Add GetAtomicOperationZeroValue() support for AtomicMax

* Tiny change to reduce example README.md

* Fix some tiny issues due to branch merging

* Revoke previous change in dynamic_buffer.hpp and add atomic_add for double2_t

* Add reduce multiblock_atomic_add instances for fp64 to verify vectorized atomic_add on fp64

* Renaming

* Clean the header includings in device_reduce instances header files

This commit is contained in:

Qianfeng

2022-05-25 01:19:12 +08:00

committed by

GitHub

parent 1085794df3

commit 63eee2d999

94 changed files with 2429 additions and 6785 deletions

									
										2

include/ck/utility/dynamic_buffer.hpp
									
												View File
												
				@@ -325,7 +325,7 @@ struct DynamicBuffer

				        {

				            if(is_valid_element)

				            {

				                atomic_add<X>(c_style_pointer_cast<X*>(&p_data_[i]), x);

				                atomic_add(c_style_pointer_cast<X*>(&p_data_[i]), x);

				            }

				        }

				    }

Overhaul to Reducton and its dependants (#237)

2 include/ck/utility/dynamic_buffer.hpp Unescape Escape View File

2

include/ck/utility/dynamic_buffer.hpp

View File