mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-04-19 14:29:05 +00:00
* Add elementwise operation kernel and example
* Add comment
* Add template argument of dim. Prepare to support multiple dimension
* Rename example
* Support 1 dimension
* Add static assert
* Add comment
* Extract pad
* Remove redundant argument
* Support any dimension for elementwise operation
* Remove line
* Let it be the multiple number of CU
* Move thread per block to the parameter of constructor
* rename threadPerBlock with blockSize
* Support double
* rename kernel function name
* remove redundant include header
* Refine type
* Need to the final dimension
* Refine variable name
* Refine type
* Use index_t instead of int in API

Co-authored-by: rocking <chunylai@amd.com>
3 lines
217 B
CMake
add_example_executable(example_broadcast_add_2d broadcast_add_2d.cpp)
add_example_executable(example_elementwise_add_1d elementwise_add_1d.cpp)
add_example_executable(example_elementwise_add_4d elementwise_add_4d.cpp)