arai713
|
0e5c264c3e
|
Gridwise elementwise 2d (#466)
* added 2d gridwise elementwise
* added 2d version of device elementwise
* added example file with updated device elementwise call
* added Cmake file
* changed NumDim into 2D
* fixed compiler issues
* fixed indexing for loop step
* fixed NumDim dimension error
* changed blockID to 2D
* updated Grid Desc
* updated kernel call
* fixed 2d thread indexing
* added dimensions for example file
* commented out unused code
* changed vector load
* removed extra code
* temporarily removing vector load on 2nd dim
* changed vector load back, still causing errors
* altered indexing
* changed isSupportedArgument for 2D
* changed indexing + do/while
* fixed isSupportedArgument
* changed dimension for debugging
* fixed
* added testing printouts
* testing change
* added variables to distribute threads through both dimensions
* testing changes
* integrated variable for thread distribution into device elementwise and added as parameter for gridwise elementwise
* removed most of the extraneous code, testing with different dimensions
* testing
* removed debugging print statements
* moved 2d elementwise permute into elementwise permute directory
* fixed formatting
* removed debugging comments from threadwise transfer
Co-authored-by: Jing Zhang <jizhan@amd.com>
Co-authored-by: Po Yen Chen <PoYen.Chen@amd.com>
|
2022-12-12 09:18:10 -06:00 |
|