Files
composable_kernel/include
Po Yen Chen 830b346bbb Fix incomplete object size (=4n + 3) support of amd_wave_read_first_lane() (#738)
* Fix wrong pointer type

* Rename type trait get_unsigned_int<> to get_carrier<>

* Add 3-bytes carrier type

* Add missing __device__ specifier

* Rename template non-type parameter

* Leave the rest byte uninitialized

* Avoid invoking (host) STL algorithms

* Remove unnecessary 'inline' specifier

* Extract common logic out as helper method

* Hide dummy member function

* Add missing __device__ specifier

[ROCm/composable_kernel commit: 7c24654c24]
2023-06-12 08:36:40 -05:00
..