* Use DynamicBuffer to hold raw pointer (to global and LDS memory)
* Add workaround for compiler issue (inefficient ISA) of ds_write for int8x4, int8x8, int8x16

[ROCm/composable_kernel commit: 78b987fbd6]
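
Illustrative sketch only, not the code from this commit: a host-side C++ approximation of the two ideas above. All names here (`AddressSpace`, `DynamicBufferSketch`, `make_dynamic_buffer_sketch`, `store_int8x4_as_word`) are hypothetical and are not the library's API. The first part wraps a raw pointer together with an address-space tag and a run-time element count. The second part shows one plausible shape of the int8x4 workaround, assuming the idea is to write the packed bytes as a single 32-bit word rather than as a vector of bytes.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical address-space tag (assumption; the real library defines its own).
enum class AddressSpace { Global, Lds };

// Hypothetical wrapper: a raw pointer plus an element count known only at run time.
// The address space is carried in the type so callers can specialize on it.
template <AddressSpace Space, typename T>
struct DynamicBufferSketch
{
    T* p_data;
    std::size_t element_count;

    static constexpr AddressSpace GetAddressSpace() { return Space; }

    // Guarded read: returns T{} when the index is invalid or out of range.
    T Get(std::size_t i, bool is_valid = true) const
    {
        return (is_valid && i < element_count) ? p_data[i] : T{};
    }

    // Guarded write: silently skips invalid or out-of-range indices.
    void Set(std::size_t i, const T& value, bool is_valid = true)
    {
        if(is_valid && i < element_count)
        {
            p_data[i] = value;
        }
    }
};

// Hypothetical factory; the element type is deduced from the pointer.
template <AddressSpace Space, typename T>
DynamicBufferSketch<Space, T> make_dynamic_buffer_sketch(T* p, std::size_t n)
{
    return {p, n};
}

// Assumed form of the workaround: pack four int8 values into one 32-bit word
// and store that word, so the store is a single word-sized write.
inline void store_int8x4_as_word(std::int32_t* dst_words,
                                 std::size_t word_index,
                                 const std::int8_t (&src)[4])
{
    std::int32_t word;
    std::memcpy(&word, src, sizeof(word)); // bit-copy, no aliasing violation
    dst_words[word_index] = word;
}
```

The same packing idea would extend to int8x8 and int8x16 with two and four 32-bit words respectively; whether the commit does exactly this is an assumption.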