mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-20 21:09:08 +00:00
Add bfp16/int8 support into XDL GEMM operator (#50)
* init StaticBufferV2
* clean
* adopt old output stage for staticBufferV2
* clean
* remove hack
* clean
* clean
* add parameters
* clean code
* move c_buffer alloc into blockwise gemm
* add adaptors for m/n_thread_data_on_grid
* tweak gemm
* adjust blockwise_gemm_xdlops
* tweak
* update conv
* update script
* adding bwd 1x1
* update script
* adding 1x1 bwd
* debugging bwd 1x1 failure
* update script
* update script
* test
* test v100
* add bf16_1k
* clang-format
* clean
* add bfp16 for gfx908
* add verification
* clean up
* clean code
* restore bfl16
* clean
* add bfp16 support into gemm_driver
* apply new generator to other drivers
* add int8 support
* cleanb
* clean
* clean
* clean

Co-authored-by: Chao Liu <chao.liu2@amd.com>
Co-authored-by: Chao Liu <lc.roy86@gmail.com>
Co-authored-by: root <root@hayabusa6111.amd.com>
2
external/rocm/include/bfloat16_dev.hpp
vendored
@@ -31,7 +31,7 @@ extern "C" {
 #endif
 
 #ifdef __HIP_PLATFORM_HCC__
-#define EXECUTION_SPECIFIER __device__
+#define EXECUTION_SPECIFIER __device__ __host__
 #else
 #define EXECUTION_SPECIFIER
 #endif // MIOPEN_BACKEND_HIP
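The hunk above widens EXECUTION_SPECIFIER so that functions declared with it can be called from host code as well as device code, which is presumably what the host-side bfp16 verification in the drivers needs. The following is a minimal sketch of that pattern, not code taken from bfloat16_dev.hpp: the helpers float_to_bf16_bits and bf16_bits_to_float are hypothetical names, and the conversion simply truncates to the upper 16 bits rather than reproducing whatever rounding the real header applies. It assumes a HIP compiler supplies the __device__ and __host__ qualifiers when __HIP_PLATFORM_HCC__ is defined; with a plain host compiler the macro expands to nothing.

```cpp
#include <cstdint>
#include <cstring>

// Same guard structure as the patched header: under HIP the functions are
// compiled for both device and host; elsewhere the specifier is empty.
#ifdef __HIP_PLATFORM_HCC__
#define EXECUTION_SPECIFIER __device__ __host__
#else
#define EXECUTION_SPECIFIER
#endif

// Hypothetical helper (an assumption, not the header's API): keep the upper
// 16 bits of an IEEE-754 binary32 value, i.e. truncate it to bfloat16.
EXECUTION_SPECIFIER inline std::uint16_t float_to_bf16_bits(float x)
{
    std::uint32_t bits;
    std::memcpy(&bits, &x, sizeof(bits));
    return static_cast<std::uint16_t>(bits >> 16);
}

// Hypothetical inverse: place the 16 stored bits in the upper half of a
// binary32 pattern and zero-fill the lower half.
EXECUTION_SPECIFIER inline float bf16_bits_to_float(std::uint16_t h)
{
    std::uint32_t bits = static_cast<std::uint32_t>(h) << 16;
    float x;
    std::memcpy(&x, &bits, sizeof(x));
    return x;
}
```

With only `__device__`, a host-side reference check could not call such helpers directly; with `__device__ __host__`, the same definition serves the kernel and the verification path.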