Files
composable_kernel/include/ck/utility
guangzlu 4c4c7328a6 Add BF16 tests for batched_gemm_softmax_gemm_permute (#504)
* fixed bug in softmax reference & add bf16 examples for batched_gemm_scale_softmax_gemm

* added bf16 tests for batched_gemm_softmax_gemm_permute

* changed format of device_batched_gemm_softmax_gemm_permute_xdl_cshuffle_bf16_bf16_bf16_bf16_gmk_gnk_gno_gmo_instance.cpp

* changed format device_batched_gemm_softmax_gemm_permute_xdl_cshuffle_bf16_bf16_bf16_bf16_gmk_gnk_gno_gmo_instance.cpp

* aligned annotations

* modified CMakeLists for examples

* add common example code of fp16/bf16 version for batched_gemm_scale_softmax_gemm_xdl

* use macro to control the instances

* added macro control into instances

* clang-format some files

* changed error tolerance for bf16

* changed index for 10_elementwise_normalization

* fixed xdlops code bug in amd_xdlops.hpp

Co-authored-by: Po Yen Chen <PoYen.Chen@amd.com>
2022-11-15 16:30:23 -06:00
..
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-09-19 11:25:28 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-08-18 14:53:47 -05:00
2022-08-13 09:43:18 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00
2022-08-13 09:18:58 -05:00
2022-06-24 23:32:43 -05:00
2022-06-24 23:32:43 -05:00