mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-26 08:00:13 +00:00
* [What] Separate fixpoint gemm from gemm example
[Why] Let the gemm_int8 example be a pure gemm.
[What]
1. Add gemm_requant_relu_requant,
2. Let CDataType be int32 in pure gemm, because no one uses int8 as CDataType. It is also part of gemm_requant_relu_requant
* Fix path
* Revise CMakeLists after merging develop
* Add gemm fp16 test
* Extract PrepareGemmTensor
* Extract TestGemm
* Add test for different layout
* Add 4 layouts of shuffle version of fp32
* Add 4 layouts of shuffle version of int8
* Add 4 layouts of shuffle version of bf16
* Replace all DeviceGemmPtr_ with DeviceGemmNoOpPtr to fit naming convention
* Add test for non-shuffle version of gemm
* Fix typo
* Print kernel information
* Add rest of the fp32 kernel to the test
* 1. Add rest of the fp16 device op.
2. Mark the invalid device operation
Co-authored-by: rocking <chunylai@amd.com>
[ROCm/composable_kernel commit: 485ea46a40]
16 lines
703 B
CMake
add_test_executable(test_gemm_fp32 gemm_fp32.cpp)
target_link_libraries(test_gemm_fp32 PRIVATE host_tensor)
target_link_libraries(test_gemm_fp32 PRIVATE device_gemm_instance)

add_test_executable(test_gemm_fp16 gemm_fp16.cpp)
target_link_libraries(test_gemm_fp16 PRIVATE host_tensor)
target_link_libraries(test_gemm_fp16 PRIVATE device_gemm_instance)

add_test_executable(test_gemm_bf16 gemm_bf16.cpp)
target_link_libraries(test_gemm_bf16 PRIVATE host_tensor)
target_link_libraries(test_gemm_bf16 PRIVATE device_gemm_instance)

add_test_executable(test_gemm_int8 gemm_int8.cpp)
target_link_libraries(test_gemm_int8 PRIVATE host_tensor)
target_link_libraries(test_gemm_int8 PRIVATE device_gemm_instance)