mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-06-28 10:47:00 +00:00
[rocm-libraries] ROCm/rocm-libraries#8019 (commit 6472935)
[CK TILE] Fix performance regression caused by Dispatcher codegen compiler flag. (#8019) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit ## Motivation Currently CK Tile two codegen paths: CK Builder and CK Tile Dispatcher. The CK Tile Dispatcher codegen uses an additional compiler flag that is not present in the CK Builder codegen workflow. The additional compiler flag can cause performance regression for so instances as it disables relevant compiler optimizations. ## Technical Details Removed compiler flag `-mllvm -enable-noalias-to-md-conversion=0` from the CMakeLists.txt that creates instance library from Dispatcher codegen. ## Test Plan Required testing is contained in the CI/CD pipeline. ## Submission Checklist - [x] Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.
This commit is contained in:
committed by
assistant-librarian[bot]
parent
320a813d67
commit
c6c55db757
@@ -115,7 +115,7 @@ function(ck_add_dispatcher_conv_instances VARIANT)
|
||||
)
|
||||
target_compile_options(${TARGET_NAME} PRIVATE
|
||||
-DCK_TILE_FLOAT_TO_BFLOAT16_DEFAULT=0
|
||||
-mllvm -enable-noalias-to-md-conversion=0
|
||||
-DCK_TILE_EXPERIMENTAL_USE_BUFFER_LOAD_OOB_CHECK_OFFSET_TRICK=1
|
||||
-Wno-undefined-func-template
|
||||
-Wno-float-equal
|
||||
-Wno-header-hygiene
|
||||
|
||||
Reference in New Issue
Block a user