From c6c55db75748d3605ebf412314cbece92a00a489 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Ville=20Pietil=C3=A4?= <188998872+vpietila-amd@users.noreply.github.com> Date: Wed, 10 Jun 2026 09:12:03 +0000 Subject: [PATCH] [rocm-libraries] ROCm/rocm-libraries#8019 (commit 6472935) [CK TILE] Fix performance regression caused by Dispatcher codegen compiler flag. (#8019) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit ## Motivation Currently CK Tile two codegen paths: CK Builder and CK Tile Dispatcher. The CK Tile Dispatcher codegen uses an additional compiler flag that is not present in the CK Builder codegen workflow. The additional compiler flag can cause performance regression for so instances as it disables relevant compiler optimizations. ## Technical Details Removed compiler flag `-mllvm -enable-noalias-to-md-conversion=0` from the CMakeLists.txt that creates instance library from Dispatcher codegen. ## Test Plan Required testing is contained in the CI/CD pipeline. ## Submission Checklist - [x] Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests. --- dispatcher/library/CMakeLists.txt | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/dispatcher/library/CMakeLists.txt b/dispatcher/library/CMakeLists.txt index a0e07fd3ed..79adbade8a 100644 --- a/dispatcher/library/CMakeLists.txt +++ b/dispatcher/library/CMakeLists.txt @@ -115,7 +115,7 @@ function(ck_add_dispatcher_conv_instances VARIANT) ) target_compile_options(${TARGET_NAME} PRIVATE -DCK_TILE_FLOAT_TO_BFLOAT16_DEFAULT=0 - -mllvm -enable-noalias-to-md-conversion=0 + -DCK_TILE_EXPERIMENTAL_USE_BUFFER_LOAD_OOB_CHECK_OFFSET_TRICK=1 -Wno-undefined-func-template -Wno-float-equal -Wno-header-hygiene