[rocm-libraries] ROCm/rocm-libraries#8019 (commit 6472935)

[CK TILE] Fix performance regression caused by Dispatcher
 codegen compiler flag. (#8019)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

## Motivation

Currently CK Tile two codegen paths: CK Builder and CK Tile Dispatcher.
The CK Tile Dispatcher codegen uses an additional compiler flag that is
not present in the CK Builder codegen workflow. The additional compiler
flag can cause performance regression for so instances as it disables
relevant compiler optimizations.

## Technical Details

Removed compiler flag `-mllvm -enable-noalias-to-md-conversion=0` from
the CMakeLists.txt that creates instance library from Dispatcher
codegen.

## Test Plan

Required testing is contained in the CI/CD pipeline.

## Submission Checklist

- [x] Look over the contributing guidelines at
https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.
This commit is contained in:
Ville Pietilä
2026-06-10 09:12:03 +00:00
committed by assistant-librarian[bot]
parent 320a813d67
commit c6c55db757

View File

@@ -115,7 +115,7 @@ function(ck_add_dispatcher_conv_instances VARIANT)
)
target_compile_options(${TARGET_NAME} PRIVATE
-DCK_TILE_FLOAT_TO_BFLOAT16_DEFAULT=0
-mllvm -enable-noalias-to-md-conversion=0
-DCK_TILE_EXPERIMENTAL_USE_BUFFER_LOAD_OOB_CHECK_OFFSET_TRICK=1
-Wno-undefined-func-template
-Wno-float-equal
-Wno-header-hygiene