mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-11 17:00:18 +00:00
* Squashed 'src/composable_kernel/' content from commit f6edda611
  git-subtree-dir: src/composable_kernel
  git-subtree-split: f6edda6119
* add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
* Squashed 'src/composable_kernel/' changes from f6edda611..5781adf5c
  5781adf5c Update develop (#5) (#6)
  97e6d514f Merge pull request #4 from ROCmSoftwarePlatform/separate_online_compile
  7b1ec41e5 refactor
  49c33aaea refactor
  54b3e73d1 rename
  git-subtree-dir: src/composable_kernel
  git-subtree-split: 5781adf5cf
* fix
* refactor
* remove online compilation from CK
* refactor
* fix
* add ctest
* add c-style pointer cast
* vector/scalar pointer cast use c-style pointer cast instead of reinterpret_cast
* fix clang warning suppression
* tidy
* suppress cppcheck
* fix enum issue
* revert chagnes to hip build
* fix kernel filename
* update CK build script
* rename
* rename
* make innner product compatiable on gfx900
* Update src/include/miopen/solver/ck_utility_common.hpp
  Co-authored-by: JD <Jehandad.Khan@amd.com>
* compiler parameter use stream
* use int instead of index_t in kernel wrapper
* DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element
* refactor
* refactor
* change cmakelist
* change ck common utility
* fix

Co-authored-by: JD <Jehandad.Khan@amd.com>
26 lines
928 B
Bash
Executable File
#!/bin/bash
# Disassemble an LLVM bitcode file, run it through inline, SROA, and O3,
# and compile the IR from each stage with llc for gfx906.
# Usage: <this script> <file.bc>

# Remove outputs of a previous run; -f keeps rm quiet when none exist.
rm -f *.ll *.s

BC_FILE="$1"

# Bitcode -> textual IR, then one opt stage per output file.
/opt/rocm/llvm/bin/llvm-dis "$BC_FILE" -o original.ll
/opt/rocm/llvm/bin/opt -S -inline -inline-threshold=104857 original.ll > inline.ll
/opt/rocm/llvm/bin/opt -S -sroa inline.ll > sroa.ll
/opt/rocm/llvm/bin/opt -S -O3 sroa.ll > o3.ll

# Emit assembly for each stage (llc writes <name>.s for <name>.ll).
/opt/rocm/llvm/bin/llc -mcpu=gfx906 original.ll
/opt/rocm/llvm/bin/llc -mcpu=gfx906 inline.ll
/opt/rocm/llvm/bin/llc -mcpu=gfx906 sroa.ll
/opt/rocm/llvm/bin/llc -mcpu=gfx906 o3.ll

# Disabled: repeated -O3 -sroa rounds.
#/opt/rocm/llvm/bin/opt -S -O3 -sroa inline.ll > o3.ll
#/opt/rocm/llvm/bin/opt -S -O3 -sroa o3.ll > o3_2.ll
#/opt/rocm/llvm/bin/opt -S -O3 -sroa o3_2.ll > o3_3.ll
#/opt/rocm/llvm/bin/opt -S -O3 -sroa o3_3.ll > o3_4.ll

# Disabled: gfx908 code generation.
#/opt/rocm/llvm/bin/llc -mcpu=gfx908 opt.ll
#/opt/rocm/llvm/bin/llc -mcpu=gfx908 inline.ll
#/opt/rocm/llvm/bin/llc -mcpu=gfx908 o3.ll
#/opt/rocm/llvm/bin/llc -mcpu=gfx908 o3_2.ll
#/opt/rocm/llvm/bin/llc -mcpu=gfx908 o3_3.ll
#/opt/rocm/llvm/bin/llc -mcpu=gfx908 o3_4.ll
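
The script hard-codes the tool directory and the GPU target, and the commented-out gfx908 lines suggest the target was switched by hand. As a rough sketch only, the same sequence could be driven by two variables and previewed as a dry run before anything is executed. `LLVM_BIN`, `GPU_ARCH`, and `stage_commands` are hypothetical names introduced here for illustration; they are not part of the original script.

```shell
#!/bin/bash
# Hypothetical overrides; the original script hard-codes both values.
LLVM_BIN="${LLVM_BIN:-/opt/rocm/llvm/bin}"
GPU_ARCH="${GPU_ARCH:-gfx906}"

# Print, one per line, the commands the pipeline would run for a bitcode file.
# Nothing is executed; this only mirrors the order used in the script.
stage_commands() {
    local bc_file="$1"
    local stage
    echo "$LLVM_BIN/llvm-dis $bc_file -o original.ll"
    echo "$LLVM_BIN/opt -S -inline -inline-threshold=104857 original.ll > inline.ll"
    echo "$LLVM_BIN/opt -S -sroa inline.ll > sroa.ll"
    echo "$LLVM_BIN/opt -S -O3 sroa.ll > o3.ll"
    # One llc invocation per intermediate IR file.
    for stage in original inline sroa o3; do
        echo "$LLVM_BIN/llc -mcpu=$GPU_ARCH $stage.ll"
    done
}

# Dry run: show the commands for the file given as the first argument, if any.
if [ -n "${1:-}" ]; then
    stage_commands "$1"
fi
```

Running it with `GPU_ARCH=gfx908` set in the environment would print the gfx908 variant without editing the file. On LLVM releases where `opt` no longer accepts legacy pass flags such as `-inline` and `-sroa`, the equivalent spelling is `-passes=inline` and `-passes=sroa`; whether that applies depends on the installed ROCm LLVM version.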