mirror of https://github.com/ROCm/composable_kernel.git, synced 2026-04-01 06:09:13 +00:00
[CK_TILE, CK_BUILDER] Add bwd data to CK Tile profiler (#5516)

## Motivation

We want to close the performance gap between old CK and CK Tile for bwd data convolutions. To achieve this, we need two things:

- Configurations for the old CK kernel instances such that we can map them into CK Tile instances.
- Support in the CK profiler to run the CK Tile instances with the same API as for old CK instances.

## Technical Details

Extracted kernel configurations from old CK. The codegen Python script for CK Tile convs is extended to also support bwd data. The generated instances are added to the CMake build (target `device_grouped_conv_bwd_data_tile_instances`). A new profiler op (`grouped_conv_bwd_data_tile`) has been added to the CK Profiler. The API is the same as for old CK's profiler op `grouped_conv_bwd_data`.
117 lines
1.8 KiB
Plaintext
# Compiled Object files
*.slo
*.lo
*.o
*.obj

# Precompiled Headers
*.gch
*.pch
*.ipch

# Compiled Dynamic libraries
*.so
*.dylib
*.dll

# Fortran module files
*.mod

# Compiled Static libraries
*.lai
*.la
*.a
*.lib

# Executables
*.exe
*.out
*.app

# vim tags
tags
.tags
.*.swp

# Editors
.vscode

# CMake formatting configuration (local)
.cmake-format.yaml

# Cline
.cline*

# build-in-source directory (see exceptions below)
build*

# emacs temporary/backup files
.\#*
\#*\#
*~

# GDB temporary files
.gdb_history
install.dir*

# documentation artifacts
_build/
_images/
_static/
_templates/
_toc.yml
_doxygen/
docs/doxygen/html
docs/doxygen/xml

# JetBrains IDE (see build* exceptions below)
.idea/
cmake-build*/
build*/

# LSP configuration
.clangd

# User-defined CMake presets
CMakeUserPresets.json

# Python virtualenv
.venv/

# Python cache
__pycache__/

# Cache directories
.cache/
.ck_tile_cache/
ck_tile_cache/
**/kernel_cache/
**/.kernel_cache/

# Dispatcher kernel cache (user-generated, can be large)
dispatcher/**/kernel_cache/
dispatcher/**/.kernel_cache/
dispatcher/**/cached_kernels/
dispatcher/**/*.hsaco
dispatcher/**/*.co

# Dispatcher generated JSON exports
dispatcher/**/*_kernels.json
dispatcher/**/dispatcher_kernels.json

# Generated test data
test_data/*
!test_data/*.py
!test_data/*.sh
!test_data/requirements.txt

# Exceptions to build* patterns above
# The experimental/builder directory should be tracked despite matching build*
!experimental/builder
!experimental/builder/**
experimental/grouped_convolution_tile_instances/instances/*
!experimental/grouped_convolution_tile_instances/instances/*.in
!experimental/grouped_convolution_tile_instances/instances/*.inc
!experimental/grouped_convolution_tile_instances/instances/*.hpp
experimental/grouped_convolution_tile_instances/*.inc