Max Podkorytov
f38751fc2a
invoke script directly (#2687)
2025-08-19 00:23:07 -07:00
Thrupti Raj Lakshmana Gowda
0f3083ab5c
[CKTILE] Layout Support for CK Tile engine (#2482)
* Updating runtime log message for CK TILE ENGINE
* CKTile layout from config
* CKTile custom config for CI
* Documentation for Layout Changes
* CKTile Layout changes to Jenkins
* Fixing Clang Format
* Changes to Jenkins file to fix error
* fix(cmake-ck-dev): no longer sets invalid values as gpu arch
* style(py files): ruff formatting
* fix(cmake-ck-release): no longer sets invalid values as gpu arch
* chore(cmake-tile_engine): add reminder to uncomment user config json
* Changes to Jenkins file to address more cases
* Changes to Jenkins to fix Error
* Changes to Jenkins file for fixing an error
* Update Jenkinsfile (#2517)
* Update Jenkinsfile
---------
Co-authored-by: ThruptiRajLakshmanaGowda <tlakshma@amd.com>
Co-authored-by: AviralGoelAMD <aviral.goel@amd.com>
Co-authored-by: Thomas Ning <Thomas.Ning@amd.com>
2025-07-17 12:19:41 -07:00
Thrupti Raj Lakshmana Gowda
a03682cb80
Updating Runtime log for CK Tile Engine (#2431)
* Updating runtime log message for CK TILE ENGINE
* Fixing Clang Format
* Update tile_engine/ops/gemm/README.md
Co-authored-by: Aviral Goel <aviral.goel@amd.com>
---------
Co-authored-by: ThruptiRajLakshmanaGowda <tlakshma@amd.com>
Co-authored-by: Aviral Goel <aviral.goel@amd.com>
2025-07-01 10:59:49 -07:00
Khushbu Agarwal
a14753b86f
Enabling diff datatypes for tile_engine and build with more granularity (#2392)
* merging recent changes to universal gemm to tile_engine
* Reducing Linking time by generating less intermediate files
* make small libs to build faster
* Reducing the instances
* reducing instances
* Restoring default config
* Restoring default config
* warp_n reverted in default config
* Adding diff json files for fp8 and fp16, cmake changes for fp8
* Restructure the CMake File
* Added more granularity for build and some debugging code
* removed some of debugging statements
* added fp8 instances
* take datatype from command line to enable both types of json files
* updated README file
* code cleanup
* code cleanup
* updated jenkinsfile
* enable tile_engine daily builds
* updating cmake file
* updated CMakeLists.txt
* Updating CMake code fixing gfx12 build
* Updating CMake code fixing gfx12 build
* Fix CMake file null checks
* fixed traces of rebase
* Update tile_engine/ops/gemm/README.md
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
* Update tile_engine/ops/gemm/README.md
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
* Update tile_engine/ops/gemm/README.md
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
* fixing rebase issue
---------
Co-authored-by: khushbu <khuagarw@gmail.com>
Co-authored-by: ThomasNing <thomas.ning@amd.com>
Co-authored-by: illsilin_amdeng <Illia.Silin@amd.com>
Co-authored-by: AviralGoelAMD <aviral.goel@amd.com>
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
2025-06-25 15:18:24 -07:00
Khushbu Agarwal
2e38eb4f1c
Rotating buffer PR CI fix (#2257)
* Revert "Revert "[CK_tile] Add rotating buffer feature for universal gemm (#2200)" (#2256)"
This reverts commit bbdaf79a52.
* fix regression
2025-06-02 10:25:01 -07:00
Casey-Shi
29574f05f7
change from ninja to make (#2253)
2025-05-28 09:25:05 -07:00
Illia Silin
bbdaf79a52
Revert "[CK_tile] Add rotating buffer feature for universal gemm (#2200)" (#2256)
This reverts commit 99857e10e6.
2025-05-28 09:46:52 -06:00
Khushbu Agarwal
99857e10e6
[CK_tile] Add rotating buffer feature for universal gemm (#2200)
* Add rotating buffer feature for universal gemm
* adding changes in tile_engine
* Updated code to merge kernel_launch
* removing comments
* Enable rotating buffer changes to flatmm
* Created diff launch_kernel function for rotating buffer
* Simplified calculation using macros
* merge code with new changes in tile_engine
* clang formatted
* Redefine macros
2025-05-27 23:00:58 -07:00
Casey-Shi
128f5a1eab
[Tile Engine] Add benchmark for tile engine gemm. (#2193)
* initial commit -m benchmark
* only support profile
* fix
* fix doc
* add default config
* add ci
* fix cmake
* tmp save for gen blobs
* fix bug
* merge
* range config
* test success
* fix
* fix
* move struct
* remove config property
* fix config
* remove comment
* add cmake option & modify
* add changelog
* fix
* format
* add pydantic module to the docker image
* fix
* add benchmark for cold and warm up
* python format
* add asm cache control
* fix README
* remove pydantic module
* modify changelog
* fix config
* recover benchmark_gemm and fix
* format python
* refactor profiler
* fix csv bug
* fix codegen bug
* add kernel instance object
* add benchmark gemm executable
* fix jenkins & delete extra header
* disable warning output & enable default config
* Disable sparsity for invalid warp tile combinations
* fix gemm host template func
* refactor gemm profiler
* filter out some instances
* default config test & fix codegen bug
* add sparse flag to gen more instances
---------
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: khuagarw <khuagarw@amd.com>
Co-authored-by: Thomas Ning <Thomas.Ning@amd.com>
2025-05-26 22:32:36 -07:00
Aviral Goel
1aea51d34e
[Tile Engine] Improved README.md (#2134)
* improved tile_engine readme
* changed ck tile explanation and json
* further improved readme
* fixed typo
2025-04-29 17:37:07 -07:00
Khushbu Agarwal
768c99eca9
[TileEngine] Support for sparsity in codegen (#2128)
* Added sparsity flag in codegen
* remove comments
* clang formatted
* added sparsity as runtime argument
* updated README
* updated stream config variable
* fix typo for tail_num in hot loop
2025-04-28 18:19:23 -07:00
Khushbu Agarwal
7cadf187e2
multi instance generation for CkTileEngine (#2080)
* Add support for multi-instance verification, print detail for each instance, documentation fix
* clang formatted
* Added Readme file
* updated readme
* Addressing review comments
* clang formatted
* Updated ReadMe and GPU reference code
* simplified dispatch kernel code
* indentation
2025-04-21 08:39:45 -07:00