Illia Silin
|
504b101da3
|
upgrade from clang-format-12 to clang-format-18 (#2568)
* upgrade to clang-format-18
* update to clang-format-18 in pre-commit-config
|
2025-07-28 11:34:07 -07:00 |
|
Thrupti Raj Lakshmana Gowda
|
a03682cb80
|
Updating Runtime log for CK Tile Engine (#2431)
* Updating runtime log message for CK TILE ENGINE
* Fixing Clang Format
* Update tile_engine/ops/gemm/README.md
Co-authored-by: Aviral Goel <aviral.goel@amd.com>
---------
Co-authored-by: ThruptiRajLakshmanaGowda <tlakshma@amd.com>
Co-authored-by: Aviral Goel <aviral.goel@amd.com>
|
2025-07-01 10:59:49 -07:00 |
|
Khushbu Agarwal
|
2e38eb4f1c
|
Rotating buffer PR CI fix (#2257)
* Revert "Revert "[CK_tile] Add rotating buffer feature for universal gemm (#2200)" (#2256)"
This reverts commit bbdaf79a52.
* fix regression
|
2025-06-02 10:25:01 -07:00 |
|
Illia Silin
|
bbdaf79a52
|
Revert "[CK_tile] Add rotating buffer feature for universal gemm (#2200)" (#2256)
This reverts commit 99857e10e6.
|
2025-05-28 09:46:52 -06:00 |
|
Khushbu Agarwal
|
99857e10e6
|
[CK_tile] Add rotating buffer feature for universal gemm (#2200)
* Add rotating buffer feature for universal gemm
* adding changes in tile_engine
* Updated code to merge kernel_launch
* removing comments
* Enable rotating buffer changes to flatmm
* Created diff launch_kernel function for rotating buffer
* Simplified calculation using macros
* merge code with new changes in tile_engine
* clang formatted
* Redefine macros
|
2025-05-27 23:00:58 -07:00 |
|
Casey-Shi
|
128f5a1eab
|
[Tile Engine] Add benchmark for tile engine gemm. (#2193)
* initial commit -m benchmark
* only support profile
* fix
* fix doc
* add default config
* add ci
* fix cmake
* tmp save for gen blobs
* fix bug
* merge
* range config
* test success
* fix
* fix
* move struct
* remove config property
* fix config
* remove comment
* add cmake option & modify
* add changelog
* fix
* format
* add pydantic module to the docker image
* fix
* add benchmark for cold and warm up
* python format
* add asm cache control
* fix README
* remove pydantic module
* modify changelog
* fix config
* recover benchmark_gemm and fix
* format python
* refactor profiler
* fix csv bug
* fix codegen bug
* add kernel instance object
* add benchmark gemm executable
* fix jenkins & delete extra header
* disable warning output & enable default config
* Disable sparsity for invalid warp tile combinations
* fix gemm host template func
* refactor gemm profiler
* filter out some instances
* default config test & fix codegen bug
* add sparse flag to gen more instances
---------
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: khuagarw <khuagarw@amd.com>
Co-authored-by: Thomas Ning <Thomas.Ning@amd.com>
|
2025-05-26 22:32:36 -07:00 |
|