Commit Graph

4 Commits

Author SHA1 Message Date
Khushbu Agarwal
74210a9dfc multi instance generation for CkTileEngine (#2080)
* Add support for multi-instance verification, print detail for each instance, documentation fix

* clang formatted

* Added Readme file

* updated readme

* Addressing review comments

* clang formatted

* Updated ReadMe and GPU reference code

* simplified dispatch kernel code

* indentation

[ROCm/composable_kernel commit: 7cadf187e2]
2025-04-21 08:39:45 -07:00
Khushbu Agarwal
09792fa112 file clang formatted (#2053)
[ROCm/composable_kernel commit: 3bda57c204]
2025-04-03 16:55:49 -07:00
Khushbu Agarwal
844730776f Documentation for newly added struct (#2051)
[ROCm/composable_kernel commit: b443056a26]
2025-04-03 16:24:34 -07:00
Khushbu Agarwal
b85b103194 [New] Build up the feature of CK Tile GEMM CodeGen (#1994)
* New branch for codegen changes

* Fix verify function for int4

* pk_int4 codegen

* Update to review comments

* Remove codegen directory and rename filenames

* Remove extra files; clean up CMake file

* New branch for codegen changes

* Fix verify function for int4

* pk_int4 codegen

* Update to review comments

* Remove codegen directory and rename filenames

* Remove extra files; clean up CMake file

* code changes for single instance

* config file rename, added few more combinations in json file

* Fix cmake file

* Addressing review comments

* Reverting files changed by merge to develop

---------

Co-authored-by: ThomasNing <thomas.ning@amd.com>

[ROCm/composable_kernel commit: fed0709121]
2025-04-03 11:54:12 -07:00