Johannes Graner
580a54b400
Update pre-commit to fixed versions, run remod for ck_tile ( #2895 )
...
* Fix ruff linter errors
* Fix remod dos2unix command
* Clang format
* Ignore utility in remod
* Run remod
* Specify clang-format version in pre-commit
* Specify ruff version
* Include PoolKernelArgs in reference_pool
* Add calculate_total_elements to reference batched contraction
* Fix calculate_total_elements declaration
* Refactor remod pre-commit hook
* Fix Aquant tests
---------
Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com >
[ROCm/composable_kernel commit: d40b50b9d5 ]
2025-10-16 15:29:17 -07:00
linqunAMD
da4657eaca
[CK_TILE] Enable ck_tile tests on gfx11 and gfx12 ( #2821 )
...
* [CK_TILE] Enable ck_tile test on gfx11 & gfx12
* revert an unnecessary change
* enable pk_int4 on gfx11 & gfx12
* revert .pre-commit-config.yaml
[ROCm/composable_kernel commit: b0ee317d83 ]
2025-09-12 12:45:14 -07:00
linqunAMD
01b8e6a80f
[CK_TILE] Refine Generic2dBlockShape to fix ck_tile example 2,10,11,14 on rdna3 and 4 ( #2795 )
...
BlockWarps, WarpTile in Generic2dBlockShape are wave size dependent, it causes mangled name mismatch between host and device side.
Solution: Replace them with ThreadPerBlock and move BlockWarps, WarpTile calculation into Generic2dBlockShape
[ROCm/composable_kernel commit: c254f3d7b4 ]
2025-09-10 08:29:20 +08:00
linqunAMD
807f7510b5
Support Wave32 in CK_TILE - Part 1 ( #2594 )
...
* Support wave32/wave64 in CK_TILE - Part 1
* remove blocksize in kernel launch
* fix build error
* fix clang format
* fix clang format 2
* fix clang format 3
* fix fmha build error
* fix fmha build 2
* fix fmha build 3
* fix build error 4
* address review comment
* update change log
* replace KernelBlockSize with kBlockSize
* fix CI fail
* fix clang format
* address review comment and rebase code.
* fix universal test fail
---------
Co-authored-by: Lin, Qun <Quentin.Lin+amdeng@amd.com >
Co-authored-by: Thomas Ning <Thomas.Ning@amd.com >
[ROCm/composable_kernel commit: 9fcc1ee9fd ]
2025-08-18 10:08:31 -07:00
Illia Silin
3345f5f417
upgrade from clang-format-12 to clang-format-18 ( #2568 )
...
* upgrade to clang-format-18
* update to clang-format-18 in pre-commit-config
[ROCm/composable_kernel commit: 504b101da3 ]
2025-07-28 11:34:07 -07:00
Cong Ma
545819c362
[CK_TILE] Migrate CK Tile examples to Tests to autorun on CI ( #2421 )
...
[CK_TILE] Add new ck tile unit test
* Add new ck tile unit test smoke-gemm-universal
* Add new ck tile unit test smoke-gemm-basic
* Add new ck tile unit test topk_softmax
* Add new ck tile unit test add_rmsnorm2d_rdquant_fwd
[ROCm/composable_kernel commit: f102eedfb3 ]
2025-07-22 08:15:18 -06:00