carlushuang
5b9b083dbc
[CK_TILE] not using structures under ck_tile/ops for ck_tile/host (#1834)
* not using structures under ck_tile/ops for ck_tile/host
* update as constexpr function
* Rename fn
* Update other examples.
---------
Co-authored-by: Adam Osewski <19374865+aosewski@users.noreply.github.com>
Co-authored-by: Adam Osewski <Adam.Osewski@amd.com>
2025-01-24 15:35:54 +08:00
Mateusz Ozga
3c93d3c444
CK-Tile Grouped GEMM refactor and post PR fixes (#1756)
* Grouped gemm simple code refactor
* Offset invoker
* Invoke generic Run, and replace name of partitioner variable
* Tests fix type
* Removed namespaces
* Add template param to avoid implicit cast
* Remove generic function
* Constant value
* Underlying enum type to int16_t
* Generalize partitioner function
* Remove whitespaces
* Rename function
* Using support
* Clang-format
* Clang-format
* Fn-partitioner description fn
* Typo
* Typo 2
* Better description
* Better description
* Refactor after review
* Use ctr instead of set fn
* Invoke ctr and typo
* Comments
* Remove unnecessary comment
* Review, remove modulo
2025-01-21 21:06:10 +01:00
Bartłomiej Kocot
bdddf1eace
[CK_TILE] Add error threshold calculation for gemm examples (#1821)
2025-01-18 01:01:52 +01:00
Bartłomiej Kocot
af66494880
[CK TILE] GEMM and Batched GEMM SplitK support (#1724)
* [CK TILE] Add split K support in GEMM
* Updates
* Fixes
* rebase
* fix
* Fix
* fixes
* support for batched gemm
2024-12-28 14:40:17 +01:00
aledudek
453ca37347
[CK TILE] Refactor GemmKernel to be reused by other GEMM related operators (#1730)
* Gemm Kernel Refactor part1
* Gemm Kernel Refactor common gemm pipeline part2
* [CK TILE] Refactor batched gemm to reuse GemmKernel
* [CK TILE] Refactor GemmKernel - review changes part1
* [CK TILE] Refactor GemmKernel - references fix
* [CK TILE] Refactor GemmKernel - naming changes, add problem
* [CK_TILE] Refactor GemmKernel - update tests
* [CK_TILE] Refactor GemmKernel - review changes
* [CK_TILE] Refactor GemmKernel - update test
* [CK_TILE] Refactor GemmKernel - constness fixes
* [CK_TILE] Refactor GemmKernel - update tests
2024-12-18 17:52:46 +01:00
aledudek
f6c4d614e3
[CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm (#1743)
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review changes
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review fix
2024-12-18 09:45:58 +01:00
aledudek
78f0fea08e
Ck tile batched gemm example (#1615)
* [CK Tile] Batched GEMM Example
* [CK Tile] Batched GEMM Example - minor refactor
* [CK Tile] Batched GEMM Example - README update
* [CK Tile] Batched Gemm Example - review changes
- Added tensor data layouts as input parameters
- Changed structure of Host and Kernel args
- Removed bug with invalid vector read on non-contiguous memory
* [CK Tile] Batched Gemm Example - remove comment
* [CK Tile] Batched Gemm Example - Add GTests part1
* [CK Tile] Batched Gemm Example - GTests part2 + review changes
* [CK TILE] Batched GEMM post merge fixes
* [CK Tile] Batched GEMM Example - fix pad views
2024-11-29 11:52:18 +01:00