Bartłomiej Kocot
7fbc8a9ac1
[CK TILE] GEMM and Batched GEMM SplitK support (#1724)
* [CK TILE] Add split K support in GEMM
* Updates
* Fixes
* rebase
* fix
* Fix
* fixes
* support for batched gemm
[ROCm/composable_kernel commit: af66494880]
2024-12-28 14:40:17 +01:00
aledudek
d9025d054d
[CK TILE] Refactor GemmKernel to be reused by other GEMM related operators (#1730)
* Gemm Kernel Refactor part1
* Gemm Kernel Refactor common gemm pipeline part2
* [CK TILE] Refactor batched gemm to reuse GemmKernel
* [CK TILE] Refactor GemmKernel - review changes part1
* [CK TILE] Refactor GemmKernel - references fix
* [CK TILE] Refactor GemmKernel - naming changes, add problem
* [CK_TILE] Refactor GemmKernel - update tests
* [CK_TILE] Refactor GemmKernel - review changes
* [CK_TILE] Refactor GemmKernel - update test
* [CK_TILE] Refactor GemmKernel - constness fixes
* [CK_TILE] Refactor GemmKernel - update tests
[ROCm/composable_kernel commit: 453ca37347]
2024-12-18 17:52:46 +01:00
aledudek
ce345cc50e
[CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm (#1743)
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review changes
* [CK_TILE] Move hipmalloc/memcpy calls out of gpu reference gemm - review fix
[ROCm/composable_kernel commit: f6c4d614e3]
2024-12-18 09:45:58 +01:00
aledudek
ed385de9b2
Ck tile batched gemm example (#1615)
* [CK Tile] Batched GEMM Example
* [CK Tile] Batched GEMM Example - minor refactor
* [CK Tile] Batched GEMM Example - README update
* [CK Tile] Batched Gemm Example - review changes
- Added tensor data layouts as input parameters
- Changed structure of Host and Kernel args
- Removed bug with invalid vector read on non-contiguous memory
* [CK Tile] Batched Gemm Example - remove comment
* [CK Tile] Batched Gemm Example - Add GTests part1
* [CK Tile] Batched Gemm Example - GTests part2 + review changes
* [CK TILE] Batched GEMM post merge fixes
* [CK Tile] Batched GEMM Example - fix pad views
[ROCm/composable_kernel commit: 78f0fea08e]
2024-11-29 11:52:18 +01:00