Mateusz Ozga
b63e4bc4b8
CK-Tile Grouped GEMM refactor and post PR fixes ( #1756 )
...
* Grouped gemm simple code refactor
* Offset invoker
* Invoke generic Run, and replace name of parrtitioner variable
* Tests fix type
* Removed namespaces
* Add template param to avoid implicit cast
* Remove generic function
* Constant value
* underline enum to int16_t
* Generalize partitioner function
* Remove whitespaces
* Rename function
* Using support
* Clang-format
* Clang-format
* Fn-partitioner description fn
* Typo
* Typo 2
* Better description
* Better description
* Refactor after review
* Use ctr instead of set fn
* Inovke ctr and typo
* Comments
* Remove unnecessary comment
* Review, remove modulo
[ROCm/composable_kernel commit: 3c93d3c444 ]
2025-01-21 21:06:10 +01:00
Bartłomiej Kocot
7fbc8a9ac1
[CK TILE] GEMM and Batched GEMM SplitK support ( #1724 )
...
* [CK TILE] Add split K support in GEMM
* Updates
* Fixes
* rebase
* fix
* Fix
* fixes
* support for batched gemm
[ROCm/composable_kernel commit: af66494880 ]
2024-12-28 14:40:17 +01:00
aledudek
d9025d054d
[CK TILE] Refactor GemmKernel to be reused by other GEMM related operators ( #1730 )
...
* Gemm Kernel Refactor part1
* Gemm Kernel Refactor common gemm pipeline part2
* [CK TILE] Refactor batched gemm to reuse GemmKernel
* [CK TILE] Refactor GemmKernel - review changes part1
* [CK TILE] Refactor GemmKernel - references fix
* [CK TILE] Refactor GemmKernel - naming changes, add problem
* [CK_TILE] Refactor GemmKernel - update tests
* [CK_TILE] Refactor GemmKernel - review changes
* [CK_TILE] Refactor GemmKernel - update test
* [CK_TILE] Refactor GemmKernel - constness fixes
* [CK_TILE] Refactor GemmKernel - update tests
[ROCm/composable_kernel commit: 453ca37347 ]
2024-12-18 17:52:46 +01:00
aledudek
ed385de9b2
Ck tile batched gemm example ( #1615 )
...
* [CK Tile] Batched GEMM Example
* [CK Tile] Batched GEMM Example - minor refactor
* [CK Tile] Batched GEMM Example - README update
* [CK Tile] Batched Gemm Example - review changes
- Added tensor data layours as input parameters
- Changed structure of Host and Kernel args
- Removed bug with invalid vector read on non-contiguous memory
* [CK Tile] Batched Gemm Example - remove comment
* [CK Tile] Batched Gemm Example - Add GTests part1
* [CK Tile] Batched Gemm Example - GTests part2 + review changes
* [CK TILE] Batched GEMM post merge fixes
* [CK Tile] Batched GEMM Example - fix pad views
[ROCm/composable_kernel commit: 78f0fea08e ]
2024-11-29 11:52:18 +01:00