aledudek
|
634634f5c0
|
[CK_TILE] Blockwise GEMM pipeline v6 - port of v5 from old CK (#2955)
* First checkpoint
* Second checkpoint - hot loop scheduler
* Third checkpoint - init main operator
* Fourth checkpoint - main loop ready
* Fifth checkpoint - main loop fix
* Sixth checkpoint - ReadWritecompFunc
* Seventh checkpoint - Tail finished
* [CK_TILE] Blockwise gemm pipeline v5 complete
* Working
* Working fixes 2
* Rename v5 to v77 temporarily
* Data type adjustment
* Data type adjustment 2
* [CK_TILE] Blockwise Gemm pipeline v5 add tests
* [CK_TILE] Fix calculation error
* TEMP: check pipeline
* Fix name to V6
* naming and documentation changes
* WIP dump
* Try fixing v1
* Failing tests v5
* Debugging
* Changes v2
* F16 tests working great
* Working BlockwiseGemmPipelineV5 as V6
* Cleanup and format
* Merging changes part1
* [CK_TILE] Blockwise Gemm Pipeline Comp V5/V6
* Remove commented code
* Fix gfx950 build issues
* Fix file formatting
* Review changes, more concat info, add bf16 bf8 tests
* Fix formatting
* Add bf16 and bf8 tests
---------
Co-authored-by: Adam Osewski <Adam.Osewski@amd.com>
|
2025-10-13 13:57:37 +02:00 |
|