mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-20 12:59:49 +00:00
Enabling diff datatypes for tile_engine and build with more granularity (#2392)
* merging recent changes to universal gemm to tile_engine
* Reducing Linking time by generating less intermediate files
* make small libs to build faster
* Reducing the instances
* reducing instances
* Restoring default config
* Restoring default config
* warp_n reverted in default config
* Adding diff json files for fp8 and fp16, cmake changes for fp8
* Restructure the CMake File
* Added more granularity for build and some debugging code
* removed some of debugging statements
* added fp8 instances
* tahe datatype from command line to enable both type of json files
* updated README file
* code cleanup
* code cleanup
* updated jenkinsfile
* enable tile_engine daily builds
* updating cmake file
* updated CMakeLists.txt
* Updating CMake code fixing gfx12 build
* Updating CMake code fixing gfx12 build
* Fix CMake file null checks
* fixed traces of rebase
* Update tile_engine/ops/gemm/README.md
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
* Update tile_engine/ops/gemm/README.md
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
* Update tile_engine/ops/gemm/README.md
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
* fixing rebase issue
---------
Co-authored-by: khushbu <khuagarw@gmail.com>
Co-authored-by: ThomasNing <thomas.ning@amd.com>
Co-authored-by: illsilin_amdeng <Illia.Silin@amd.com>
Co-authored-by: AviralGoelAMD <aviral.goel@amd.com>
Co-authored-by: spolifroni-amd <Sandra.Polifroni@amd.com>
[ROCm/composable_kernel commit: a14753b86f]
This commit is contained in:
@@ -199,7 +199,7 @@ warp_tile_supported_combinations = {
|
||||
[64, 4, 16],
|
||||
],
|
||||
"fp8_fp8_fp16": [[32, 32, 16], [32, 32, 32]],
|
||||
"fp8_fp8_fp16": [[32, 32, 16], [32, 32, 32]],
|
||||
"bf8_bf8_fp16": [[32, 32, 16], [32, 32, 32]],
|
||||
},
|
||||
"gfx942": {
|
||||
"fp16_fp16_fp16": [
|
||||
@@ -219,7 +219,7 @@ warp_tile_supported_combinations = {
|
||||
[64, 4, 16],
|
||||
],
|
||||
"fp8_fp8_fp16": [[32, 32, 16], [32, 32, 32], [16, 16, 32], [16, 16, 64]],
|
||||
"fp8_fp8_fp16": [[32, 32, 16], [32, 32, 32], [16, 16, 64], [16, 16, 32]],
|
||||
"bf8_bf8_fp16": [[32, 32, 16], [32, 32, 32], [16, 16, 64], [16, 16, 32]],
|
||||
"int8_int8_int32": [[16, 16, 32], [32, 32, 16]],
|
||||
},
|
||||
"gfx950": {
|
||||
@@ -247,7 +247,7 @@ warp_tile_supported_combinations = {
|
||||
[16, 16, 128],
|
||||
[32, 32, 64],
|
||||
],
|
||||
"fp8_fp8_fp16": [
|
||||
"bf8_bf8_fp16": [
|
||||
[32, 32, 16],
|
||||
[32, 32, 32],
|
||||
[16, 16, 64],
|
||||
|
||||
Reference in New Issue
Block a user