composable_kernel

mirror of https://github.com/ROCm/composable_kernel.git synced 2026-05-05 14:11:29 +00:00

Files

Adam Osewski d00e6115b9 Gemm reduce examples int4/int8/fp32/bf16 (#368)

* GEMM + Reduce max fp16+fp32

* GEMM + Max bf16 + int8

* Refactor common definitions.

* Refactor common func of mean meansquare example.

* More examples for mean meansquare.

* Update int8 examples and skip them because of random errors.

* Int4 examples.

* Fix examples for max int4/8

* Tensor conversion for int4 input data for mean meansquare example.

* Remove int4 mean_meansquare example

* Fix int8 mean_meansquare example.

- All ReductionAccData and R<N>DataType have to be F32. The INT32 data
type is giving wrong results.

* Guard int4 with ifdef

* Change int8 example to add_addsquare due to division rounding error.

* Clang format

* Change the return type of common function.

* Get back int8 example with division.

* Remove int8 mean meansquare.

* Use proper cast for BF16 data type.

* Use ck::literals.

* Use proper data type for host tensors & reference.

- Use ReduceAccDataType for reference gemm output data type.
- Cast host reference output tensor to EDataType
- Fix ifdefs for int4.

Co-authored-by: Adam Osewski <aosewski@amd.com>

2022-08-30 11:38:26 -05:00

CMakeLists.txt

Gemm reduce examples int4/int8/fp32/bf16 (#368)

2022-08-30 11:38:26 -05:00

gemm_add_add_mean_meansquare_xdl_fp16.cpp

Gemm multiple d multiple r (#335)

2022-08-13 01:07:12 -05:00

gemm_add_addsquare_xdl_int8.cpp

Gemm reduce examples int4/int8/fp32/bf16 (#368)

2022-08-30 11:38:26 -05:00

gemm_max_xdl_bf16.cpp

Gemm reduce examples int4/int8/fp32/bf16 (#368)