blis/addon at d69473c8f72735bf19fa41b43042ec6f941bcd5b - blis

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-07-17 09:07:31 +00:00

Files

eashdash 32a9e735f1 BF16 Output downscaling functionality

- BF16 instructions output is accumulated at a higher precision of
FP32 which needs to be converted to a lower precison of bf16 post
the GEMM operations. This is required in AI workloads where both
input and output are in BF16 format.
- BF16 downscaling is implemented as post-ops inside the GEMM
microkernels.

Change-Id: Id1606746e3db4f3ed88cba385a7709c8604002a8

2022-08-30 13:46:09 -04:00

aocl_gemm

BF16 Output downscaling functionality

2022-08-30 13:46:09 -04:00

gemmd

Added support for addons.

2022-03-31 12:03:27 +05:30