Files
blis/kernels
Meghana Vankadari d5b4d3aa5e Fixing control flow in aocl_gemm_bf16s4f32of32|bf16
- Fixed framework of bf16s4f32of32 API to correct
  pointer updations.
- Modified pre_op structure to exclude pre-op-offset.
  Now offset is passed as a separate parameter to the
  scale-pack functions.
- Fixed work-distribution among threads in MT scenario.
- Added Blocksizes and kernel-pointers and verified
  functionality for the new API.

AMD-Internal: [SWLCSG-2943]
Change-Id: I58fece240d62c798c880a2b2b7fa64e560cc753d
2024-07-29 05:12:09 -04:00
..
2021-10-08 02:35:58 +09:00
2023-11-22 17:51:46 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2020-07-22 18:24:26 +05:30