mirror of
https://github.com/amd/blis.git
synced 2026-03-18 08:17:22 +00:00
- Currently TRSM reference kernels are derived from GEMM blocksizes and GEMM_UKR. - This does not allow the flexibility to use different GEMM_UKR for GEMM and TRSM if optimized TRSM_UKR are not available. - Made changes so that ref TRSM kernels are derived from TRSM blocksizes. - Changed ZEN4 and ZEN5 cntx to use AVX2 kernels for CTRSM. AMD-Internal: [SWLCSG-3702]