mirror of
https://github.com/amd/blis.git
synced 2026-05-11 17:50:00 +00:00
- On the zen4 architecture, TRSM and GEMM use different blocksizes. A TRSM call updates the blocksizes in the global cntx object, which leaves them incorrect for GEMM when GEMM and TRSM are called in parallel.
- Hence, use a local copy of cntx to hold the blocksizes instead of updating the global object.

AMD-Internal: [CPUPL-3019]
Change-Id: I5f0f5675b3917d2a11d582ac626ca5d8f4752c53
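
The idea described above can be sketched in a few lines of C. This is a minimal illustration under stated assumptions, not the code from this change: `ctx_t`, `trsm_ctx_in_place`, `trsm_ctx_local`, and all blocksize values are hypothetical stand-ins and do not correspond to the actual BLIS `cntx_t` layout or API.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified stand-in for a context object: it holds only
 * three cache blocksizes. The real cntx object is far richer. */
typedef struct {
    size_t mc;
    size_t kc;
    size_t nc;
} ctx_t;

/* Made-up TRSM blocksizes, chosen only to differ from the GEMM ones. */
enum { TRSM_MC = 64, TRSM_KC = 128, TRSM_NC = 2048 };

/* Problematic pattern: TRSM overwrites the shared context. A GEMM call
 * running concurrently on another thread may then read TRSM blocksizes. */
void trsm_ctx_in_place(ctx_t *global)
{
    assert(global != NULL);
    global->mc = TRSM_MC;
    global->kc = TRSM_KC;
    global->nc = TRSM_NC;
}

/* Safer pattern: copy the shared context by value and modify only the
 * copy. The shared object is read but never written, so GEMM callers
 * keep seeing their own blocksizes. */
ctx_t trsm_ctx_local(const ctx_t *global)
{
    assert(global != NULL);
    ctx_t local = *global;   /* struct assignment copies every field */
    local.mc = TRSM_MC;
    local.kc = TRSM_KC;
    local.nc = TRSM_NC;
    return local;
}
```

A TRSM entry point would call `trsm_ctx_local(&shared)` once and pass the returned copy down its own call chain, while GEMM continues to read `shared` unchanged. The copy costs one small struct assignment per call, and in this sketch it needs no locking because the shared object is only read.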