mirror of
https://github.com/amd/blis.git
synced 2026-04-19 23:28:52 +00:00
Details: - In FP32 GEMM interface, mtag_b is being set to PACK by default. This is leading to packing of B matrix even though packing is not absolutely required leading to perf regression. - Setting mtag_b to PACK only if it is absolutely necessary to pack B matrix modified check conditions before packing appropriately. AMD-Internal - [SWLCSG-3575]