Files
blis/addon
Vankadari, Meghana 8968973c2d Performance fix for FP32 GEMV (#47)
Details:
- In FP32 GEMM interface, mtag_b is being set to PACK by default.
  This is leading to packing of B matrix even though packing is not
  absolutely required leading to perf regression.
- Setting mtag_b to PACK only if it is absolutely necessary to pack B matrix
  modified check conditions before packing appropriately.

AMD-Internal - [SWLCSG-3575]
2025-06-10 14:54:01 +05:30
..
2024-08-05 15:35:08 -04:00
2024-08-05 15:35:08 -04:00