mirror of
https://github.com/amd/blis.git
synced 2026-05-12 01:59:59 +00:00
- Added the correct strides to be used while unreordering/converting the B matrix in m=1 cases.
- Modified zero-point vector loads to use the proper instructions.
- Modified the bf16 store in the AVX2 GEMV M kernel.

AMD Internal - [SWLCSG - 3602]