mirror of
https://github.com/amd/blis.git
synced 2026-05-12 10:05:38 +00:00
- Added the correct strides to be used while unreorder/convert B matrix in m=1 cases. - Modified Zero point vector loads to proper instructions. - Modified bf16 store in AVX2 GEMV M kenrel AMD Internal - [SWLCSG - 3602 ]