mirror of
https://github.com/amd/blis.git
synced 2026-05-12 10:05:38 +00:00
For the 4x8 kernel, cs_b was used instead of cs_a to calculate the addresses of the diagonal elements of matrix A. This change corrects that mistake.

Change-Id: Ie74e0f6a397fcd32fefb5804cd00f1e90bfe5523
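
To illustrate why the stride matters, here is a minimal sketch in C of how the address of a diagonal element of A is typically derived from A's own strides. It is not the kernel code from this commit. The helper names diag_offset and diag_elem are hypothetical, and rs_a is assumed to be the row stride of A, following the usual BLIS convention where cs_a is the column stride.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not from the BLIS sources): element offset of
 * A(i, i), given A's row stride rs_a and column stride cs_a. */
static size_t diag_offset(size_t i, size_t rs_a, size_t cs_a)
{
    return i * rs_a + i * cs_a;
}

/* Hypothetical helper: read diagonal element i of A.
 * Passing B's column stride (cs_b) as cs_a here would reproduce the
 * kind of mistake described above whenever cs_b differs from cs_a. */
static double diag_elem(const double *a, size_t i, size_t rs_a, size_t cs_a)
{
    assert(a != NULL);
    return a[diag_offset(i, rs_a, cs_a)];
}
```

For a column-major 4x4 A with rs_a = 1 and cs_a = 4, diag_offset(2, 1, 4) gives 10. Substituting a column stride of 8, as could happen if B's stride were used by mistake, gives 18, which points at a different element or past the end of A.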