mirror of
https://github.com/amd/blis.git
synced 2026-05-11 17:50:00 +00:00
Also enabled weighted partitioning for herk, trmm Fixed bug where multiple threads would try to modify the same state in the internal level 3 functions Correctly computed a_next and b_next for gemm, herk macrokernels a_next and b_next point to the current micropanels in trmm