mirror of https://github.com/amd/blis.git
synced 2026-05-11 17:50:00 +00:00
More improvements to DTL coverage and coding:

- Removed some DTL overheads from performance stats timing for all APIs where it is currently implemented (i.e. gemm, gemmt, trsm, nrm2)
- Expand logging coverage to gemm pack and compute APIs, including performance stats for gemm_compute
- Expand logging coverage to rot, rotg, rotm and rotmg APIs
- Tidied order of function prototypes in aocl_dtl/aocldtl_blis.h

AMD-Internal: [CPUPL-7010]