mirror of
https://github.com/amd/blis.git
synced 2026-05-11 17:50:00 +00:00
- The new AMAXV adheres to the BLAS definition of ISAMAX by not handle NaN separately. In the previous kernel, NaN is considered the smallest element of all the elements in the array. - The new logic uses two helper functions - bli_vec_absmax_double and bli_vec_search_double. - bli_vec_absmax_double finds the absolute largest element and the index range in which the first occurence of this element can be found. - bli_vec_search_double returns the index of the first occurence of the absolute value of an element. - AMAXV uses these two helper functions to find the absolute largest element and then searches using bli_vec_search_double in the reduced range provided by bli_vec_absmax_double. - Added condition check for n == 1 in BLAS layer. It is an optimization mention in the BLAS standard API definition. - Removed redundant n == 0 condition check from the kernel. This is a BLAS exception and is already done in the BLAS layer. - Removed AVX2 flag check from the BLAS layer. Kernels will be picked based on the architecture ID in the new design. AMD-Internal: [CPUPL-2773] Change-Id: Ida2dae84a60742e632dc810ab1b7b80fc354e178