Files
blis/frame/base
Vignesh Balasubramanian 02da190560 AVX512 optimizations for DNRM2
- Implemented bli_dnorm2fv_unb_var1_avx512( ... ) AVX512
  computational kernel for DNRM2 API.

- Updated the header to include this kernel signature, as well
  as the framework layer to use this function in case of ZEN4
  and ZEN5 configurations.

- Updated the tipping points for ideal thread setting in DNRM2
  for ZEN5 micro-architecture. These thresholds are specific
  to the library's linkage to LLVM's OpenMP or GNU's OpenMp.

- Further abstracted the AOCL-DYNAMIC logic to separate functions
  for ?NRM2 APIs that currently support it(namely, DNRM2 and ZNRM2).

- Further updated the ?NRM2 framework to accommodate the necessary
  changes to invoke the newer AOCL-DYNAMIC functions and the AVX512
  kernel, when needed.

- Added micro-kernel and memory tests for this kernel in GTestsuite,
  to validate accuracy and out-of-bounds read and write.

AMD-Internal: [CPUPL-5265]
Change-Id: I4fc0d0f1e6906bf27d46562ca387c338cc4d2049
2024-06-24 08:50:36 -04:00
..
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-06-18 19:31:17 +05:30
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-08-16 08:09:01 -04:00
2024-01-17 11:41:15 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-05-08 12:51:32 -04:00
2024-05-08 12:51:32 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2021-04-27 18:16:44 -05:00
2023-11-23 08:54:31 -05:00
2024-05-08 12:51:32 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-06-24 08:50:36 -04:00
2024-06-24 08:50:36 -04:00
2023-11-23 08:54:31 -05:00
2024-05-08 12:51:32 -04:00