Files
blis/kernels
Arnav Sharma 90f915d3a9 Vectorized and parallelized zdscal routine
- Implemented optimized intrinsic kernel for zdscalv for the cases where AVX2 is supported.
- Also added multithreaded support for the same.
- The optimal number of threads is being calculated on the basis of input size.

AMD-Internal: [CPUPL-2602]
Change-Id: I4d05c3b1cc365a7770703286a89c6dce3875c067
2022-09-30 06:11:07 -04:00
..
2022-08-19 12:31:35 -04:00
2020-09-29 16:52:18 -05:00
2022-07-22 03:42:17 -04:00
2021-04-27 11:09:48 +05:30
2020-07-22 18:24:26 +05:30
2022-06-13 10:52:53 +05:30