blis/frame/include
Arnav Sharma 90f915d3a9 Vectorized and parallelized zdscal routine
- Implemented an optimized intrinsics-based kernel for zdscalv, used when AVX2 is supported.
- Added multithreading support for the same routine.
- The optimal number of threads is calculated from the input size.

AMD-Internal: [CPUPL-2602]
Change-Id: I4d05c3b1cc365a7770703286a89c6dce3875c067
2022-09-30 06:11:07 -04:00
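
For orientation, here is a minimal sketch of what the commit message describes: zdscal scales a double-precision complex vector by a real scalar, and the thread count is derived from the vector length. This is not the code from the commit. The names `zcomplex_t`, `zdscal_ref` and `pick_num_threads`, and the `elems_per_thread` threshold, are hypothetical and chosen only for illustration. The loop is plain scalar C rather than AVX2 intrinsics.

```c
#include <assert.h>
#include <stddef.h>

/* Double-precision complex element stored as (real, imag).
   Hypothetical type for this sketch, not the BLIS dcomplex definition. */
typedef struct { double real; double imag; } zcomplex_t;

/* Reference zdscal: x[i] := alpha * x[i] for n elements with stride incx.
   Because alpha is real, each element needs only two real multiplies. */
static void zdscal_ref(size_t n, double alpha, zcomplex_t *x, size_t incx)
{
    assert(incx > 0);
    for (size_t i = 0; i < n; ++i) {
        x[i * incx].real *= alpha;
        x[i * incx].imag *= alpha;
    }
}

/* Hypothetical heuristic: one thread per elems_per_thread elements,
   clamped to the range [1, max_threads]. The threshold is an assumed
   value, not one taken from the commit. */
static size_t pick_num_threads(size_t n, size_t max_threads)
{
    const size_t elems_per_thread = 10000; /* assumption for illustration */
    assert(max_threads > 0);
    size_t nt = n / elems_per_thread;
    if (nt < 1) nt = 1;
    if (nt > max_threads) nt = max_threads;
    return nt;
}
```

With these assumed values, a small vector stays on one thread to avoid threading overhead, while larger inputs use more threads up to the supplied maximum. A vectorized kernel would replace the inner loop with wide loads, multiplies and stores, handling any leftover elements with a scalar tail.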