blis/frame/base
Harsh Dave 75356d45e5 DGEMM improvement for very tiny sizes less than 24.
- This commit improves performance for very small inputs by
reducing framework checks and routing all such inputs to
bli_dgemm_tiny_6x8_kernel. It forces single-threaded computation
for such sizes.

- It invokes bli_dgemm_tiny_6x8_kernel for the ZEN, ZEN2, ZEN3 and
ZEN4 code paths, except when the AOCL_ENABLE_INSTRUCTIONS environment
variable is set to avx512. In that case, such small inputs are
routed to the bli_dgemm_tiny_24x8_kernel AVX512 kernel.
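
The routing described above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: `tiny_route_t`, `is_tiny`, `select_route` and `select_route_from_env` are hypothetical names, and the sketch assumes that "tiny" means all of m, n and k are below 24 and that the environment variable is compared case-sensitively against `avx512`. Neither assumption is confirmed by the commit message.

```c
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical route tags; they only stand in for the real kernels. */
typedef enum {
    ROUTE_FRAMEWORK,  /* regular framework path, threading decided there */
    ROUTE_TINY_6X8,   /* stands in for bli_dgemm_tiny_6x8_kernel */
    ROUTE_TINY_24X8   /* stands in for bli_dgemm_tiny_24x8_kernel (AVX512) */
} tiny_route_t;

/* Threshold taken from the commit title ("sizes less than 24"). */
#define TINY_DIM_LIMIT 24

/* Assumption: a problem is tiny when m, n and k are all below the limit. */
bool is_tiny(int m, int n, int k)
{
    return m < TINY_DIM_LIMIT && n < TINY_DIM_LIMIT && k < TINY_DIM_LIMIT;
}

/*
 * Pick a route for one DGEMM call.
 * is_zen_family: true on the ZEN, ZEN2, ZEN3 and ZEN4 code paths.
 * enable_instructions: value of AOCL_ENABLE_INSTRUCTIONS, or NULL if unset.
 */
tiny_route_t select_route(int m, int n, int k,
                          bool is_zen_family,
                          const char *enable_instructions)
{
    if (!is_zen_family || !is_tiny(m, n, k))
        return ROUTE_FRAMEWORK;

    /* Assumption: exact, case-sensitive match on "avx512". */
    if (enable_instructions != NULL &&
        strcmp(enable_instructions, "avx512") == 0)
        return ROUTE_TINY_24X8;

    return ROUTE_TINY_6X8;
}

/* Convenience wrapper that reads the environment variable itself. */
tiny_route_t select_route_from_env(int m, int n, int k, bool is_zen_family)
{
    return select_route(m, n, k, is_zen_family,
                        getenv("AOCL_ENABLE_INSTRUCTIONS"));
}
```

Passing the environment value in as a parameter keeps the decision testable without touching the process environment. Both tiny routes imply single-threaded execution, so the sketch does not model threading separately.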

AMD-Internal: [CPUPL-1701]
Change-Id: Idf59f4a8ee76ee8f2514a33be2b618e3ce02383e
2023-11-08 23:45:57 -05:00