Vignesh Balasubramanian
f23b8e636b
AVX2 and AVX512 optimizations for DAXPYV
...
- Removed some of the unrolling factors that affected the
performance of AVX2 DAXPYV kernel. In addition to improving
the current performance on sizes compatible to single-threaded
runs, this will now perform better for tiny sizes as well
since the overhead to reach the computation is less.
- Updated the vector partitioning logic, by using
bli_thread_range_sub( ... ), which ensures that there is no
false sharing among multiple threads.
- Updated the AOCL-DYNAMIC logic for the API, to include thresholds
or zen4 and zen5 micro-architectures.
AMD-Internal: [CPUPL-5514]
Change-Id: Iee9edddac685334213cd6694421ab3df3547e930
2024-07-31 09:24:36 -04:00
..
2023-11-09 15:49:45 +05:30
2023-11-09 15:49:45 +05:30
2024-07-26 10:36:37 -04:00
2023-11-23 08:54:31 -05:00
2024-05-21 11:13:28 +05:30
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-07-22 11:32:19 +05:30
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-07-31 09:24:36 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-07-24 08:23:07 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-05-08 04:54:05 -04:00
2024-05-08 01:46:17 -04:00
2023-11-23 08:54:31 -05:00
2024-07-08 06:09:11 -04:00
2023-11-23 08:54:31 -05:00
2024-07-09 07:07:24 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-22 17:11:10 -05:00
2023-11-22 17:11:10 -05:00
2023-11-09 18:53:59 +05:30
2023-11-22 17:11:10 -05:00
2023-11-09 18:53:59 +05:30
2023-11-22 17:11:10 -05:00
2024-07-08 06:09:11 -04:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2024-05-21 11:13:28 +05:30
2023-04-21 10:02:48 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-08-21 07:01:38 -04:00
2020-11-18 12:55:36 +05:30
2024-05-21 11:13:28 +05:30
2023-04-21 10:02:48 -04:00
2024-05-06 12:57:38 -04:00
2023-04-21 10:02:48 -04:00
2024-07-17 00:27:47 +05:30
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2024-04-12 07:26:31 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-07-08 06:09:11 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00