Balasubramanian, Vignesh
ab4bb2f1e8
Threshold tuning for code-paths and optimal thread selection for ZGEMM(ZEN4)
- Updated the thresholds for entering the AVX512 Tiny and SUP code paths
for ZGEMM (on ZEN4). This caters to inputs that perform well with
single-threaded execution (in the Tiny path) and inputs that
scale well with multithreaded execution (in the SUP path).
- Also updated the thresholds that decide the ideal number of threads,
based on the 'm', 'n' and 'k' values. The thread-setting logic
determines the number of tiles to compute and uses that count
to further tune for the optimal number of threads.
AMD-Internal: [CPUPL-6378][CPUPL-6661]
Co-authored-by: Vignesh Balasubramanian <vignbala@amd.com>
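
The two ideas in the commit message (threshold-based path selection and tile-based thread selection) can be illustrated with a minimal sketch. It is not the code from this commit: the function names `pick_path` and `pick_threads`, the tile sizes, and every threshold value below are hypothetical placeholders chosen only to show the shape of the logic.

```c
#include <assert.h>

/* Hypothetical tile sizes; the actual ZEN4 ZGEMM kernel dimensions may differ. */
#define TILE_M 12
#define TILE_N 4

/* Hypothetical thresholds; the tuned values from the commit are not shown here. */
#define TINY_MAX_WORK 8000LL /* upper bound on m*n*k for the Tiny path */
#define SUP_MAX_DIM   512LL  /* upper bound on each dimension for the SUP path */

typedef enum { PATH_TINY, PATH_SUP, PATH_NATIVE } zgemm_path_t;

/* Choose a code path from the problem size alone (illustrative rule). */
static zgemm_path_t pick_path(long long m, long long n, long long k)
{
    if (m * n * k <= TINY_MAX_WORK)
        return PATH_TINY;   /* small enough to run single-threaded */
    if (m <= SUP_MAX_DIM && n <= SUP_MAX_DIM && k <= SUP_MAX_DIM)
        return PATH_SUP;    /* moderate sizes that scale with threads */
    return PATH_NATIVE;     /* everything else */
}

/* Integer division rounded up. */
static long long ceil_div(long long a, long long b)
{
    assert(b > 0);
    return (a + b - 1) / b;
}

/* Derive a thread count from the number of output tiles. With a short 'k'
   each tile carries less work, so more tiles per thread are required
   (assumed heuristic). */
static long long pick_threads(long long m, long long n, long long k,
                              long long max_threads)
{
    long long tiles = ceil_div(m, TILE_M) * ceil_div(n, TILE_N);
    long long min_tiles_per_thread = (k < 64) ? 8 : 2;
    long long nt = tiles / min_tiles_per_thread;

    if (nt < 1)
        nt = 1;
    if (nt > max_threads)
        nt = max_threads;
    return nt;
}
```

Under these placeholder values, a 120 x 40 problem with k = 16 yields 100 tiles and a suggested 12 threads, while the same shape with a large 'k' would be capped by `max_threads` much sooner.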
2025-07-10 15:35:22 +05:30