Files
blis/frame/base
Balasubramanian, Vignesh ab4bb2f1e8 Threshold tuning for code-paths and optimal thread selection for ZGEMM(ZEN4)
- Updated the thresholds to enter the AVX512 Tiny and SUP codepaths
  for ZGEMM(on ZEN4). This caters to inputs that perform well on
  a single-threaded execution(in the Tiny-path), and inputs that
  scale well with multithreaded-execution(in the SUP path).

- Also updated the thresholds to decide ideal threads, based on
  'm', 'n' and 'k' values. The thread-setting logic involves
  determining the number of tiles for computation, and using them
  to further tune for the optimal number of threads.

AMD-Internal: [CPUPL-6378][CPUPL-6661]

Co-authored-by: Vignesh Balasubramanian <vignbala@amd.com>
2025-07-10 15:35:22 +05:30
..
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-08-06 06:56:01 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-05-08 12:51:32 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2021-04-27 18:16:44 -05:00
2023-11-23 08:54:31 -05:00
2024-05-08 12:51:32 -04:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2023-11-23 08:54:31 -05:00
2024-05-08 12:51:32 -04:00
2024-08-05 15:35:08 -04:00