Files
blis/frame/3
Arnav Sharma 4aace5f524 Smart Threading for SGEMM SUP for Zen4 Architecture
- Added Smart Threading logic for AVX-512 based SGEMM SUP.
- Calculating ic and jc for optimal work distribution to the allocated
  threads based on logic similar to Zen3.
- Zen4 Architecture specific Native-to-SUP check has been added to
  redirect few Native inputs to the SUP path based on the fact that in a
  multi-threaded environment some Native cases perfom better as SUP.
- For the same, the SUP thresholds, namely, BLIS_MT and BLIS_NT have
  been increased from 512 and 200 to 682 and 512, respectively.
- Further optimizations to the work distribution logic will be added
  subsequently.

AMD-Internal: [CPUPL-3248]
Change-Id: Ibccbbefef251010ec94bd37ffc86c35b7866a5ca
2023-04-21 12:54:03 +05:30
..
2023-04-19 12:44:56 -04:00
2023-04-19 12:44:56 -04:00
2023-04-19 12:44:56 -04:00
2023-04-19 12:44:56 -04:00
2023-04-19 12:44:56 -04:00
2023-04-19 12:44:56 -04:00
2023-04-19 12:44:56 -04:00
2023-04-19 12:44:56 -04:00
2022-10-21 07:38:39 -04:00
2023-01-09 04:34:52 -05:00
2022-10-14 12:43:35 +05:30
2022-10-14 12:43:35 +05:30
2022-10-14 12:43:35 +05:30
2022-10-14 12:43:35 +05:30
2021-04-27 11:09:48 +05:30
2021-04-29 12:35:30 +05:30
2022-05-17 18:10:39 +05:30