Files
blis/frame/compat
Harihara Sudhan S 828ac8e2dd Partial completion of work in L1 APIs
- Partial completion of compute was happening since BLIS was unable
  to launch the required number of threads. This was because rntm
  was returning a thread count greater than the maximum number of
  threads that can be launched in the subsequent parallel region.
- Added 'omp_get_num_threads' inside the parallel regions to get the
  actual number of threads spawned. The work distribution happens
  based on the actual number of threads launched in that region.

AMD-Internal: [CPUPL-3268]
Change-Id: I086ad4b9b644f966b7bab439e43222396f0c2bf0
2023-04-27 15:17:26 +05:30
..
2021-03-08 19:04:17 +05:30
2022-10-21 07:38:39 -04:00