Files
blis/frame/base
Meghana Vankadari eea347b02e Added dynamic threading support for GEMM SUP code path
Details:
- Introduced new feature called AOCL_DYNAMIC.
- When this macro is defined, Optimum number of threads to solve DGEMM
  is estimated based on the dimensions (M,N,K).
- Range of optimum number of threads will be [1, num_threads],
  where "num_threads" is number of threads set by the application.
- Num_threads is derived from either environment variable "OMP_NUM_THREADS
  or BLIS_NUM_THREADS' or bli_set_num_threads() API.
- Only local copy of rntm is modified by AOCL_DYNAMIC feature.
  global_rntm data structure remains unchanged in order to keep track of
  original number of threads set by application.
- Optimum number of threads calculation is done only for SUP.
- Since 'native' code path handles larger problem sizes, we use max
  number of threads recommended by the application.

AMD-Internal: [CPUPL-1376]
Change-Id: I665ce14543d6719857d70325c4a9f959c08e66e3
2021-05-07 09:52:51 +05:30
..
2021-03-08 19:04:17 +05:30
2021-03-08 19:04:17 +05:30
2021-03-08 19:04:17 +05:30
2021-03-08 19:04:17 +05:30
2019-10-11 11:53:51 -05:00
2021-04-27 11:09:48 +05:30
2021-04-29 12:35:30 +05:30
2021-04-27 11:09:48 +05:30
2021-04-27 11:09:48 +05:30
2021-04-27 11:09:48 +05:30
2019-10-11 11:53:51 -05:00
2019-10-11 11:53:51 -05:00
2019-10-11 11:53:51 -05:00
2019-10-11 11:53:51 -05:00
2021-03-08 19:04:17 +05:30