blis/frame/thread at f69f59c32c0839ba67985e058006da59fd22d2bc - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-05-12 10:05:38 +00:00

Files

History

mkadavil 457c33a601 Eliminating barriers in SUP path when matrices are not packed.

-Current gemm SUP path uses bli_thrinfo_sup_grow, bli_thread_range_sub
to generate per thread data ranges at each loop of gemm algorithm.
bli_thrinfo_sup_grow involves usage of multiple barriers for cross
thread synchronization. These barriers are necessary in cases where
either the A or B matrix are packed for centralized pack buffer
allocation/deallocation (bli_thread_am_ochief thread).

-However for cases where both A and B matrices are unpacked, these
barrier are resulting in overhead for smaller dimensions. Here creation
of unnecessary communicators are avoided and subsequently the
requirement for barriers are eliminated when packing is disabled for
both the input matrices in SUP path.

Change-Id: Ic373dfd2d6b08b8f577dc98399a83bb08f794afa

2022-01-06 01:56:43 -05:00

..

This check in has changes w.r.t Copyright information, which is changed to (start year) - 2019

2019-05-27 16:24:43 +05:30

bli_l3_decor_openmp.c

Support multithreading within the sup framework.

2020-08-06 10:09:28 +05:30

bli_l3_decor_openmp.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_decor_pthreads.c

Support multithreading within the sup framework.

2020-08-06 10:09:28 +05:30

bli_l3_decor_pthreads.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_decor_single.c

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_decor_single.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_decor.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_sup_decor_openmp.c

Support multithreading within the sup framework.

2020-08-06 10:09:28 +05:30

bli_l3_sup_decor_openmp.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_sup_decor_pthreads.c

Support multithreading within the sup framework.

2020-08-06 10:09:28 +05:30

bli_l3_sup_decor_pthreads.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_sup_decor_single.c

Rebased amd-staging-milan-3.0 branch on master

2020-08-06 10:09:29 +05:30

bli_l3_sup_decor_single.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_l3_sup_decor.h

Support multithreading within the sup framework.

2020-08-06 10:09:28 +05:30

bli_pthread.c

Disabled _self() and _equal() in bli_pthread API.

2021-03-12 19:47:39 -06:00

bli_pthread.h

Disabled _self() and _equal() in bli_pthread API.

2021-03-12 19:47:39 -06:00

bli_thrcomm_openmp.c

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_thrcomm_openmp.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_thrcomm_pthreads.c

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_thrcomm_pthreads.h

Replaced use of bool_t type with C99 bool.

2020-08-03 11:27:13 +05:30

bli_thrcomm_single.c

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_thrcomm_single.h

Replaced use of bool_t type with C99 bool.

2020-08-03 11:27:13 +05:30

bli_thrcomm.c

Cleaned up bool_t usage and various typecasts.

2020-08-03 11:23:40 +05:30

bli_thrcomm.h

"Merge Selective Packing code from amd branch flame/blis"

2020-08-06 10:09:28 +05:30

bli_thread.c

Multi-threaded BLIS - OpenMP

2021-06-17 05:17:37 -04:00

bli_thread.h

Fix dgemm_ Multi-thread running as Single Thread

2021-06-15 12:14:11 +05:30

bli_thrinfo_sup.c

Eliminating barriers in SUP path when matrices are not packed.

2022-01-06 01:56:43 -05:00

bli_thrinfo_sup.h

Support multithreading within the sup framework.

2020-03-13 01:09:29 -04:00

bli_thrinfo.c

Replaced use of bool_t type with C99 bool.

2020-08-03 11:27:13 +05:30

bli_thrinfo.h

Removed export macros from all internal prototypes.

2020-08-03 11:47:18 +05:30

CMakeLists.txt

AOCL Windows: 3.1 BLIS changes

2021-03-08 19:04:17 +05:30