blis/bench at 3a3472e4e5f51b8b1d6b9e2966596e5b51b517d3 - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-05-11 17:50:00 +00:00

mkadavil bf4d1da1b9 Column major input support for BFloat16 gemm.

- The bf16 gemm framework now handles column-major inputs by swapping the
  input matrices and computing the gemm on their transposes (which are row
  major in memory) using the existing row-major kernels. The output is
  written to the C matrix as if it were transposed.
- Framework changes to support leading dimensions that are greater than
  the matrix widths.
- Bench changes to test low-precision gemm for column-major inputs.
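
The swap described in this commit message relies on the identity C^T = B^T * A^T: a column-major matrix with leading dimension ld occupies the same memory as its row-major transpose with row stride ld. The following is a minimal sketch of that idea only, not the AOCL implementation. It uses plain float instead of bf16, and the names gemm_row_major and gemm_col_major are hypothetical stand-ins for the real kernels and framework entry points.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for an existing row-major kernel (not AOCL code):
 * C[m x n] = A[m x k] * B[k x n], all stored row major.
 * lda, ldb, ldc are row strides and may exceed the matrix widths. */
static void gemm_row_major(size_t m, size_t n, size_t k,
                           const float *a, size_t lda,
                           const float *b, size_t ldb,
                           float *c, size_t ldc)
{
    assert(lda >= k && ldb >= n && ldc >= n);
    for (size_t i = 0; i < m; ++i) {
        for (size_t j = 0; j < n; ++j) {
            float acc = 0.0f;
            for (size_t p = 0; p < k; ++p)
                acc += a[i * lda + p] * b[p * ldb + j];
            c[i * ldc + j] = acc; /* padding beyond column n is untouched */
        }
    }
}

/* Hypothetical column-major entry point:
 * C[m x n] = A[m x k] * B[k x n], all stored column major, with
 * lda >= m, ldb >= k, ldc >= m.
 * Viewed as row-major memory, the buffers hold A^T, B^T and C^T, so we
 * swap the operands and the m/n dimensions and compute
 * C^T[n x m] = B^T[n x k] * A^T[k x m] with the row-major kernel. */
static void gemm_col_major(size_t m, size_t n, size_t k,
                           const float *a, size_t lda,
                           const float *b, size_t ldb,
                           float *c, size_t ldc)
{
    gemm_row_major(n, m, k, b, ldb, a, lda, c, ldc);
}
```

No data is copied or transposed in this sketch: only the argument order changes, which is why the leading dimensions have to be passed through unchanged and are allowed to be larger than the matrix widths.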

AMD-Internal: [CPUPL-2570]
Change-Id: I22c76f52619fd76d0c0e41531828b437a1935495

2022-09-22 02:50:46 -04:00

bench_aocl_gemm

Column major input support for BFloat16 gemm.

2022-09-22 02:50:46 -04:00

bench_amaxv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_axpbyv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_copyv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_dotv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_gemm.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_gemmt.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_gemv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_ger.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_nrm2.c

Adding AVX2 support for DNRM2

2022-09-20 06:05:01 -04:00

bench_scalv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_swapv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_syrk.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_trsm.c

Fixed Bug in bench_trsm.c

2022-07-25 15:38:30 +00:00

bench_trsv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

CMakeLists.txt

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

inputamaxv.txt

Bench addition for amaxv API

2021-06-04 17:45:04 +05:30

inputaxpbyv.txt

Optimized AXPBYV Kernel using AVX2 Intrinsics

2022-01-05 04:19:11 -05:00

inputcopy.txt

Added bench utility for copyv API

2021-06-09 12:29:49 +05:30

inputdotv.txt

Added bench utility for dotv and scalv APIs.

2021-05-21 10:00:32 +05:30

inputgemm.txt

AOCL DTL - Added thread and execution time details in logs

2021-11-12 08:58:54 +05:30

inputgemmt.txt

Added bench app for syrk - input is a log file generated from AOCL_DTL

2021-05-11 14:57:51 +05:30

inputgemv.txt

Fixed crash issue in bench utility for gemv API

2021-05-19 14:21:09 +05:30

inputger.txt

Added bench utility for ger API.

2021-05-19 14:05:01 +05:30

inputnrm2.txt

Adding AVX2 support for DNRM2

2022-09-20 06:05:01 -04:00

inputscalv.txt

Added bench utility for dotv and scalv APIs.

2021-05-21 10:00:32 +05:30

inputswap.txt

Added bench utility for swapv API

2021-06-09 17:05:00 +05:30

inputsyrk.txt

Added bench app for syrk - input is a log file generated from AOCL_DTL

2021-05-11 14:57:51 +05:30

inputtrsm.txt

Trsm bench utility mismatch DTL logs and bench

2021-11-12 08:58:52 +05:30

inputtrsv.txt

Bench trsv logging error

2021-06-08 11:54:55 +05:30

Makefile

Adding AVX2 support for DNRM2

2022-09-20 06:05:01 -04:00