blis/bench at 6132194468ca45b786ff89fd006c6d9aa7e1d27a - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-05-24 18:34:40 +00:00

Files

History

mkadavil c3b97559c1 Zero Point support for <u|s>8s8s<32|16>os8 LPGEMM APIs

-Downscaled / quantized value is calculated using the formula
x' = (x / scale_factor) + zero_point. As it stands, the micro-kernels
for these APIs only support scaling.
Zero point addition is implemented as part of this commit, with it
being fused as part of the downscale post-op in the micro-kernel. The
zero point input is a vector of int8 values, and currently only vector
based zero point addition is supported.
-Bench enhancements to test/benchmark zero point addition.

AMD-Internal: [SWLCSG-2332]
Change-Id: I96b4b1e5a384a4683b50ca310dcfb63debb1ebea

2023-10-10 12:05:47 +05:30

..

bench_aocl_gemm

Zero Point support for <u|s>8s8s<32|16>os8 LPGEMM APIs

2023-10-10 12:05:47 +05:30

bench_amaxv.c

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

bench_axpbyv.c

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

bench_copyv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_dotv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_gemm.c

Integrated 32x6 DGEMM kernel for zen4 and its related changes are added.

2023-01-19 23:11:36 +05:30

bench_gemmt.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_gemv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_ger.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_nrm2.c

Adding AVX2 support for DNRM2

2022-09-20 06:05:01 -04:00

bench_scalv.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_swapv.c

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

bench_syrk.c

AOCL-WINDOWS: Added the windows build system to build bench folder on windows.

2022-06-27 22:32:39 -04:00

bench_trsm.c

Fixed Bug in bench_trsm.c

2022-07-25 15:38:30 +00:00

bench_trsv.c

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

CMakeLists.txt

Adding nrm2 target for benchmarking on Windows.

2023-07-10 14:03:05 -04:00

inputamaxv.txt

Bench addition for amaxv API

2021-06-04 17:45:04 +05:30

inputaxpbyv.txt

Optimized AXPBYV Kernel using AVX2 Intrinsics

2022-01-05 04:19:11 -05:00

inputcopy.txt

Added bench utility for copyv API

2021-06-09 12:29:49 +05:30

inputdotv.txt

Added bench utility for dotv and scalv APIs.

2021-05-21 10:00:32 +05:30

inputgemm.txt

AOCL DTL - Added thread and execution time details in logs

2021-11-12 08:58:54 +05:30

inputgemmt.txt

Added bench app for syrk - input is a log file generated from AOCL_DTL

2021-05-11 14:57:51 +05:30

inputgemv.txt

Fixed crash issue in bench utility for gemv API

2021-05-19 14:21:09 +05:30

inputger.txt

Added bench utility for ger API.

2021-05-19 14:05:01 +05:30

inputnrm2.txt

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

inputscalv.txt

Added bench utility for dotv and scalv APIs.

2021-05-21 10:00:32 +05:30

inputswap.txt

Added bench utility for swapv API

2021-06-09 17:05:00 +05:30

inputsyrk.txt

Added bench app for syrk - input is a log file generated from AOCL_DTL

2021-05-11 14:57:51 +05:30

inputtrsm.txt

Trsm bench utility missmatch DTL logs and bench

2021-11-12 08:58:52 +05:30

inputtrsv.txt

Bench trsv logging error

2021-06-08 11:54:55 +05:30

Makefile

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00