blis/bench at 711437651908622f4afff286dbec75c9c19b4748 - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-07-13 18:49:10 +00:00

Files

History

mkadavil d37c91dffa Quantization (scale + zero point) support for BF16 LPGEMM api.

-Quantization of f32 to bf16 (bf16 = (f32 * scale_factor) + zero_point)
instead of just type conversion in aocl_gemm_bf16bf16f32obf16.
-Support for multiple scale/sum/matrix_add/bias post-ops in a single
LPGEMM api call.
-Post-ops mask related fixes in lpgemv kernels .
-Additional scale post-ops sanity checks.

AMD-Internal: [SWLCSG-2945]
Change-Id: I3b35cc413c176bb50bfdbd6acd4839a5ba7e94bb

2024-07-18 05:32:51 -04:00

..

bench_aocl_gemm

Quantization (scale + zero point) support for BF16 LPGEMM api.

2024-07-18 05:32:51 -04:00

bench_amaxv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_axpbyv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_axpyv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_copyv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_dotv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_gemm_pack_compute.c

Code cleanup: No newline at end of file

2023-11-22 17:11:10 -05:00

bench_gemm.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_gemmt.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_gemv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_ger.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_nrm2.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_scalv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_swapv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_syrk.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_trsm.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

bench_trsv.c

Bench APPs - change in Print statement for more params

2024-07-11 02:04:19 -04:00

CMakeLists.txt

CMake: Added logic to link openmp library given through OpenMP_libomp_LIBRARY cmake variable on linux.

2024-06-10 04:41:23 -04:00

inputamaxv.txt

Bench addition for amaxv API

2021-06-04 17:45:04 +05:30

inputaxpbyv.txt

Optimized AXPBYV Kernel using AVX2 Intrinsics

2022-01-05 04:19:11 -05:00

inputaxpyv.txt

Added support to benchmark AXPYV APIs

2024-04-08 00:06:54 -04:00

inputcopy.txt

Added bench utility for copyv API

2021-06-09 12:29:49 +05:30

inputdotv.txt

Support for DOTC in DOTV Bench and DTL updates

2024-04-04 12:27:53 +05:30

inputgemm.txt

AOCL DTL - Added thread and execution time details in logs

2021-11-12 08:58:54 +05:30

inputgemmpackcompute.txt

Code cleanup: No newline at end of file

2023-11-22 17:11:10 -05:00

inputgemmt.txt

Added bench app for syrk - input is a log file generated from AOCL_DTL

2021-05-11 14:57:51 +05:30

inputgemv.txt

Fixed crash issue in bench utility for gemv API

2021-05-19 14:21:09 +05:30

inputger.txt

Added bench utility for ger API.

2021-05-19 14:05:01 +05:30

inputnrm2.txt

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

inputscalv.txt

Added support to benchmark AXPYV APIs

2024-04-08 00:06:54 -04:00

inputswap.txt

Added bench utility for swapv API

2021-06-09 17:05:00 +05:30

inputsyrk.txt

Added bench app for syrk - input is a log file generated from AOCL_DTL

2021-05-11 14:57:51 +05:30

inputtrsm.txt

Trsm bench utility missmatch DTL logs and bench

2021-11-12 08:58:52 +05:30

inputtrsv.txt

Bench trsv logging error

2021-06-08 11:54:55 +05:30

Makefile

Added support to benchmark AXPYV APIs

2024-04-08 00:06:54 -04:00