blis/kernels at 25bab76f586e382f45959c8aa9490ce42c8061ee - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-05-12 18:15:37 +00:00

Files

History

mkadavil c3b97559c1 Zero Point support for <u|s>8s8s<32|16>os8 LPGEMM APIs

-Downscaled / quantized value is calculated using the formula
x' = (x / scale_factor) + zero_point. As it stands, the micro-kernels
for these APIs only support scaling.
Zero point addition is implemented as part of this commit, with it
being fused as part of the downscale post-op in the micro-kernel. The
zero point input is a vector of int8 values, and currently only vector
based zero point addition is supported.
-Bench enhancements to test/benchmark zero point addition.

AMD-Internal: [SWLCSG-2332]
Change-Id: I96b4b1e5a384a4683b50ca310dcfb63debb1ebea

2023-10-10 12:05:47 +05:30

..

New kernel set for Arm SVE using assembly (#396 )

2020-05-21 11:56:45 +05:30

Squash-merge 'pr' into 'squash'. (#457 )

2020-11-14 09:39:48 -06:00

avoid loading twice in armv8a gemm kernel (#403 )

2020-05-21 12:37:53 +05:30

Replaced use of bool_t type with C99 bool.

2020-08-03 11:27:13 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Added a dummy file to kernels/generic.

2017-11-21 12:34:20 -06:00

Fixed incorrect ymm registers usage in FMA operation.

2023-10-02 03:20:44 -04:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Type saga continues; fixed sgemm ukernel signature.

2020-09-12 17:48:15 -05:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Optionally disable trsm diagonal pre-inversion.

2020-12-04 16:08:15 -06:00

BLIS library porting on to Windows:

2020-06-16 18:29:00 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

BLIS: Missing clobbers (batch 6)

2023-08-07 10:52:23 -04:00

Zero Point support for <u|s>8s8s<32|16>os8 LPGEMM APIs

2023-10-10 12:05:47 +05:30

BLIS:merge:

2021-04-27 11:09:48 +05:30

Added support for zen3 configuration

2020-07-22 18:24:26 +05:30

Zero Point support for <u|s>8s8s<32|16>os8 LPGEMM APIs

2023-10-10 12:05:47 +05:30

CMakeLists.txt

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00