blis/kernels at 90f30e4c37531ff2e077f0b1bd2a057dba8e272a - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-05-12 01:59:59 +00:00

managalv 90f30e4c37 Optimised dotv kernel by SIMD approach and by removing framework overhead

Details:
    - For complex float and complex double precisions, the kernel is called directly from the API call to avoid framework overhead.
    - Added SIMD code for complex float and complex double, and unrolled the for loop 5 times to improve performance.
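
The two points above can be illustrated with a minimal sketch in plain C. This is not the code from this commit: the type `cfloat_t` and the function `cdotu_unroll5` are hypothetical names, the sketch assumes unit-stride vectors and an unconjugated product, and it interprets "unrolled 5 times" as five elements per loop iteration. It uses five independent accumulators that a compiler may vectorize, rather than explicit SIMD intrinsics.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical single-precision complex type; not a BLIS type. */
typedef struct { float real; float imag; } cfloat_t;

/* Hypothetical helper: unconjugated dot product rho = sum_i x[i] * y[i]
 * over unit-stride vectors, with the main loop unrolled by 5. */
cfloat_t cdotu_unroll5(size_t n, const cfloat_t *x, const cfloat_t *y)
{
    assert(n == 0 || (x != NULL && y != NULL));

    /* One accumulator pair per unrolled lane, so the additions in a
     * single iteration do not depend on each other. */
    float re[5] = {0.0f};
    float im[5] = {0.0f};
    size_t i = 0;

    /* Main loop: five elements per iteration. */
    for (; i + 5 <= n; i += 5) {
        for (size_t k = 0; k < 5; ++k) {
            const cfloat_t a = x[i + k];
            const cfloat_t b = y[i + k];
            re[k] += a.real * b.real - a.imag * b.imag;
            im[k] += a.real * b.imag + a.imag * b.real;
        }
    }

    /* Remainder loop: fewer than five elements are left. */
    for (; i < n; ++i) {
        re[0] += x[i].real * y[i].real - x[i].imag * y[i].imag;
        im[0] += x[i].real * y[i].imag + x[i].imag * y[i].real;
    }

    /* Reduce the five partial sums into the final result. */
    cfloat_t rho;
    rho.real = re[0] + re[1] + re[2] + re[3] + re[4];
    rho.imag = im[0] + im[1] + im[2] + im[3] + im[4];
    return rho;
}
```

Calling such a routine directly from the API entry point, instead of going through a generic dispatch layer, is the kind of overhead removal the first bullet describes. Because the partial sums are combined in a different order than in a simple sequential loop, floating-point results may differ in the last bits.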

AMD-Internal: [CPUPL-1057]

Change-Id: I3b9d202398cacc0168882c9d6da2b450c27466a0

2020-10-13 18:59:31 +05:30

New kernel set for Arm SVE using assembly (#396)

2020-05-21 11:56:45 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Avoid loading twice in armv8a gemm kernel (#403)

2020-05-21 12:37:53 +05:30

Replaced use of bool_t type with C99 bool.

2020-08-03 11:27:13 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Added a dummy file to kernels/generic.

2017-11-21 12:34:20 -06:00

Added debug trace and log support for gemmt and TRSM APIs

2020-10-02 12:31:47 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Added missing schema arg to knl packm kernels.

2019-09-17 18:00:29 -05:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Replaced use of bool_t type with C99 bool.

2020-08-03 11:27:13 +05:30

BLIS library porting on to Windows:

2020-06-16 18:29:00 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Reverted minor temp/wspace changes from b426f9e.

2019-11-04 13:57:12 -06:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Optimised dotv kernel by SIMD approach and by removing framework overhead

2020-10-13 18:59:31 +05:30

Optimised dotv kernel by SIMD approach and by removing framework overhead

2020-10-13 18:59:31 +05:30

Added support for zen3 configuration

2020-07-22 18:24:26 +05:30

CMakeLists.txt

BLIS library porting on to Windows:

2020-06-16 18:29:00 +05:30