blis/kernels at 008b77e94d10f9f474ee499fa55ea38f848326d5 - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-05-13 02:25:39 +00:00

Files

History

Vignesh Balasubramanian 5f9c8c6929 Bugfix : Fallback mechanism in SNRM2 and SCNRM2 kernels if packing fails

- Abstracted packing from the vectorized kernels for SNRM2 and SCNRM2 to
  a layer higher.

- Added a scalar loop to handle compute in case of non-unit strides.
  This loop ensures functionality in case packing fails at the
  framework level.

AMD-Internal: [CPUPL-3633]
Change-Id: I555aea519d7434d43c541bb0f661f81105135b98

2023-11-08 15:16:10 +05:30

..

Added 512b SVE-based a64fx subconfig + SVE kernels.

2021-05-19 09:52:29 -05:00

Squash-merge 'pr' into 'squash'. (#457 )

2020-11-14 09:39:48 -06:00

Armv8A Rename Regs for Safe Darwin Compile

2021-05-29 18:44:47 +09:00

Replaced use of bool_t type with C99 bool.

2020-08-03 11:27:13 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Added a dummy file to kernels/generic.

2017-11-21 12:34:20 -06:00

Merge commit 'e366665c' into amd-main

2023-10-18 09:09:54 -04:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Type saga continues; fixed sgemm ukernel signature.

2020-09-12 17:48:15 -05:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Optionally disable trsm diagonal pre-inversion.

2020-12-04 16:08:15 -06:00

BLIS library porting on to Windows:

2020-06-16 18:29:00 +05:30

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

Merge commit 'e366665c' into amd-main

2023-10-18 09:09:54 -04:00

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00

Remove UT-Austin from copyright headers' clause 3.

2018-12-04 14:31:06 -06:00

BLIS: Missing clobbers (batch 6)

2023-08-07 10:52:23 -04:00

Bugfix : Fallback mechanism in SNRM2 and SCNRM2 kernels if packing fails

2023-11-08 15:16:10 +05:30

BLIS:merge:

2021-04-27 11:09:48 +05:30

Added support for zen3 configuration

2020-07-22 18:24:26 +05:30

Added k=1 avx512 dgemm kernel.

2023-11-07 01:10:09 -05:00

CMakeLists.txt

Code cleanup: No newline at end of file

2023-04-21 10:02:48 -04:00