blis/frame at 77161c1e5d218822d5f378d59f17618e2b185cc1 - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-05-13 02:25:39 +00:00

Vignesh Balasubramanian bd0b50a077 Introduced fast-path to kernels in DNRM2_ and DZNRM2_ APIs

- Added a conditional check to determine whether the vectorized
  kernels for DNRM2_ and DZNRM2_ can be called directly, without
  incurring any framework overhead.

- The fast-path is taken when the problem size is such that the
  ideal number of threads is 1 and the vector has unit stride
  (so that packing at the framework level can be avoided).

AMD-Internal: [CPUPL-4045]
Change-Id: Ie37e86f802ada0e226dff88e74f0341e97ebfe28

2023-11-09 21:13:10 +05:30

..

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

Added Parameter Checks and DTL Trace for Extension APIs

2023-11-09 18:53:59 +05:30

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

Added Parameter Checks and DTL Trace for Extension APIs

2023-11-09 18:53:59 +05:30

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

Added Parameter Checks and DTL Trace for Extension APIs

2023-11-09 18:53:59 +05:30

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30

Added Parameter Checks and DTL Trace for Extension APIs

2023-11-09 18:53:59 +05:30

Introduced fast-path to kernels in DNRM2_ and DZNRM2_ APIs

2023-11-09 21:13:10 +05:30

CMakeLists.txt

CMake: Adding new portable CMake system.

2023-11-09 15:49:45 +05:30