Files
blis/kernels
Shubham Sharma b70347d0d4 DGEMMT SUP Optimizations for AVX512
- In DGEMMT SUP AVX2 code path, traingular kernels
  are added in order to avoid temporary C buffer.
- Since these kernels did not exist for AVX512,
  AVX2 kernels were being used in GEMMT.
- AVX512 triangular GEMM kernel has been added
  to make sure that AVX512 kernels can be used without
  creating a temporary buffer.
- This kernel is added only for Lower variant of GEMMT,
   for upper variant of DGEMMT, temporary C buffer is
   created, full GEMM kernel is called on temporary C and
   traingular region from temporary C is copied to C
   buffer.

AMD-Internal: [CPUPL-4881]
Change-Id: Id70645f79ae078ab9a7006e83d328505f1fae8a9
2024-05-03 05:11:11 -04:00
..
2023-11-23 08:54:31 -05:00
2023-11-22 17:51:46 -05:00
2023-11-23 08:54:31 -05:00
2024-05-03 05:10:41 -04:00
2023-11-23 08:54:31 -05:00
2020-07-22 18:24:26 +05:30
2024-05-03 05:11:11 -04:00