Files
blis/kernels
Rayan, Rohan 88745841e0 Adding the cgemmt and sgemm memory fix patches
Fixing a memory issue in the cgemm zen4 packing kernel
In the loop section where the leftover m and k iterations were handled, the load operations (in the k-direction) were missing the mask instructions which has now been added.
Resolved memory-access issues in the SGEMM SUP kernels on AVX2 and AVX-512 by correcting instructions that could read invalid addresses in the C matrix.
Standardized all instruction macros for sgemm to lowercase in the Zen4 kernel to improve readability and code consistency.

AMD-Internal: CPUPL-8117
AMD-Internal: CPUPL-8189
Co-authored-by: Rohan Rayan rohrayan@amd.com
2026-03-30 11:43:04 +05:30
..
2021-10-08 02:35:58 +09:00
2024-08-05 15:35:08 -04:00
2025-09-04 17:14:06 +01:00
2024-08-05 15:35:08 -04:00
2024-08-05 15:35:08 -04:00
2023-11-23 08:54:31 -05:00
2020-07-22 18:24:26 +05:30