amd/blis - blis - Public git mirror

mirror of https://github.com/amd/blis.git synced 2026-05-05 23:11:15 +00:00

Author	SHA1	Message	Date
Deepak Negi	3a7523b51b	Element-wise post-op APIs are upgraded with new post-ops. Description: 1. Added new output types for the f32 element-wise APIs to support s8, u8, s32, and bf16 outputs. 2. Updated the base f32 API to support all the post-ops supported in the gemm APIs. AMD Internal: [SWLCSG-3384] Change-Id: I1a7caac76876ddc5a121840b4e585ded37ca81e8	2025-02-10 01:06:39 -05:00
Deepak Negi	615789e196	Fixed compilation issue with clang 18 on Windows. Description: The enum AOCL_PARAMS_STORAGE_TYPES declared a member named FLOAT, which the clang 18 compiler under MSVC rejected with a multiple-definition error. FLOAT and BFLOAT16 are replaced with AOCL_GEMM_<F32/BF16>. AMD-Internal: CPUPL-6174 Change-Id: Ic061af068854d51629b82b495efd0eb54543f329	2024-12-12 06:37:06 -05:00
Deepak Negi	6dcf500703	Element-wise operations API for float (f32) input matrix in LPGEMM. This API supports applying element-wise operations (e.g., post-ops) on a float (f32) input matrix to get an output matrix of the same type (float (f32)). AMD Internal: [SWLCSG-2947] Change-Id: I387a544f0d33d2231f5f6a92e212f17b1103dd24	2024-08-27 03:28:52 -04:00
mkadavil	f040ba617f	Element-wise operations API for bfloat16 input matrix in LPGEMM. - This API supports applying element-wise operations (e.g., post-ops) on a bfloat16 input matrix to get an output matrix of the same (bfloat16) or an upscaled (float) data type. - A benchmarking/testing framework for the same is added. AMD Internal: SWLCSG-2947 Change-Id: I43f1c269be1a1997d4912d8a3a97be5e5f3442d2	2024-08-05 07:17:08 -04:00

4 Commits