blis/bench/bench_aocl_gemm/bench_batch_input.txt
V, Varsha 1f9d1a85d3 Updated aocl_batch_gemm_ APIs aligning to CBLAS batch API. (#58)

 - Modified the batch GEMM API to align with the cblas_?gemm_batch_ API
 by adding a group_size parameter to the existing APIs.
 - Updated the bench batch_gemm code to match the new API definition.
 - Modified the hardcoded number in the lpgemm_postop file.
 - Added an early return for the case where group_count or group_size is negative.

AMD-Internal: [ SWLCSG - 3592 ]
2025-06-30 11:16:04 +05:30
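
The grouping described in the commit message can be illustrated with a small sketch. It assumes the layout used by CBLAS-style batch APIs, where the per-matrix arrays are flat and group i owns the next group_size[i] entries. The function name group_ranges is hypothetical and is not part of the AOCL API; returning None only models the early return on negative values.

```python
from typing import List, Optional, Tuple


def group_ranges(group_count: int,
                 group_size: List[int]) -> Optional[List[Tuple[int, int]]]:
    """Map each group to its (start, end) slice of the flat matrix arrays.

    Hypothetical helper: returns None to model an early return when
    group_count or any group_size entry is negative.
    """
    if group_count < 0:
        return None
    sizes = group_size[:group_count]
    if any(size < 0 for size in sizes):
        return None

    ranges = []
    start = 0
    for size in sizes:
        # Group i covers matrices start .. start + size - 1.
        ranges.append((start, start + size))
        start += size
    return ranges
```

For example, two groups of sizes 3 and 5 would cover indices 0 to 2 and 3 to 7 of the flat arrays, for 8 matrices in total.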


f32f32f32of32:group_count=1
group_size=3
r t t n n 92 1479 589 92 589 1479 scale=vector,zp=vector,bias=na,clip
s8s8s32obf16:group_count=1
group_size=5
r n n n r 67 21 1823 1823 21 21 scale=vector,zp=scalar,relu,clip
f32f32f32of32:group_count=1
group_size=7
r n t n n 43 2240 1553 1553 1553 2240 scale=vector,zp=scalar,relu,clip
bf16bf16f32obf16:group_count=1
group_size=6
r n n n r 79 2676 1995 1995 2676 2676 bias=na,swish
bf16bf16f32of32:group_count=1
group_size=6
r t n n r 143 1943 730 143 1943 1943 bias=na,clip
bf16s4f32of32:group_count=1
group_size=6
r t n n r 79 1177 1968 79 1177 1177 scale=vector,zp=scalar,relu,clip
bf16s4f32obf16:group_count=1
group_size=6
r n n n r 17 2714 468 468 2714 2714 scale=vector,zp=vector,bias=na
s8s8s32obf16:group_count=1
group_size=4
r n n n n 43 2240 1553 1553 2240 2240 scale=vector,zp=scalar,relu,clip
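
A minimal sketch of how a file in this layout could be read, inferred only from the lines above: a header of the form `<type tag>:group_count=N`, then a `group_size=K` line followed by one parameter line. The class and function names are hypothetical and this is not the bench's actual parser. The five single-letter flags and six integers are kept uninterpreted because the file does not name them, and the handling of more than one group per header is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Group:
    group_size: int
    flags: List[str]     # five single-letter flags, left uninterpreted
    dims: List[int]      # six integers, left uninterpreted
    post_ops: List[str]  # comma-separated trailing field, may be empty


@dataclass
class Record:
    type_tag: str
    group_count: int
    groups: List[Group]


def parse_bench_input(text: str) -> List[Record]:
    records: List[Record] = []
    pending_size: Optional[int] = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if ":group_count=" in line:
            # Header line starts a new record.
            tag, count = line.split(":group_count=")
            records.append(Record(tag, int(count), []))
        elif line.startswith("group_size="):
            pending_size = int(line.split("=", 1)[1])
        else:
            # Parameter line: must follow a header and a group_size line.
            if not records or pending_size is None:
                raise ValueError(f"unexpected parameter line: {line!r}")
            tokens = line.split()
            post_ops = tokens[11].split(",") if len(tokens) > 11 else []
            records[-1].groups.append(
                Group(pending_size, tokens[:5],
                      [int(t) for t in tokens[5:11]], post_ops))
            pending_size = None
    return records
```

Calling `parse_bench_input(open(path).read())` on this file would yield one Record per header line, each holding a single Group.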