GEMV support for S8S8S32O32 Symmetric Quantization

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-04-20 07:38:53 +00:00

Introduced support for GEMV operations with group-level symmetric quantization for the S8S8S32032 API.

Framework Changes:
- Added macro definitions and function prototypes for GEMV with symmetric quantization in lpgemm_5loop_interface_apis.h and lpgemm_kernels.h.
  - LPGEMV_M_EQ1_KERN2 for the lpgemv_m_one_s8s8s32os32_sym_quant kernel, and
  - LPGEMV_N_EQ1_KERN2 for the lpgemv_n_one_s8s8s32os32_sym_quant kernel.
- Implemented the main GEMV framework for symmetric quantization in lpgemm_s8s8s32_sym_quant.c.

Kernel Changes:
- lpgemv_m_one_s8s8s32os32_sym_quant for handling the case where M = 1 and implemented in lpgemv_m_kernel_s8_grp_amd512vnni.c.
- lpgemv_n_one_s8s8s32os32_sym_quant for handling the case where N = 1 and implemented in lpgemv_n_kernel_s8_grp_amd512vnni.c.
- Updated the buffer reordering logic for group quantization for N=1 cases in aocl_gemm_s8s8s32os32_utils.c.

Notes
- Ensure that group_size is a factor of both K (and KC when K > KC).
- The B matrix must be provided in reordered format (mtag_b == REORDERED).

AMD-Internal: [SWLCSG-3604]

This commit is contained in:

Sharma, Arnav

2025-08-14 13:41:25 +05:30

committed by

GitHub

parent 3a14417ce1

commit 76c4872718

6 changed files with 3280 additions and 142 deletions

1291

kernels/zen4/lpgemm/s8s8s32/lpgemv_m_kernel_s8_grp_amd512vnni.c Normal file

View File

File diff suppressed because it is too large Load Diff

1738

kernels/zen4/lpgemm/s8s8s32/lpgemv_n_kernel_s8_grp_amd512vnni.c Normal file

View File

File diff suppressed because it is too large Load Diff

GEMV support for S8S8S32O32 Symmetric Quantization

1291 kernels/zen4/lpgemm/s8s8s32/lpgemv_m_kernel_s8_grp_amd512vnni.c Normal file View File

1738 kernels/zen4/lpgemm/s8s8s32/lpgemv_n_kernel_s8_grp_amd512vnni.c Normal file View File

1291

kernels/zen4/lpgemm/s8s8s32/lpgemv_m_kernel_s8_grp_amd512vnni.c Normal file

View File

1738

kernels/zen4/lpgemm/s8s8s32/lpgemv_n_kernel_s8_grp_amd512vnni.c Normal file

View File