Files
blis/bench
Meghana Vankadari eb5ab3f762 LPGEMM: Added transB support for bf16bf16f32o<bf16|f32> APIs
Details:
- Modified aocl_get_reorder_buf_size_ and aocl_reorder_ APIs
  to allow reordering from column major input matrix.
- Added new pack kernels that packs/reorders B matrix from
  column-major input format.
- Updated Early-return check conditions to account for trans
  parameters.
- Updated bench file to test/benchmark transpose support.

AMD-Internal: [CPUPL-2268]
Change-Id: Ida66d7e3033c52cca0229c6b78d16976fbbecc4c
2023-10-12 23:36:18 +05:30
..
2022-09-20 06:05:01 -04:00
2022-07-25 15:38:30 +00:00
2021-06-04 17:45:04 +05:30
2021-05-19 14:05:01 +05:30
2021-06-08 11:54:55 +05:30