mirror of
https://github.com/amd/blis.git
synced 2026-05-11 01:30:00 +00:00
Added experimental bli_gemm_ker_var5().
Details:
- Added support for an experimental gemm macro-kernel that incrementally packs one micro-panel of B at a time. This is useful for certain special cases of gemm where m is small.
- Minor changes to default values of clarksville configuration.
- Defined BLIS_PACKED_BLOCKS as part of pack_t type, even though we do not yet have any use (or implementation support) for block storage.
- Comment update to bli_packm_init.c.
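For readers unfamiliar with the idea, the incremental packing described in the first item can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code of bli_gemm_ker_var5(): the names pack_b_micropanel and gemm_incremental_pack, the micro-panel width NR = 4, the row-major layout, and the double-only datatype are all hypothetical choices made for the example.

```c
#include <stddef.h>

/* Hypothetical micro-panel width (columns of B packed at a time). */
enum { NR = 4 };

/* Copy columns [j0, j0+NR) of the row-major k x n matrix b into the
 * contiguous k x NR buffer bp, zero-padding past the right edge. */
static void pack_b_micropanel(size_t k, size_t n, const double *b,
                              size_t j0, double *bp)
{
    for (size_t p = 0; p < k; ++p)
        for (size_t jj = 0; jj < NR; ++jj)
            bp[p * NR + jj] = (j0 + jj < n) ? b[p * n + j0 + jj] : 0.0;
}

/* C (m x n) += A (m x k) * B (k x n), all row-major.
 * bp is caller-provided workspace holding at least k * NR doubles.
 * Only one micro-panel of B is packed at any time; it is then reused
 * across all m rows of A before the next micro-panel is packed. */
static void gemm_incremental_pack(size_t m, size_t n, size_t k,
                                  const double *a, const double *b,
                                  double *c, double *bp)
{
    for (size_t j0 = 0; j0 < n; j0 += NR) {
        /* Number of valid columns in this (possibly partial) panel. */
        size_t nr = (n - j0 < NR) ? (n - j0) : NR;

        pack_b_micropanel(k, n, b, j0, bp);

        for (size_t i = 0; i < m; ++i) {
            double acc[NR] = { 0.0 };

            /* Rank-1 updates against the packed micro-panel. */
            for (size_t p = 0; p < k; ++p) {
                double aip = a[i * k + p];
                for (size_t jj = 0; jj < NR; ++jj)
                    acc[jj] += aip * bp[p * NR + jj];
            }

            /* Write back only the valid columns. */
            for (size_t jj = 0; jj < nr; ++jj)
                c[i * n + j0 + jj] += acc[jj];
        }
    }
}
```

In this sketch the packing workspace is k * NR elements rather than a full packed copy of B, and each packed micro-panel is reused only m times. That suggests why such a scheme might suit small m: packing all of B up front buys little reuse, while a one-panel buffer keeps the working set small.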
@@ -243,6 +243,12 @@
 #define bli_cndup BLIS_DEFAULT_NUM_DUPL_C
 #define bli_zndup BLIS_DEFAULT_NUM_DUPL_Z
 
+// Incremental packing factors
+
+#define bli_snifac BLIS_DEFAULT_NI_FAC
+#define bli_dnifac BLIS_DEFAULT_NI_FAC
+#define bli_cnifac BLIS_DEFAULT_NI_FAC
+#define bli_znifac BLIS_DEFAULT_NI_FAC
 
 
 #endif