mirror of
https://github.com/amd/blis.git
synced 2026-05-11 09:39:59 +00:00
Details: - Imported the 24x16 knl sgemm microkernel (and its corresonding spackm kernel) from TBLIS and enabled its use in the knl sub-config. Also Added sgemm microkernel prototype to bli_kernels_knl.h. - Updated dgemm and dpackm microkernels from TBLIS, which included an important change regarding the offsets array (changed from extern declaration to static declaration/definition). - Activated use of level-1v and -1f zen kernels in skx and knl sub-configs. - Removed some old macros no longer needed in bli_family_skx.h now that libmemkind support exists in configure. - Moved bli_avx512_macros.h to frame/include and adjusted #includes in skx and knl kernels accordingly. - Moved unused kernels in kernels/knl/3 to kernels/knl/3/other directory. - Fixed a minor bug in the 'make' output per compile when verboseness is not turned on. The rule-generating function 'make-kernel-rule' was previously passing in the name of the config, rather than the name of the kernel set returned by get-config-for-kset, which could give misleading information to the user when the kconfig_map mapped a kernel set to a sub-configuration that did not share the same name. (This didn't affect the CFLAGS that were actually used.) - Updated test/3m4m/Makefile, removing acml targets and renaming the remaining targets.
38 lines
911 B
Plaintext
38 lines
911 B
Plaintext
#
|
|
# config_registry
|
|
#
|
|
# Please refer to the BLIS wiki on configurations for information on the
|
|
# syntax and semantics of this file [1].
|
|
#
|
|
# [1] https://github.com/flame/blis/wiki/ConfigurationHowTo
|
|
#
|
|
|
|
# Processor families.
|
|
x86_64: intel64 amd64
|
|
intel64: skx knl haswell sandybridge penryn generic
|
|
amd64: zen excavator steamroller piledriver bulldozer generic
|
|
arm64: cortexa57 generic
|
|
arm32: cortexa15 cortexa9 generic
|
|
|
|
# Intel architectures.
|
|
skx: skx/skx/zen
|
|
knl: knl/knl/zen
|
|
haswell: haswell/haswell/zen
|
|
sandybridge: sandybridge
|
|
penryn: penryn
|
|
|
|
# AMD architectures.
|
|
zen: zen
|
|
excavator: excavator/piledriver
|
|
steamroller: steamroller/piledriver
|
|
piledriver: piledriver
|
|
bulldozer: bulldozer
|
|
|
|
# ARM architectures.
|
|
cortexa57: cortexa57/armv8a
|
|
cortexa15: cortexa15/armv7a
|
|
cortexa9: cortexa9/armv7a
|
|
|
|
# Generic architectures.
|
|
generic: generic
|