Mirror of https://github.com/amd/blis.git (synced 2026-05-12 18:15:37 +00:00)
Details:
- The pack and compute extension APIs derive blocksizes (MR, NR, ...) from the SUP cntx.
- SUP blocksizes are not set for the generic/skx configs. As a result, the pack and compute APIs cause floating point exceptions.
- To fix these issues, we have enabled non-zero SUP blocksizes for the generic config and zen4 SUP blocksizes for the skx config.
- However, these changes will not enable the SUP path for the skx/generic configs, as the thresholds are set to zero.
- Enabling the SUP path for the skx config needs more work, such as non-zero thresholds and modifications to the build system.

Change-Id: I54483ab0c196845ca175b8cb8deeb9e9ac2a42b9
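The failure mode described above can be illustrated with a minimal sketch. It assumes, without confirmation from the commit, that the exception comes from dividing a matrix dimension by an unset (zero) blocksize. In C on x86, integer division by zero raises SIGFPE, which shells report as a "floating point exception"; Python raises `ZeroDivisionError` instead. The function name and the blocksize values are hypothetical and are not taken from BLIS.

```python
def panel_count(dim: int, blocksize: int) -> int:
    """Number of blocksize-wide panels needed to cover dim (ceiling division)."""
    return (dim + blocksize - 1) // blocksize


# Hypothetical blocksize tables; the values are illustrative only.
unset_sup = {"MR": 0, "NR": 0}    # SUP blocksizes never initialized
nonzero_sup = {"MR": 6, "NR": 8}  # any non-zero values avoid the fault


def describe(n: int, blocksizes: dict) -> str:
    """Report how many NR-wide panels cover n columns, or why that fails."""
    try:
        return f"{panel_count(n, blocksizes['NR'])} panels"
    except ZeroDivisionError:
        # The C analogue of this branch is a SIGFPE that kills the process.
        return "division by zero (SIGFPE in C)"


if __name__ == "__main__":
    print(describe(100, nonzero_sup))
    print(describe(100, unset_sup))
```

Under this assumption, setting any non-zero blocksize removes the fault without enabling the SUP path itself, which matches the commit's note that the thresholds stay at zero.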
52 lines
1.4 KiB
Plaintext
#
# config_registry
#
# Please refer to the BLIS wiki on configurations for information on the
# syntax and semantics of this file [1].
#
# [1] https://github.com/flame/blis/blob/master/docs/ConfigurationHowTo.md
#

# Processor families.
x86_64: intel64 amdzen amd64_legacy
intel64: skx knl haswell sandybridge penryn generic
amd64_legacy: excavator steamroller piledriver bulldozer generic
amdzen: zen4 zen3 zen2 zen generic

# NOTE: ARM families will remain disabled until runtime hardware detection
# logic is added to BLIS.
#arm64: cortexa57 generic
#arm32: cortexa15 cortexa9 generic

# Intel architectures.
skx: skx/skx/haswell/zen/zen4
knl: knl/knl/haswell/zen
haswell: haswell/haswell/zen
sandybridge: sandybridge
penryn: penryn

# AMD architectures.
zen4: zen4/zen4/skx/zen3/zen2/zen/haswell
zen3: zen3/zen3/zen2/zen/haswell
zen2: zen2/zen2/zen/haswell
zen: zen/zen/haswell
excavator: excavator/piledriver
steamroller: steamroller/piledriver
piledriver: piledriver
bulldozer: bulldozer

# ARM architectures.
thunderx2: thunderx2/armv8a
cortexa57: cortexa57/armv8a
cortexa53: cortexa53/armv8a
cortexa15: cortexa15/armv7a
cortexa9: cortexa9/armv7a

# IBM architectures.
power10: power10
power9: power9
bgq: bgq

# Generic architectures.
generic: generic
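
The registry syntax shown in the file can be read with a small parser. The sketch below is not part of BLIS or its build system. It assumes the reading given in the ConfigurationHowTo document: in a singleton entry such as `skx: skx/skx/haswell/zen/zen4`, the first slash-separated name is the sub-configuration and the remaining names are the kernel sets it needs, and an entry without a slash uses the kernel set named after itself. Treating any line with more than one whitespace-separated member as a family is a simplifying heuristic of this sketch. The function name `parse_registry` is hypothetical.

```python
def parse_registry(text: str):
    """Parse config_registry text into (families, configs).

    families: family name -> list of member configurations
    configs:  config name -> (sub-configuration, list of kernel sets)
    """
    families = {}
    configs = {}
    for raw in text.splitlines():
        # Drop comments (including commented-out entries) and blank lines.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        name, _, rhs = line.partition(":")
        name = name.strip()
        items = rhs.split()
        if len(items) > 1:
            # Heuristic: several members means a processor family.
            families[name] = items
        elif items:
            parts = items[0].split("/")
            # Without explicit kernel sets, fall back to the config's own name.
            configs[name] = (parts[0], parts[1:] or [parts[0]])
    return families, configs


SAMPLE = """\
# Processor families.
amdzen: zen4 zen3 zen2 zen generic
#arm64: cortexa57 generic

# Intel architectures.
skx: skx/skx/haswell/zen/zen4
penryn: penryn
"""

if __name__ == "__main__":
    families, configs = parse_registry(SAMPLE)
    print(families["amdzen"])
    print(configs["skx"])
    print(configs["penryn"])
```

Under that reading, the `zen4` entry at the end of the `skx` line is what makes the zen4 kernel set available to the skx configuration, in line with the commit's use of zen4 SUP blocksizes for skx.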