mirror of
https://github.com/amd/blis.git
synced 2026-04-19 23:28:52 +00:00
Details: - Added a 'template' configuration, which contains stub implementations of the level 1, 1f, and 3 kernels with one datatype implemented in C for each, with lots of in-file comments and documentation. - Modified some variable/parameter names for some 1/1f operations. (e.g. renaming vector length parameter from m to n.) - Moved level-1f fusing factors from axpyf, dotxf, and dotxaxpyf header files to bli_kernel.h. - Modifed test suite to print out fusing factors for axpyf, dotxf, and dotxaxpyf, as well as the default fusing factor (which are all equal in the reference and template implementations). - Cleaned up some sloppiness in the level-1f unb_var1.c files whereby these reference variants were implemented in terms of front-end routines rather that directly in terms of the kernels. (For example, axpy2v was implemented as two calls to axpyv rather than two calls to AXPYV_KERNEL.) - Changed the interface to dotxf so that it matches that of axpyf, in that A is assumed to be m x b_n in both cases, and for dotxf A is actually used as A^T. - Minor variable naming and comment changes to reference micro-kernels in frame/3/gemm/ukernels and frame/3/trsm/ukernels.
14 lines
930 B
Plaintext
14 lines
930 B
Plaintext
1 # Number of repeats per experiment (where best result is reported)
|
|
c #rg # Matrix storage scheme(s) to test ('c' = col-major; 'r' = row-major; 'g' = general stride)
|
|
c #rji # Vector storage scheme(s) to test ('c' = colvec/unit; 'r' = rowvec/unit; 'j' = colvec/non-unit; 'i' = rowvec/non-unit)
|
|
0 # Test all combinations of storage schemes?
|
|
32 # General stride spacing (for cases when testing general stride)
|
|
sdcz #sdcz # Datatype(s) to test
|
|
100 # Problem size: first to test
|
|
300 # Problem size: maximum to test
|
|
100 # Problem size: increment between experiments
|
|
1 # Error-checking level (0 = disable error checking; 1 = full error checking)
|
|
i # Reaction to test failure ('i' = ignore; 's' = sleep() and continue; 'a' = abort)
|
|
0 # Output results in matlab/octave format (0 = output without formatting)
|
|
0 # Output results to stdout AND files (0 = output only to stdout)
|