Files
blis/frame/include/level0
Field G. Van Zee 375eb30b0a Added mixed-precision support to 1m method.
Details:
- Lifted the constraint that 1m only be used when all operands' storage
  datatypes (along with the computation datatype) are equal. Now, 1m may
  be used as long as all operands are stored in the complex domain. This
  change largely consisted of adding the ability to pack to 1e and 1r
  formats from one precision to another. It also required adding logic
  for handling complex values of alpha to bli_packm_blk_var1_md()
  (similar to the logic in bli_packm_blk_var1()).
- Fixed a bug in several virtual microkernels (bli_gemm_md_c2r_ref.c,
  bli_gemm1m_ref.c, and bli_gemmtrsm1m_ref.c) that resulted in the wrong
  ukernel output preference field being read. Previously, the preference
  for the native complex ukernel was being read instead of the pref for
  the native real domain ukernel. This bug would not manifest if the
  preference for the native complex ukernel happened to be equal to that
  of the native real ukernel.
- Added support for testing mixed-precision 1m execution via the gemm
  module of the testsuite.
- Tweaked/simplified bli_gemm_front() and bli_gemm_md.c so that pack
  schemas are always read from the context, rather than trying to
  sometimes embed them directly to the A and B objects. (They are still
  embedded, but now uniformly only after reading the schemas from the
  context.)
- Redefined cpp macro bli_l3_ind_recast_1m_params() as a static function
  and renamed to bli_gemm_ind_recast_1m_params() (since gemm is the only
  consumer).
- Added 1m optimization logic (via bli_gemm_ind_recast_1m_params()) to
  bli_gemm_ker_var2_md().
- Added explicit handling for beta == 1 and beta == 0 in the reference
  gemm1m virtual microkernel in ref_kernels/ind/bli_gemm1m_ref.c.
- Rewrote various level-0 macro defs, including axpyris, axpbyris,
  scal2ris, and xpbyris (and their conjugating counterparts) to
  explicitly support three operand types and updated invocations to
  xpbyris in bli_gemmtrsm1m_ref.c.
- Query and use the storage datatype of the packed object instead of the
  storage datatype of the source object in bli_packm_blk_var1().
- Relocated and renamed frame/ind/misc/bli_l3_ind_opt.h to
  frame/3/gemm/ind/bli_gemm_ind_opt.h.
- Various whitespace/comment updates.
2018-12-03 17:49:52 -06:00
..
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00
2018-08-29 18:06:41 -05:00