Field G. Van Zee fcc10054a1 Tweaks to gemm4m, gemm3m virtual ukernels.
Details:
- Fixed a potential, but as-yet unobserved bug in gemm3m that would
  allow undesirable inf/NaN propogation, since C was being scaled by
  beta even if it was equal to zero.
- In gemm3m micro-kernel, we now avoid copying C to the temporary
  micro-tile if beta is zero.
- Rearranged computation in gemm4m so that the temporary C micro-tile
  is accessed less, and C is accessed only after the micro-kernel
  calls. This improves performance marginally in most situations.
- Comment updates to both gemm4m and gemm3m micro-kernels.
2014-08-13 12:32:06 -05:00
2014-08-07 13:21:15 -05:00
2014-07-13 22:50:56 -07:00
2014-08-04 16:01:59 -05:00
2014-07-14 16:14:33 -05:00
2012-12-17 12:35:54 -06:00
2014-08-04 16:01:58 -05:00

BLIS framework
README
---

Thank you for deciding to try out the BLIS framework!

BLIS is a portable framework for instantiating BLAS-like libraries. The
framework was designed to isolate essential kernels of computation that,
when optimized, immediately enable optimized implementations of most of
its commonly used and computationally intensive operations.

BLIS has many features. For more detailed information about the project,
please check the BLIS homepage:

  http://code.google.com/p/blis/

You can keep in touch with developers and other users of the project by
joining one or more of the following mailing lists:

  o blis-announce - http://groups.google.com/group/blis-announce 
    Used only for announcements and other important messages regarding
    BLIS.

  o blis-discuss - http://groups.google.com/group/blis-discuss
    Please join and post to this mailing list if you have general questions
    or feedback regarding BLIS. Application developers (end users) should
    probably post here.

  o blis-devel - http://groups.google.com/group/blis-devel
    Please join and post to this mailing list if you are a BLIS developer
    (i.e., you are trying to use BLIS to create libraries, you want to
    write kernels for the framework, or you are trying to modify or extend
    the framework itself).

Also, please read the LICENSE file for information on copying and
distributing this software.

For a step-by-step guide on configuring, compiling, and installing BLIS,
please read the INSTALL file. Also, please check the BLIS website's wiki
page for other useful how-to guides.

Thanks again for your interest in BLIS!

Regards,

Field G. Van Zee
field@cs.utexas.edu

Description
BLAS-like Library Instantiation Software Framework
Readme BSD-3-Clause 71 MiB
Languages
C 86.2%
C++ 9.7%
Fortran 1.9%
Makefile 0.8%
MATLAB 0.4%
Other 0.9%