CHANGELOG update (0.1.4)

This commit is contained in:
Field G. Van Zee
2014-07-27 18:20:13 -05:00
parent a7537071b1
commit af7a8e6c04

403
CHANGELOG
View File

@@ -1,4 +1,403 @@
commit 036cc634918463b1caa0fd89c9a211f2f5639af7 (HEAD, tag: 0.1.3, master)
commit a7537071b152ecff671f8716595d37dc09e4fd51 (HEAD, tag: 0.1.4, master)
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Sun Jul 27 18:20:12 2014 -0500
Version file update (0.1.4)
commit acff74041bf02c7b9fdfa24b507bca782a4c5fce (origin/master)
Merge: cdb9413 47b243e
Author: Tyler Smith <tms@cs.utexas.edu>
Date: Wed Jul 23 15:07:30 2014 -0500
Merge branch 'master' of https://github.com/flame/blis
commit cdb9413e140f8a198666250ec88fa34b5425a9c3
Author: Tyler Smith <tms@cs.utexas.edu>
Date: Wed Jul 23 15:05:15 2014 -0500
Enabled threading for a couple more loops in TRSM
JC loop is now enabled for the left-sided case
IC loop is now enabled for the right-sided case
commit 47b243ef08f4101de3d936f2373343e67eaa4dd5
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Wed Jul 23 13:41:13 2014 -0500
Call setid for early return from herk/her2k.
Details:
- Added setid call (to zero imaginary parts of diagonal elements) to
early return branches of herk_front() and her2k_front() for cases
where alpha is zero. Thanks to Murtaza Ali for suggesting this fix.
- Comment update.
commit 3e7b0db5b0e24f5fd66c60bacabc019885ddbec5
Merge: 2f8a357 ed3e33d
Author: Tyler Smith <tms@cs.utexas.edu>
Date: Wed Jul 23 13:40:44 2014 -0500
Merge branch 'master' of https://github.com/flame/blis
commit 2f8a357de5fb55163a969d888cf059f24b78125c
Author: Tyler Smith <tms@cs.utexas.edu>
Date: Wed Jul 23 13:40:12 2014 -0500
Some TRSM threading fixes/additions
commit ed3e33d548047be3283ff41268fdf716563bc542
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Tue Jul 22 14:40:43 2014 -0500
Tweaked behavior of herk, her2k for BLAS compat.
Details:
- Updated herk_front() and her2k_front() to explicitly set the imaginary
components of the diagonal entries of C to zero after the computation
is complete. This is needed in case downstream applications read the
full diagonal entries (i.e., including imaginary part), which could, in
the absence of this modification, accumulate numerical error from
subsequent rank-k/rank-2k updates.
- Updated BLAS compatibility wrappers for herk and her2k to return early
if:
n == 0 || ( ( alpha == 0 || k == 0 ) && beta == 1 )
This also results in the imaginary components of diagonal entries NOT
being set to zero (see above), which is consistent with BLAS.
- Updated mkherm to use setid instead of an inlined loop over the
diagonal.
commit ea59a5c93cde1467a3715abc53dda4aecf961873
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Tue Jul 22 14:36:02 2014 -0500
Added new level-1d operation: setid.
Details:
- Defined a new level-1d operation, setid, which sets the imaginary
elements of an object's diagonal to a single scalar. This can be
useful, for example, when trying to make the diagonal of a Hermitian
matrix real-valued.
commit 8965a965931318619ceaebd7c32edccf3022d0c7
Merge: 1785efb 5b73e80
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Tue Jul 22 14:34:32 2014 -0500
Merge branch 'master' of github.com:flame/blis
commit 1785efb5420bc7b9c850a068cb5d99837071e877
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Tue Jul 22 14:33:01 2014 -0500
Minor improvements to invertd and setd.
Details:
- Added missing call to invertd_check() from front-end.
- Changed setd front-end call of scald_check() to setd_check().
commit 5b73e80b71c054c1945a06aff044ef629bc1a9a0
Merge: a41e68e 20690fe
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Fri Jul 18 12:21:20 2014 -0500
Merge pull request #16 from Maratyszcza/emscripten
Emscripten port
commit a41e68e09e73b999fab0bb430a43dccfc63aab45
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Thu Jul 17 13:25:56 2014 -0500
Reimplemented BLIS initialization/finalization.
Details:
- Rewrote bli_init() and bli_finalize() with OpenMP critical sections
for thread-safety. Also added lots of explanatory comments.
- Renamed bli_init_safe() and bli_finalize_safe() with the _auto()
suffix, and reimplemented for simplicity. Updated all invocations
in BLAS compatibility layer to use _auto() suffix.
commit 36358948ea75074bda32a9f8c008f835b87d21db
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Thu Jul 17 10:58:10 2014 -0500
Retired frame/3/gemm/other directory.
Details:
- Removed frame/3/gemm/other directory, which contained some outdated
and/or experimental variants.
commit c73261f17edf589e76bdbe297702a1fbbd69275f
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jul 14 16:23:51 2014 -0500
More minor cleanups post-copyright update.
commit 2a09d24463d358be6243b24f112fad057c2aefe0
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jul 14 16:17:09 2014 -0500
Reverted power7 symlinks destroyed by sed script.
Details:
- Reverted two symlinks, in kernels/power7/3/test, back to being symlinks
after recursive-sed.sh mistakenly replaced them with copies of the
actual files to which they referred. Meant to include this in previous
commit.
commit 7ed415824d3b2e78541b6f64e404ca5347c06d3d
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jul 14 16:14:33 2014 -0500
Updated copyright headers (continued).
Details:
- Inserted "at Austin" into third clause of license declarations.
Meant to include this change in previous commit.
commit 5c2c6c85616834ff2716ece083118201d9df6dde
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jul 14 16:05:03 2014 -0500
Updated copyright headers to contain "at Austin".
Details:
- Updated copyright headers to include "at Austin" in the name of the
University of Texas.
- Updated the copyright years of a few headers to 2014 (from 2011 and
2012).
commit fcec68cda3f6e90ae055e7304e6674c1c5c8d010
Merge: 94c0df7 4a20ed1
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jul 14 11:35:34 2014 -0500
Merge branch 'master' of github.com:flame/blis
commit 94c0df797eda377931f29a41ba6a89c0ed58daca
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jul 14 11:24:36 2014 -0500
Changed order of zero dim / error checking.
Details:
- Updated level-2 and level-3 internal back-ends so that the operation's
_check() function is called BEFORE any attempt to return early due to
the presence of zero dimensions. This ordering makes more sense because
(for example) object dimensions should match even if one of them is
zero. Previously, a dimension mismatch could result in an early return
with no error message.
- Updated bli_check_object_buffer() so that NULL buffers result in an
error only if the object is dimensionally non-empty (i.e., only if both
of the object's dimensions are non-zero). This allows BLIS operations
to be performed on dimensionally empty objects (i.e., where at least one
dimension is zero).
- Updated the error message associated with bli_check_object_buffer()
to mention the newly relaxed constraint mentioned above, vis-a-vis
non-zero dimensions.
commit 20690fe3018ce17c8df61ce0bffecaa7911dc3a5
Author: Marat Dukhan <maratek@gmail.com>
Date: Sun Jul 13 22:50:56 2014 -0700
Emscripten port
commit 4a20ed1a3f5e9e5232df30aa0e568e6c00c56ce1
Merge: 6a515e9 8ccdfae
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Sun Jul 13 17:45:01 2014 -0500
Merge pull request #14 from Maratyszcza/master
Support "make test" for PNaCl configuration
commit 6a515e988f2ae1628258a6dec2c0e9cf2d04790f
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Sun Jul 13 17:38:33 2014 -0500
Implemented dsdot() and sdsdot() in compat layer.
Details:
- Replaced "not yet implemented" error messages in dsdot() and sdsdot()
with actual implementations. (These routines are so rarely used that
this log message will probably lead to some people learning of their
existence for the first time.)
commit 255668ddd1004552c6cc65035ec6486671ce99bb
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Sun Jul 13 17:30:44 2014 -0500
Inserted gemv beta-scaling bug into compat layer.
Details:
- BLAS has a peculiar bug (or feature) whereby calling gemv on a vector
y of non-zero length and a vector x of zero length results in no action.
Given that the operation is y := beta*y + A*x, many (most?) individuals
would expect vector y to still be scaled by beta. BLIS, when called
natively, handles these cases intuitively (with beta scaling).
Unfortunately, many BLAS test suites actually check for the way this
situation is handled. Therefore, we have decided to implement this "bug"
in the compatibility layer so as to provide "bug-for-bug" compatibility
with BLAS.
commit 570a154581bdb353fa13a219c7cb3c81d3dceffd
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Sat Jul 12 17:51:05 2014 -0500
Comment/formatting updates to build scripts.
Details:
- Minor updates to comments and formatting in bump-version.sh and
update-version-file.sh scripts.
commit 26cd81990631ff799791629206e068126ff9e3a1
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Thu Jul 10 13:16:07 2014 -0500
Added bli_info_*() query functions.
Details:
- Added a new API family, bli_info_*(), which can be used to query
information about how BLIS was configured. Most of these values are
returned as gint_t, with the exception of the version string which
is char*.
- Changed how the testsuite driver queries information about how BLIS
was configured (from using macro constants directly to using the
new bli_info API).
- Removed bli_version.c and its header file.
- Added STRINGIFY_INT() macro to bli_macro_defs.h
- Renamed info_t type in bli_type_defs.h to objbits_t (not because of
an actual naming conflict, but because the name 'info_t' would now be
somewhat misleading in the presence of the new bli_info API, as the
two are unrelated).
commit 970b43141697d8c31a033f59513bb59d7cc78ab0
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Thu Jul 10 09:30:00 2014 -0500
Minor bugfixes to BLAS compatibility layer.
Details:
- Changed bla_amax.c so that i?amax() routines now correctly return 0
if ( n < 1 || incx <= 0 ).
- Changed bla_rotg.c and bla_rotmg.c to use bli_fabs() macro instead of
f2c's abs() macro for float and double cases.
- Thanks to Murtaza Ali for suggesting the two fixes above.
- Updated label of fnormv to normfv in testsuite/input.operations.
commit 8ccdfaef4c42ad8957af8607a1a9ee29b9277d4b
Author: Marat Dukhan <maratek@gmail.com>
Date: Tue Jul 8 23:14:36 2014 -0700
Replicated logic from testsuite/Makefile in top-level Makefile to support make test
commit caa6507ff3724c80d60987f309b8bbc5b50a9841
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Tue Jul 8 10:25:27 2014 -0500
Minor cleanup to standalone test drivers.
Details:
- Very minor code changes to standalone test drivers in 'test' directory.
- Added *.so files to '.gitignore'.
commit 6c65e9a58fe55990ebb99ec3986443e18af35338
Merge: cb12e45 daca500
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Tue Jul 8 10:13:49 2014 -0500
Merge branch 'master' of github.com:flame/blis
commit cb12e456f94c196c093e52f02a7cbca0032fc86e
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Tue Jul 8 10:07:46 2014 -0500
Fixed possible level-3 inf/NaN issue when beta=0.
Details:
- Redefined xpbys_mxn and xpbys_mxn_u/_l macros to employ a copy
(instead of scaling by beta) when beta is zero. This will stamp out
any possible infs or NaNs in the output matrix, if it happens to be
uninitialized. Thanks to Tony Kelman for isolating this bug.
commit daca500db5e2448ba0da8047b75eb0f88d9f40e3
Merge: ab3bc91 4702350
Author: Tyler Smith <tms@cs.utexas.edu>
Date: Thu Jul 3 12:52:52 2014 -0500
Merge branch 'master' of http://github.com/flame/blis
commit 4702350278af31f662b458127777dd4d85a3192f
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Thu Jul 3 11:48:23 2014 -0500
Defined _ukernel_void() wrappers to micro-kernels.
Details:
- Added wrappers for micro-kernels so that users may invoke the
micro-kernels without knowing what the function names actually are.
This is useful when an application wishes to call the micro-kernel
from a shared library instance of BLIS, where the application may not
necessarily have the luxury of grabbing the micro-kernel name(s) from
C preprocessor macros at compile-time. Also, since the wrappers use
void* pointers, one's environment does not need to be aware of some
BLIS types such as scomplex and dcomplex. These wrappers now join the
level-1 and level-1f kernel wrappers, which pre-dated this commit.
- Removed the wrapper definitions and prototypes from the micro-kernel
test suite modules, and replaced calls to them with calls to the new
wrappers mentioned above.
commit ab3bc9153b914fbaf259e15b66c91d628e7c8661
Author: Tyler Smith <tms@cs.utexas.edu>
Date: Thu Jul 3 11:19:43 2014 -0500
Fixed a bug for TRSM when BLIS_ENABLE_MULTITHREADING is not set but the multithreading environment variables are turned on
commit b8134b720b985783ee6a582a3eb5d6c51f00d051
Author: Tyler Smith <tms@cs.utexas.edu>
Date: Wed Jul 2 16:02:39 2014 -0500
Quick and dirty multithreading for TRSM
Should work fine for small number of threads (up to 8 or maybe even 16).
However, performance is yet untested.
This parallelizes the "JR" loop for the left sided cases
and the "IR" loop for the right sided cases.
Future work is to parallelize the outer loops as well.
commit e8ef69692831db07ddbe9485a5e504ac3f03e496
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Wed Jul 2 14:59:27 2014 -0500
Added shared library support to build system.
Details:
- Modified top-level Makefile to support building shared (dynamic)
libraries.
- Updated most configurations' make_defs.mk files to include necessary
compiler/linker flags needed by top-level Makefile.
- Note that by default, all configurations presently do NOT build
shared libraries. To enable, one must change the value of
BLIS_ENABLE_DYNAMIC_BUILD to 'yes'.
commit b80df0f2cffb015da02e70a82b8512da9891ab67
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jun 23 13:52:39 2014 -0500
Added bump-version.sh script to 'build' directory.
Details:
- Added a bash script, bump-version.sh, to aid in incrementing the BLIS
version string.
commit 9ef1f1e21d083697fc730e48d7d9169c201f3da2
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jun 23 13:48:17 2014 -0500
CHANGELOG update (0.1.3)
commit 036cc634918463b1caa0fd89c9a211f2f5639af7 (tag: 0.1.3)
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jun 23 13:48:17 2014 -0500
@@ -26,7 +425,7 @@ Date: Mon Jun 23 10:42:29 2014 -0500
Removed 'version' from .gitignore file.
commit b40dcefc5ee31f67aa3990e2e9d2ef8ed1386a25 (origin/master)
commit b40dcefc5ee31f67aa3990e2e9d2ef8ed1386a25
Merge: 7101a8e b693b0c
Author: Field G. Van Zee <field@cs.utexas.edu>
Date: Mon Jun 23 10:39:05 2014 -0500