amd/blis - blis - Public git mirror

amd/blis

mirror of https://github.com/amd/blis.git synced 2026-04-13 12:30:23 +00:00

Author	SHA1	Message	Date
Chandrashekara K R	ae698be825	Updated version string from 5.0.1 to 5.1.1	2025-06-13 11:24:50 +05:30
Chandrashekara K R	cfbe202868	Updated version string from 4.2.1 to 5.0.1 Change-Id: I4cbd8d9ae7e35fa235a6707fe7ddbd157eb63b98 (cherry picked from commit `7f1824b8ee`)	2024-10-17 03:07:17 -04:00
Chandrashekara K R	d0f890e8d5	Updated version string from 4.1.1 to 4.2.1 Change-Id: I18ff0043a2269269f251078a6ff7c51e70618b6e	2024-03-12 02:07:58 -04:00
Chandrashekara K R	248d09c722	Version String Update AOCL-BLIS: Updated version string to AOCL-BLIS 4.1.1 Build <YYMMDD> Change-Id: Iced62a66d0859b3c7d4bcfe6f0e0527922e41cae	2023-08-08 07:27:41 -04:00
Chandrashekara K R	5f1ea3246a	Updated blis library version string to 4.0.1 Change-Id: I1f3eb9081bcc05ccec300e00860d905b23d9709a	2022-11-24 10:35:34 +05:30
Chandrashekara K R	fde812015f	Updated blis library version from 4.0 to 3.2.1 AMD-Internal: [CPUPL-2322] Change-Id: I3a6a61543dd2754e2590d7f5f22442c9fdeaee95	2022-07-29 15:55:10 +05:30
Dipal M Zambare	16de63c818	Updated version and copyright notice. Changed AMD-BLIS version to 3.1.2 AMD-Internal: [CPUPL-2111] Change-Id: Id8fc3fbc112f08bd5e5def646c472047352e65b5	2022-05-17 18:10:39 +05:30
Dipal M. Zambare	d3b22f590f	Updated version number to 3.2 Change-Id: Iea5712d8cb854d4eaffea510e0fe2d9657e4d21f	2022-05-17 18:08:57 +05:30
Dipal M Zambare	31921b9974	Updated windows build system to define BLIS_CONFIG_EPYC flag. All AMD specific optimization in BLIS are enclosed in BLIS_CONFIG_EPYC pre-preprocessor, this was not defined in CMake which are resulting in overall lower performance. Updated version number to 3.1.1 Change-Id: I9848b695a599df07da44e77e71a64414b28c75b9	2022-05-17 18:03:09 +05:30
Dipal M Zambare	f698afc567	Updated version number to 3.1.0 AMD-Internal: [CPUPL-1811] Change-Id: I6b485e7622e526791094ae621d9f84d2526e6569	2021-11-12 08:58:48 +05:30
Dipal M Zambare	849e1cee0a	Updated version number to 3.0.1. Change-Id: I07d5c26bb96b590854e1f81d41ed49a5e320f60e	2021-06-03 15:48:05 +05:30
dzambare	267a959af1	Rebased amd-staging-milan-3.0 branch on master -- Rebased on top of master commit # `6e522e5823` -- Updated merged code to remove duplicated code added by auto-merging -- Updated merged code to rename bool_t type -- Updated merged code to rename bli_thread_obarrier -- Updated merged code to rename bli_thread_obroadcast Change-Id: I39879f1ef3b42ecbe5808af3b559d88c36dbbf6c AMD-Internal: [CPUPL-1067]	2020-08-06 10:09:29 +05:30
Meghana Vankadari	6896f927da	Fixed bug in SUP code path Details: - Since GEMM kernel prefers row-storage, if input C matrix is in col-major order, entire operation is transposed. In that case uplo(c) needs to be toggled before kernel-variant selection. - disabled "bli_gemmsup_ref_var1n2m_opt_cases" inside gemmtsup. - Updated version number to 2.2.1 Change-Id: I0a85df1141fc4a98d98ea4e0c3d42db8602fa69b	2020-07-15 19:41:24 +05:30
prangana	711f26129e	Update AMD BLIS version to 2.2 Also updated Makefile to fix issue of multiple symbolic links being created Change-Id: Ie9a680cedd5c96fcd7f6af1ce0f849a58c3ed4d3	2020-05-31 21:37:32 +05:30
prangana	d21c726003	update version 2.1 Change-Id: I531fe8005f63ad138077320c3f0b03a05a7c7dd2	2019-10-30 15:33:37 +05:30
pradeeptrgit	1720efe630	Update version number to 1.2 Change-Id: Ibb31f6683cdecca6b218bc2f0c14701d7e92ebf3	2019-08-23 14:18:09 +05:30
Field G. Van Zee	f808d829c5	Handle edge cases, zero-filling in packm kernels. Details: - Updated the API and semantics of packm kernels such that they must now handle edge cases, meaning that a c-by-k packm kernel must be able to pack edge cases that are fewer than c rows/columns and be able to zero-fill the remaining elements. They must also be able to zero-fill the equivalent region when copying fewer than k columns/rows (which is needed by trsm). The new packm kernel API is generally: void packm_kernel ( conj_t conja, dim_t cdim, dim_t n, dim_t n_max, ctype* restrict kappa, ctype* restrict a, inc_t inca, inc_t lda, ctype* restrict p, inc_t ldp, cntx_t* restrict cntx ); where cdim and n are the dimensions (short and long, respectively) of the submatrix being copied from the source matrix A, and n_max is the "full" long dimension (corresponding to the k dimension in gemm) of the micropanel. The "full" short dimension (corresponding to the register blocksize MR or NR) is not part of the API because it is known intrinsically by the packm kernel implementation. Thanks to Devin Matthews for prompting us to make this change (#282). - Updated all reference packm kernels in ref_kernels/1m according to above changes, as well as all optimized packm kernels (which only consisted of those for knl). - Bumped the major soname version number in 'so_version' to 2. At first I was considering leaving it unchanged, but I couldn't escape the reality that the packm kernel API is much closer to an expert API than it is some obscure helper function interface within the framework that nobody would ever notice. - Removed reference packm kernels for mr/nr = 30. The only sub-config that would have been using those kernels is knc, which is likely no longer being used by very many people (if any). (This also mostly offset the larger object code footprint incurred by moving the edge- case handling into the individual packm kernels.) - Fixed an obscure race condition for 3mh and 4mh induced methods in which those implementations were modifying the contexts stored in the gks rather than a local copy. - Fixed a minor bug in the testsuite that prevented non-1m-based induced method implementations of trsm from executing.	2018-12-12 15:22:59 -06:00
Field G. Van Zee	e88aedae73	Separated expert, non-expert typed APIs. Details: - Split existing typed APIs into two subsets of interfaces: one for use with expert parameters, such as the cntx_t, and one without. This separation was already in place for the object APIs, and after this commit the typed and object APIs will have similar expert and non- expert APIs. The expert functions will be suffixed with "_ex" just as is the case for expert interfaces in the object APIs. - Updated internal invocations of typed APIs (functions such as bli_?setm() and bli_?scalv()) throughout BLIS to reflect use of the new explictly expert APIs. - Updated example code in examples/tapi to reflect the existence (and usage) of non-expert APIs. - Bumped the major soname version number in 'so_version'. While code compiled against a previous version/commit will likely still work (since the old typed function symbol names still exist in the new API, just with one less function argument) the semantics of the function have changed if the cntx_t parameter the application passes in is non-NULL. For example, calling bli_daxpyv() with a non-NULL context does not behave the same way now as it did before; before, the context would be used in the computation, and now the context would be ignored since the interace for that function no longer expects a context argument.	2018-07-06 19:14:02 -05:00
Field G. Van Zee	b699bb1ff0	Adopt Linux-like .so versioning at install-time. Details: - Changed the naming conventions used for installed libraries and symlinks to more closely mirror patterns used by typical GNU/Linux libraries. Whereas previously static and shared libraries were installed and symlinked as follows: (library) libblis-0.3.2-15-haswell.a (library) libblis-0.3.2-15-haswell.so (symlink) libblis.a -> libblis-0.3.2-15-haswell.a (symlink) libblis.so -> libblis-0.3.2-15-haswell.so we now use the following naming conventions: (library) libblis.a (symlink) libblis.so -> libblis.so.0.1.2 (symlink) libblis.so.0 -> libblis.so.0.1.2 (library) libblis.so.0.1.2 where 0.1.2 indicates shared library major, minor, and build versions of 0, 1, and 2, respectively. The conventional version string can still be queried by linking to the library in question and then calling bli_info_get_version_str(). (The testsuite binary does this automatically at startup.) - Added logic to common.mk to set the soname field in the shared library via the -soname linker flag. - Added a 'so_version' file to the top-level directory containing two lines. The first line specifies the .so major version number, and the second line specifies the minor and build version numbers joined with a '.'. This file is read by configure and those values substituted into build/config.mk.in to define SO_MAJOR, SO_MINORB, and SO_MMB variables.	2018-05-10 15:54:17 -05:00

19 Commits