Commit Graph

708 Commits

Author SHA1 Message Date
sthangar
64598ee4cf fixed the symlink issue
Change-Id: I2186d529f295c576597c189e1ae219bc1a83f955
2016-08-31 12:54:50 +05:30
sthangar
8a2373f26b Norm 2 optimization
Change-Id: Ide9decaccd20bf0ccc32c9abb6556e038dceed2b
2016-08-29 14:28:39 +05:30
sthangar
fdc6639023 Placed 1 and 1f AMD optimized AVX routines under zen folder
Change-Id: I26795211ef11d232ed794ce36dd0a9c1f8706328
2016-08-29 10:43:38 +05:30
praveeng
22dd6a353d Merge master code as on 2016_08_23 to amd-staging branch by praveeng
Changes to be committed:
	modified:   frame/thread/bli_mutex_openmp.h
	modified:   frame/thread/bli_mutex_pthreads.h

Change-Id: Ica522edbb1d0173f53f38d5057b1f7aef73666be
2016-08-23 15:16:49 +05:30
praveeng
f20ed3885d Merge branch 'master' of https://github.com/clMathLibraries/blis-amd for "Fixed bugs in bli_mutex_init() and friends." 2016-08-22 15:27:33 +05:30
praveeng
02ac597e4b Revert commits 357c990bdd
Change-Id: I12a34456d7eed93fda4369e76bcddb42ba7ccb99
2016-08-22 15:12:27 +05:30
praveeng
84e41cc73c Revert commits 8aee306
Change-Id: I3dd999c77c6779332a40dbb84371ca487216f189
2016-08-22 15:12:27 +05:30
praveeng
30ccfcee82 removed changes from readme file which are giving confilcts
Change-Id: Ic71ad1313e1404fed444e899466043704d875af6
2016-08-22 15:12:27 +05:30
praveeng
aeca25cd63 first commit
Change-Id: Ib50c81acda3b2c1583da3d421efc0ca547ef68e2
2016-08-22 15:12:27 +05:30
praveeng
6b2274864b small modification to readme for git push test
Change-Id: I68506a49586b07eaa907f3f85304ee40d4c92d0a
2016-08-22 15:12:27 +05:30
praveeng
daa7a9ecb2 first commit
Change-Id: Ib50c81acda3b2c1583da3d421efc0ca547ef68e2
2016-08-22 15:12:27 +05:30
praveeng
5f66a4aa05 small modification to readme for git push test
Change-Id: I68506a49586b07eaa907f3f85304ee40d4c92d0a
2016-08-22 15:11:44 +05:30
praveeng
c6cbd78d23 Revert commits 357c990bdd
Change-Id: I12a34456d7eed93fda4369e76bcddb42ba7ccb99
2016-08-22 15:11:09 +05:30
praveeng
9219a90607 Revert commits 8aee306
Change-Id: I3dd999c77c6779332a40dbb84371ca487216f189
2016-08-22 15:11:09 +05:30
praveeng
728573296e removed changes from readme file which are giving confilcts
Change-Id: Ic71ad1313e1404fed444e899466043704d875af6
2016-08-22 15:11:09 +05:30
praveeng
ad7862e291 first commit
Change-Id: Ib50c81acda3b2c1583da3d421efc0ca547ef68e2
2016-08-22 15:11:09 +05:30
praveeng
ad4b471a25 small modification to readme for git push test
Change-Id: I68506a49586b07eaa907f3f85304ee40d4c92d0a
2016-08-22 15:09:22 +05:30
praveeng
55d641363f first commit
Change-Id: Ib50c81acda3b2c1583da3d421efc0ca547ef68e2
2016-08-22 15:08:00 +05:30
praveeng
f3b6b15f6d small modification to readme for git push test
Change-Id: I68506a49586b07eaa907f3f85304ee40d4c92d0a
2016-08-22 15:08:00 +05:30
Field G. Van Zee
16a4c7a823 Fixed bugs in bli_mutex_init() and friends.
Details:
- Fixed a couple of bugs that affected OpenMP and POSIX threads
  configurations that resulted in compiler errors and warnings due
  to type mismatch, and in the case of pthreads, a missing function
  argument. The bugs are fairly recent, introduced in a017062.
2016-08-19 11:38:36 -05:00
praveeng
cdfb3c3f29 Merge master code as on 2016_07_29 to amd-staging branch by praveeng
Change-Id: Ic78b84d8b8d10158fb2a612f9a64bbc7b1f9b486
2016-07-29 12:46:21 +05:30
praveeng
4bc842ca3a Merge branch 'master' of publicrepo 2016-07-28 17:32:12 +05:30
praveeng
117f883851 Revert commits 357c990bdd
Change-Id: I12a34456d7eed93fda4369e76bcddb42ba7ccb99
2016-07-28 17:30:53 +05:30
praveeng
2fcdc28f10 Revert commits 8aee306
Change-Id: I3dd999c77c6779332a40dbb84371ca487216f189
2016-07-28 17:30:53 +05:30
praveeng
1b5d104afe removed changes from readme file which are giving confilcts
Change-Id: Ic71ad1313e1404fed444e899466043704d875af6
2016-07-28 17:30:52 +05:30
praveeng
d81273047b first commit
Change-Id: Ib50c81acda3b2c1583da3d421efc0ca547ef68e2
2016-07-28 17:30:04 +05:30
praveeng
65905c3011 small modification to readme for git push test
Change-Id: I68506a49586b07eaa907f3f85304ee40d4c92d0a
2016-07-28 17:28:55 +05:30
praveeng
23cca231be first commit
Change-Id: Ib50c81acda3b2c1583da3d421efc0ca547ef68e2
2016-07-28 17:26:49 +05:30
praveeng
922e309170 small modification to readme for git push test
Change-Id: I68506a49586b07eaa907f3f85304ee40d4c92d0a
2016-07-28 17:26:49 +05:30
praveeng
b0d510bf0e Revert commits 357c990bdd
Change-Id: I12a34456d7eed93fda4369e76bcddb42ba7ccb99
2016-07-28 15:11:08 +05:30
praveeng
5ebeece5b4 Revert commits 8aee306
Change-Id: I3dd999c77c6779332a40dbb84371ca487216f189
2016-07-28 15:01:36 +05:30
Field G. Van Zee
c31b1e7b9d Relax alignment restrictions for sandybridge ukrs.
Details:
- Relaxed the base pointer and leading dimension alignment restrictions
  in the sandybridge gemm microkernels, allowing the use of vmovups/vmovupd
  instead of vmovaps/vmovapd. These change mimic those made to the haswell
  microkernels in e0d2fa0 and ee2c139.
- Updated testsuite modules as well as standalone test drivers in 'test'
  directory to use DBL_MAX as the initial time candidate. Thanks to Devin
  Matthews for suggesting this change.
- Inserted #include "float.h" into bli_system.h (to gain access to DBL_MAX).
- Minor update (vis-a-vis contexts) to driver code in test/3m4m.
2016-07-27 15:58:07 -05:00
praveeng
50a2f2efcb Merge master code as on 2016_07_25 to amd-staging branch by praveeng
Change-Id: I84886ae241db2aac0bef6b7ef399f04aa8bca16d
2016-07-25 17:07:38 +05:30
praveeng
cfd46c88d5 Merge remote-tracking branch 'publicrepo/master' 2016-07-25 15:38:13 +05:30
praveeng
f493bf4d70 removed changes from readme file which are giving confilcts
Change-Id: Ic71ad1313e1404fed444e899466043704d875af6
2016-07-25 14:14:00 +05:30
Field G. Van Zee
a017062fdf Integrated "memory broker" (membrk_t) abstraction.
Details:
- Integrated a patch originally authored and submitted by Ricardo Magana
  of HP Enterprise. The changeset inserts use of a new object type, membrk_t,
  (memory broker) that allows multiple sets of memory pools on, for example,
  separate NUMA nodes, each of which has a separate memory space.
- Added membrk field to cntx_t and defined corresponding accessor macros.
- Added membrk field to mem_t object and defined corresponding accessor macros.
- Created new bli_membrk.c file, which contains the new memory broker API,
  including:
    bli_membrk_init(), bli_membrk_finalize()
    bli_membrk_acquire_[mv](), bli_membrk_release(),
    bli_membrk_init_pools(), bli_membrk_reinit_pools(),
    bli_membrk_finalize_pools(),
    bli_membrk_pool_size()
- In bli_mem.c, changed function calls to
    bli_mem_init_pools()     -> bli_membrk_init()
    bli_mem_reinit_pools()   -> bli_membrk_reinit()
    bli_mem_finalize_pools() -> bli_membrk_finalize()
- In bli_packv_init.c, bli_packm_init.c, changed function calls to:
    bli_mem_acquire_[mv]() -> bli_membrk_acquire_[mv]()
    bli_mem_release()      -> bli_membrk_release()
- Added bli_mutex.c and related files to frame/thread. These files define
  abstract mutexes (locks) and corresponding APIs for pthreads, openmp, or
  single-threaded execution. This new API is employed within functions
  such as bli_membrk_acquire_[mv]() and bli_membrk_release().
2016-07-22 17:02:59 -05:00
Field G. Van Zee
ce59f81108 Merge pull request #88 from devinamatthews/32bit-dim_t
Handle 32-bit dim_t in 64-bit microkernels.
2016-07-22 14:48:14 -05:00
Devin Matthews
707a2b7fac Somehow forgot the most important microkernel. 2016-07-22 13:49:44 -05:00
Devin Matthews
47ec045056 Merge remote-tracking branch 'upstream/master' into 32bit-dim_t 2016-07-22 13:45:23 -05:00
Devin Matthews
08f1d6b6fa Use 64-bit intermediate variable for k for architectures that do 64-bit loads in case dim_t is 32-bit. 2016-07-22 13:44:37 -05:00
Field G. Van Zee
ff41153f4e Merge pull request #86 from devinamatthews/haswell-vmovups
Remove alignment restrictions on C in haswell kernel.
2016-07-22 13:21:03 -05:00
Devin Matthews
e0d2fa0d83 Relax alignment restrictions for haswell sgemm. 2016-07-22 12:56:51 -05:00
Field G. Van Zee
f9214ced97 Merge pull request #85 from devinamatthews/qopenmp
Change -openmp to -fopenmp for icc.
2016-07-22 12:16:39 -05:00
Devin Matthews
ee2c139df6 Remove alignment restrictions on C in haswell kernel. 2016-07-22 12:06:03 -05:00
Devin Matthews
08666eaa20 Change -openmp to -fopenmp for icc. 2016-07-22 11:07:34 -05:00
praveeng
1aa77dfc1d Merge master code as on 2016_07_21 to amd-staging branch by praveeng
Change-Id: Ic7d0a21101358f08147736e7f1884e7409937344
2016-07-21 14:23:41 +05:30
praveeng
ec9f59836b Merge branch 'master' of https://github.com/clMathLibraries/blis-amd 2016-07-18 12:56:25 +05:30
praveeng
197e182fcb first commit
Change-Id: Ib50c81acda3b2c1583da3d421efc0ca547ef68e2
2016-07-18 12:55:10 +05:30
praveeng
41fb327110 small modification to readme for git push test
Change-Id: I68506a49586b07eaa907f3f85304ee40d4c92d0a
2016-07-18 12:55:10 +05:30
sthangar
9101a9c880 Checked in optimized 1V kernels along with benchmark codes. Also incorporated review comments for 1F kernels
Change-Id: I035c0d39e6b0bed28e6e2041242186c49f6ed55b
2016-07-13 16:51:14 +05:30