mkurumel
f8525a888e
SGEMV performance improvement.
1.bli_sdotxf_zen_int_8 :
added hadd_ps intrinsic instead of dp_ps for
add partial dot outputs.
AMD Internal : [CPUPL-1512]
Change-Id: I6e8e71a9cf8c1f30a1710dd1c67f193a998beb03
2021-04-12 10:47:23 +05:30
..
2020-05-21 11:56:45 +05:30
2018-12-04 14:31:06 -06:00
2020-05-21 12:37:53 +05:30
2020-08-03 11:27:13 +05:30
2018-12-04 14:31:06 -06:00
2017-11-21 12:34:20 -06:00
2021-03-08 22:32:13 +05:30
2018-12-04 14:31:06 -06:00
2019-09-17 18:00:29 -05:00
2018-12-04 14:31:06 -06:00
2020-08-03 11:27:13 +05:30
2020-06-16 18:29:00 +05:30
2018-12-04 14:31:06 -06:00
2019-11-04 13:57:12 -06:00
2018-12-04 14:31:06 -06:00
2018-12-04 14:31:06 -06:00
2021-04-12 10:47:23 +05:30
2020-10-13 18:59:31 +05:30
2020-07-22 18:24:26 +05:30
2021-03-08 19:04:17 +05:30