blis/frame
Madan mohan Manokar 70e9d327a2 squarePacked(sqp) framework and multi-instance handling
1. kx partitions added to k loop for dgemm and zgemm.
2. mx loop based threading model added for dgemm as a prototype for zgemm.
3. nx loop added for 3m_sqp and dgemm_sqp.
4. single 3m_sqp workspace allocation with smaller memory footprint.
5. sqp framework done for dgemm and zgemm.
6. sqp kernels moved to separate kernel file.
7. residue kernel core added to handle mx<8.
8. multi-instance tuning for 3m_sqp done.
9. user can set the environment variable "BLIS_MULTI_INSTANCE" to 1 for better multi-instance behavior of 3m_sqp.
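
As a rough illustration of item 9, the sketch below shows one way a library could check whether an environment variable is set to 1. The helper name env_flag_is_one is hypothetical and is not taken from the BLIS sources; how the sqp code actually reads and caches the variable is not shown in this commit message.

```c
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper (not from the BLIS sources): returns true only when
   the named environment variable exists and is exactly the string "1". */
static bool env_flag_is_one(const char *name)
{
    const char *value = getenv(name);  /* NULL when the variable is unset */
    return value != NULL && strcmp(value, "1") == 0;
}
```

Under this sketch, a user would run `export BLIS_MULTI_INSTANCE=1` in the shell before launching the application, and the library would call `env_flag_is_one("BLIS_MULTI_INSTANCE")` once at startup.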

AMD-Internal: [CPUPL-1521]
Change-Id: Ibef50a8a37fe99f164edb4621acb44fc0c86514c
2021-06-28 15:40:11 +05:30
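
Item 1 of the commit message mentions kx partitions in the k loop. The sketch below is one plausible reading of that idea, assuming it means splitting the k dimension into panels of at most kx and accumulating each panel into C. The function name dgemm_k_partitioned is hypothetical, and the real sqp code packs data and calls optimized kernels rather than using plain loops.

```c
#include <stddef.h>

/* Illustrative only: computes C += A * B with the k loop split into panels
   of at most kx. A is m x k, B is k x n, C is m x n, all column-major with
   tightly packed leading dimensions (assumptions made for this sketch). */
static void dgemm_k_partitioned(size_t m, size_t n, size_t k, size_t kx,
                                const double *a, const double *b, double *c)
{
    if (kx == 0) kx = k;  /* treat 0 as "no partitioning" */

    for (size_t k0 = 0; k0 < k; k0 += kx) {
        /* Size of this panel; the last one may be shorter than kx. */
        size_t kb = (k - k0 < kx) ? (k - k0) : kx;

        for (size_t j = 0; j < n; ++j) {
            for (size_t p = k0; p < k0 + kb; ++p) {
                double bpj = b[p + j * k];
                for (size_t i = 0; i < m; ++i)
                    c[i + j * m] += a[i + p * m] * bpj;
            }
        }
    }
}
```

Each panel touches only kb columns of A and kb rows of B, which is the property that lets a packed implementation keep its working set small.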