Bartlomiej Wroblewski
ad0a8e4cd2
Optimize fp16 direct load GEMM instances (#1086)
This PR optimizes the fp16 instances of the direct load GEMM kernel introduced in #999 and #1052.
Measured the performance of the new instances on a CDNA2 GPU and compared it against the best non-direct-load GEMM instances, using 76 different GEMM problems.
On average, this change improves performance on the tested problems by 47%. For cases known to be latency-bound, the speedup is around 126%.
2023-12-18 11:09:10 +01:00