Adam Osewski
061ac0649c
Polished Grouped GEMM APIs and new BF16 instances ( #1600 )
...
* Few small fixes.
* New GroupedGemm instances (BF16)
* Unify and refactor GroupedGEMM device API.
* Adapt changes to new API.
* Adapt grouped gemm profiler.
* Accept multiple kbatches for grouped gemm profiler.
- delete obsolete two stage as it is now covered by grouped gemm
* Update unit test for grouped gemm.
* Fix thresholds for BF16 and F8. Unblock tests.
* Fix few instances.
* Multiple small fixes.
* Adapt to new API, check dynamic casting.
* Uncomment few data types in grouped gemm profiler.
* Fix call to SetDeviceArgs.
* Fix profile grouped gemm multiply tile loop.
* Fix grouped gemm tile loop kernel args in client examples.
* Review comments.
2024-11-27 13:02:44 +01:00
..
2024-09-12 11:47:52 +02:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-19 13:31:17 +02:00
2023-10-31 10:46:32 +01:00
2024-04-09 23:46:21 +02:00
2024-04-09 23:46:21 +02:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-19 13:31:17 +02:00
2024-11-19 10:00:17 -08:00
2024-08-14 10:42:30 +08:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-05-23 09:17:02 -07:00
2024-04-02 09:42:17 -07:00
2024-11-20 07:03:56 -08:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-11-20 07:03:56 -08:00
2024-11-18 14:03:45 +01:00
2024-07-19 22:01:22 +08:00
2024-11-21 08:21:37 -08:00
2024-10-22 16:18:28 +02:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-11-08 10:04:33 +01:00
2024-11-04 16:33:20 -08:00
2024-11-04 13:34:17 -08:00
2024-08-16 16:07:52 -06:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-11-08 10:04:33 +01:00
2024-08-16 16:07:52 -06:00
2024-08-16 16:07:52 -06:00
2024-09-02 10:39:49 +02:00
2024-04-02 09:42:17 -07:00
2024-06-10 14:48:49 -05:00
2024-08-21 15:22:41 -07:00
2024-07-24 15:49:55 -05:00
2024-08-21 15:22:41 -07:00
2024-11-04 13:34:17 -08:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-11-27 13:02:44 +01:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-02 09:42:17 -07:00
2024-04-26 07:26:30 -05:00
2024-11-27 13:02:44 +01:00
2023-10-31 10:46:32 +01:00
2024-09-16 10:15:06 +02:00
2024-11-05 10:09:52 +01:00
2023-12-19 04:23:11 +08:00
2024-01-25 19:53:15 +08:00
2023-12-19 04:23:11 +08:00
2024-08-20 10:30:56 -05:00
2024-09-13 10:18:21 -07:00
2024-09-17 15:57:10 +02:00
2024-04-02 09:42:17 -07:00
2024-08-20 10:30:56 -05:00
2023-12-20 14:34:53 -08:00
2023-12-18 21:35:00 -06:00
2024-11-21 08:21:37 -08:00