BingYuan.Zhou
6a3960c1e1
Flatmm merge (#2168)
...
* sync with function interface of cshuffleepiloge, fix flatmm build fail
* move code from solin/flatmm, which adds mfma16*16*32fp8 and optimizes flatmm
---------
Co-authored-by: solin <bingzhou@amd.com>
2025-05-08 12:59:57 +08:00