Enrico Degregori
440358c168
Wave Tile Transfer supporting global load with transpose (#3027)
* Initial implementation:
  - add new thread group transfer supporting transpose instruction
  - refactor AB transfer to switch between thread and wave tile methods
* Add some comments and remove explicit wave and lane calculations
* Remove compiler option for performance
* fp16 example: use tuned instance
* Missing cleanup
* Integrate wave transfer in existing gemm and batched gemm instances
* Add fast instances
* Extend implementation for 8-bit datatypes (packed types not supported)
* Address review comments
* Optimize pipeline v1 and re-introduce compiler option
* Disable wave tile approach for b scale gemm
* Fix for clang20
* Avoid code duplication of amd_global_load_transpose_to_vgpr function
2025-10-16 11:33:56 -07:00