Enrico Degregori
440358c168
Wave Tile Transfer supporting global load with transpose ( #3027 )
...
* Initial implementation:
- add new thread group transfer supporting transpose instruction
- refactor AB transfer to switch between thread and wave tiles methods
* Add some comments and remove explicit wave and lane calculations
* Remove compiler option for performance
* fp16 example: use tuned instance
* Missing cleanup
* Integrate wave transfer in existing gemm and batched gemm instances
* Add fast instances
* extend implementation for 8 bit datatypes
packed types not supported
* Address review comments
* Optimize pipeline v1 and re-introduce compiler option
* Disable wave tile approach for b scale gemm
* Fix for clang20
* Avoid code duplication of amd_global_load_transpose_to_vgpr function
2025-10-16 11:33:56 -07:00
..
2025-06-17 11:54:30 -07:00
2025-09-09 11:22:36 +08:00
2025-07-28 11:34:07 -07:00
2025-09-09 11:22:36 +08:00
2024-08-06 09:10:39 -07:00
2023-05-31 18:46:57 -05:00
2024-08-06 09:10:39 -07:00
2025-07-28 11:34:07 -07:00
2023-05-31 18:46:57 -05:00
2025-10-16 11:33:56 -07:00
2025-10-16 11:33:56 -07:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2024-04-19 13:31:17 +02:00
2025-07-28 11:34:07 -07:00
2023-12-20 14:34:53 -08:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2024-08-06 10:06:10 +02:00
2025-07-28 11:34:07 -07:00
2025-07-28 13:01:07 -07:00
2025-09-25 09:27:18 +08:00
2025-09-09 11:22:36 +08:00
2025-09-16 17:47:28 +02:00
2025-09-17 14:50:15 -07:00
2025-09-29 07:56:33 -07:00
2025-09-09 11:22:36 +08:00
2025-02-20 18:58:14 -08:00
2024-03-08 17:11:51 -08:00
2023-09-26 18:40:00 -05:00
2023-07-26 14:18:15 -05:00
2023-12-03 23:08:47 +01:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2023-05-31 18:46:57 -05:00
2025-10-16 11:33:56 -07:00
2025-10-16 11:33:56 -07:00
2025-10-16 11:33:56 -07:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-10-10 15:28:17 +08:00
2025-09-24 11:28:20 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-25 09:27:18 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 11:34:07 -07:00
2023-08-10 12:04:35 +08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2025-03-05 14:33:28 -08:00
2023-08-23 11:36:17 -07:00
2025-07-28 13:01:07 -07:00