Enrico Degregori
4ebc48a3cd
WMMA gemm_add_relu_add_layernorm ( #2989 )
...
* Summary:
- Refactor epilogue (with CShuffle) to support fused operations:
- EpilogueCShuffleBase holds common parts
- EpilogueCShuffle: runs CShuffle and write out
- EpilogueWelfordCShuffle: holds Welford specific arguments, runs CShuffle, write out, Welford first part and Welford write out
- Extend thread transfer v7r3:
- Support for intermediate data type different from src and dst type
- New functionality to write to dst buffer and keep data (to be able to use them for additional operations)
* Adress review comments
2025-10-31 11:19:26 -07:00
..
2025-06-17 11:54:30 -07:00
2025-09-09 11:22:36 +08:00
2025-07-28 11:34:07 -07:00
2025-09-09 11:22:36 +08:00
2025-10-31 11:19:26 -07:00
2025-10-31 11:19:26 -07:00
2025-10-31 11:19:26 -07:00
2024-08-06 09:10:39 -07:00
2023-05-31 18:46:57 -05:00
2024-08-06 09:10:39 -07:00
2025-07-28 11:34:07 -07:00
2023-05-31 18:46:57 -05:00
2025-10-16 11:33:56 -07:00
2025-10-16 11:33:56 -07:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2024-04-19 13:31:17 +02:00
2025-07-28 11:34:07 -07:00
2023-12-20 14:34:53 -08:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2024-08-06 10:06:10 +02:00
2025-07-28 11:34:07 -07:00
2025-07-28 13:01:07 -07:00
2025-09-25 09:27:18 +08:00
2025-09-09 11:22:36 +08:00
2025-09-16 17:47:28 +02:00
2025-09-17 14:50:15 -07:00
2025-09-29 07:56:33 -07:00
2025-09-09 11:22:36 +08:00
2025-02-20 18:58:14 -08:00
2024-03-08 17:11:51 -08:00
2023-09-26 18:40:00 -05:00
2023-07-26 14:18:15 -05:00
2023-12-03 23:08:47 +01:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2023-05-31 18:46:57 -05:00
2025-10-31 11:19:26 -07:00
2025-10-31 11:19:26 -07:00
2025-10-31 11:19:26 -07:00
2025-09-16 17:47:28 +02:00
2025-10-21 15:41:02 +02:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-10-29 09:54:42 +01:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-10-21 15:41:02 +02:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-10-10 15:28:17 +08:00
2025-09-24 11:28:20 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-25 09:27:18 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 11:34:07 -07:00
2023-08-10 12:04:35 +08:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2023-05-31 18:46:57 -05:00
2025-03-05 14:33:28 -08:00
2023-08-23 11:36:17 -07:00
2025-07-28 13:01:07 -07:00