Files
mscclpp/src
Binyang Li 28a57b0610 NVLS support for msccl++ executor (#375)
- Support mote datatype for multicast operation
- Add new OP MULTI_LOAD_REDUCE_STORE to support NVLS
- Modify allocSharedPhysicalCuda, which return std::shared_ptr<T>
instead of std::shared_ptr<PhysicalCudaMemory>
- Add Python support for allocSharedPhysicalCuda

Test passed for `allreduce_nvls.json`
2024-11-20 06:43:28 +00:00
..
2024-10-17 21:25:46 -07:00
2024-04-25 11:06:43 -07:00
2023-11-24 16:41:56 +08:00
2024-10-17 21:25:46 -07:00
2024-10-17 21:25:46 -07:00
2024-03-27 10:24:24 +08:00
2023-11-24 16:41:56 +08:00
2023-11-24 16:41:56 +08:00
2023-09-01 21:22:11 +08:00
2024-05-25 23:12:57 -07:00