Files
mscclpp/python/mscclpp
Binyang Li 28a57b0610 NVLS support for msccl++ executor (#375)
- Support mote datatype for multicast operation
- Add new OP MULTI_LOAD_REDUCE_STORE to support NVLS
- Modify allocSharedPhysicalCuda, which return std::shared_ptr<T>
instead of std::shared_ptr<PhysicalCudaMemory>
- Add Python support for allocSharedPhysicalCuda

Test passed for `allreduce_nvls.json`
2024-11-20 06:43:28 +00:00
..
2023-09-01 21:22:11 +08:00
2023-09-01 21:22:11 +08:00
2024-02-04 20:46:10 -08:00