Commit Graph

9 Commits

Author SHA1 Message Date
Changho Hwang
67f9933ba1 fix data direct 2026-04-01 10:20:43 +00:00
Changho Hwang
7ce841bed0 Updates 2026-03-05 23:28:39 +00:00
Changho Hwang
448ceb66f6 updates 2026-03-05 22:59:33 +00:00
Changho Hwang
3b56b08bcb data direct 2026-03-04 23:36:39 +00:00
Changho Hwang
ac022c333c a few updates 2026-02-24 20:25:25 -08:00
Changho Hwang
98b023adc6 rocm fixes 2026-02-23 18:13:57 -08:00
Changho Hwang
febdbf9230 WIP; need amd fix 2026-02-21 00:02:03 -08:00
Changho Hwang
42be3660e0 Add a new IB stack impl that doesn't use RDMA atomics (#728)
* Added configurable InfiniBand (IB) signaling mode.
`EndpointConfig::Ib::Mode` enum selects the mode (`Default`, `Host`,
`HostNoAtomic`). `Default` is equivalent to `Host` unless specified
different by envrionment `MSCCLPP_IBV_MODE`. `Host` corresponds to the
previous implementation using RDMA atomics for signaling, while
`HostNoAtomic` uses write-with-immediate instead.
* Regarding updates in Python bindings and API.
2026-02-10 01:07:53 +00:00
Binyang Li
a707273701 Torch integration (#692)
Reorganize current native algorithm implementation and DSL algorithm
implementation.
Provide unified API for DSL algo and native algo and provide interface
to tune the algo
Provide interface for pytorch integration with native API and DSL

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
Co-authored-by: chhwang <8018170+chhwang@users.noreply.github.com>
2026-01-21 20:32:24 -08:00