Changho Hwang
d63f9403c0
IB host-no-atomic: GDRCopy + mlx5dv Data Direct for memory-consistent low-latency signaling ( #753 )
...
Major enhancements to the IB signal forwarding mechanisms
(`host-no-atomic` mode), primarily adding support for GDRCopy and MLX5
Direct Verbs, and refactoring the signal forwarding path for IB
HostNoAtomic mode. The changes fix memory consistency issues and reduce
signaling latency.
- GDRCopy and MLX5 Direct Verbs MR integration
- Signal forwarding path redesign
- Semaphore and connection API updates
- Environment (`MSCCLPP_FORCE_DISABLE_GDR`) and documentation updates
2026-04-09 09:24:30 +00:00
Changho Hwang
fc0aaaf1b4
Auto-detect CUDA arch in CMake GPU check ( #666 )
...
Compute capability 60 support is dropped from CUDA 13
Co-authored-by: Binyang Li <binyli@microsoft.com >
2025-10-27 11:25:24 -07:00
Changho Hwang
5b84c8a3d1
Separate linters from cmake ( #587 )
2025-07-28 09:59:20 +08:00
Binyang Li
65c10fa8ec
Support any GPUs per node for NCCL_API ( #566 )
...
Support any GPUs per node for NCCL API
2025-07-11 13:42:39 -07:00
Changho Hwang
1989d4be9c
Fix CMake build messages ( #443 )
2025-01-08 02:44:01 +00:00
Changho Hwang
e2230aab26
Tackle build warnings ( #422 )
...
* Comply with
[CMP0165](https://cmake.org/cmake/help/latest/policy/CMP0165.html )
* Tackle other warnings during build
2024-12-19 16:51:50 -08:00
caiomcbr
7493e2f075
Double buffering for NCCL APIs ( #324 )
...
Using two scratch buffers in each peer to exchange data.
---------
Co-authored-by: Changho Hwang <changhohwang@microsoft.com >
2024-07-15 22:18:53 +00:00
Changho Hwang
544ff0c21d
ROCm support ( #213 )
...
Co-authored-by: Binyang Li <binyli@microsoft.com >
2023-11-24 16:41:56 +08:00
Changho Hwang
15f6dcca49
Update documentation ( #217 )
...
Co-authored-by: Saeed Maleki <saemal@microsoft.com >
2023-11-22 12:58:04 -08:00
Saeed Maleki
8d1b984bed
Change device handle interfaces & others ( #142 )
...
* Changed device handle interfaces
* Changed proxy service interfaces
* Move device code into separate files
* Fixed FIFO polling issues
* Add configuration arguments in several interface functions
---------
Co-authored-by: Changho Hwang <changhohwang@microsoft.com >
Co-authored-by: Binyang Li <binyli@microsoft.com >
Co-authored-by: root <root@a100-saemal0.qxveptpukjsuthqvv514inp03c.gx.internal.cloudapp.net >
2023-08-16 20:00:56 +08:00
Saeed Maleki
e7d5e652df
Python bindings ( #125 )
...
Co-authored-by: Olli Saarikivi <olsaarik@microsoft.com >
Co-authored-by: Changho Hwang <changhohwang@microsoft.com >
Co-authored-by: Binyang Li <binyli@microsoft.com >
2023-07-19 15:35:54 +08:00
Changho Hwang
4114d65c60
Documents & minor updates ( #119 )
...
Co-authored-by: Saeed Maleki <saemal@microsoft.com >
Co-authored-by: Binyang Li <binyli@microsoft.com >
2023-07-07 17:35:05 +08:00
Changho Hwang
21eed722af
Add license comments ( #106 )
2023-06-25 12:40:12 +08:00
Olli Saarikivi
4f528d29a0
Make clang-format style file explicit
2023-05-05 19:15:38 +00:00
Olli Saarikivi
86be901d98
CMake improvements
2023-05-05 19:11:33 +00:00
Olli Saarikivi
503cdd5c7e
CMake build system transition WIP
2023-05-03 23:52:13 +00:00