Commit Graph

114 Commits

Author SHA1 Message Date
Saeed Maleki
f3f53a4148 lint 2023-04-08 06:32:57 +00:00
Saeed Maleki
e336c93dc9 Merge branch 'main' into binyli/mscclpp-test 2023-04-08 06:30:48 +00:00
Saeed Maleki
5ca7212065 removed ip_port 2023-04-08 06:29:54 +00:00
Saeed Maleki
ec83a27e83 wip 2023-04-08 01:57:22 +00:00
Changho Hwang
b6ea0ca266 IB unit test (#47) 2023-04-07 21:45:14 +08:00
Binyang Li
e303a532ff address comments 2023-04-07 05:42:21 +00:00
Binyang Li
bf472ff864 Fix bug & remove pthread related code 2023-04-07 03:37:47 +00:00
Saeed Maleki
1b2db68e93 filename changes 2023-04-06 23:55:11 +00:00
Saeed Maleki
8c6e361044 Merge branch 'main' into binyli/mscclpp-test 2023-04-06 22:00:01 +00:00
Binyang Li
674d30a813 minor fix 2023-04-04 08:37:16 +00:00
Binyang Li
5648ac5d7d fix lint 2023-04-04 07:04:12 +00:00
Binyang Li
69a49c189f Add correctness check 2023-04-04 07:01:21 +00:00
Binyang Li
7d6dad226b fix 2023-04-03 09:02:58 +00:00
Binyang Li
4de3dba818 fix lint 2023-04-03 06:14:41 +00:00
Saeed Maleki
2c6460ce72 bug fix for allgather0 2023-04-03 04:36:20 +00:00
Binyang Li
d3c7cc721a fix lint 2023-04-03 04:04:16 +00:00
Binyang Li
617f39daf1 change makefile 2023-04-03 03:45:36 +00:00
Binyang Li
212f92cfba fix 2023-04-02 11:13:43 +00:00
Binyang Li
36c418a239 Merge branch 'main' into binyli/mscclpp-test 2023-04-02 08:13:48 +00:00
Saeed Maleki
5ff64d36f4 documents for allgather2 + refactoring local allgather 2023-04-02 03:36:22 +00:00
Saeed Maleki
0887cfe768 no need for remapping anymore 2023-04-02 02:35:08 +00:00
Binyang Li
f02ba52b0a fix 2023-04-01 07:10:20 +00:00
Binyang Li
9c1be4c03a fix 2023-04-01 05:10:14 +00:00
Saeed Maleki
701255959e lint 2023-03-31 23:34:43 +00:00
Saeed Maleki
97dadd8d64 merged with main 2023-03-31 23:32:01 +00:00
Binyang Li
a3adc99b5c more code 2023-03-31 15:10:51 +00:00
Binyang Li
98020f5b52 update 2023-03-31 07:10:43 +00:00
Saeed Maleki
44e8760af7 allgather kernel2 2023-03-31 06:31:25 +00:00
Binyang Li
c7b3d0b0fd update 2023-03-31 03:58:09 +00:00
Saeed Maleki
fef0bff945 a third kernel for allgather cross-node 2023-03-30 23:24:04 +00:00
Saeed Maleki
29254439e5 Merge pull request #38 from microsoft/saemal/removing_gdrcopy
removing gdrcopy and adding flush functionality
2023-03-30 13:17:47 -07:00
Binyang Li
20b644b88d port more code from nccl-test 2023-03-30 09:13:28 +00:00
Binyang Li
22a977e730 init 2023-03-30 06:29:38 +00:00
Saeed Maleki
debd110874 fused flush instructions 2023-03-29 22:17:02 +00:00
Saeed Maleki
d97bee6973 flush mechanism 2023-03-29 17:31:20 +00:00
Bin Wang
7880be8ee2 Fix the 2 GiB limit in allgather test. (#36) 2023-03-29 19:02:43 +08:00
Saeed Maleki
43c52367fb merged with main and simplified the callback requirements 2023-03-27 23:41:27 +00:00
Saeed Maleki
19bf369dc1 link format correction 2023-03-27 20:40:15 +00:00
Saeed Maleki
0898214f0a added mscclppGetErrorString 2023-03-24 22:57:14 +00:00
Saeed Maleki
56b599b5e7 a bit of api change and clean up on docs 2023-03-24 17:41:04 +00:00
Changho Hwang
274e921009 Minor fixes 2023-03-24 07:28:30 +00:00
Saeed Maleki
c042112b6b perf debug for allgather 2023-03-24 06:49:38 +00:00
Saeed Maleki
777e93ee47 merged with main 2023-03-24 02:35:15 +00:00
Saeed Maleki
56d86472e6 done with allgather_test commenting 2023-03-24 00:22:47 +00:00
Saeed Maleki
58595b1410 Merge pull request #23 from microsoft/saemal/apipush
cleaner allgather_test
2023-03-23 16:17:55 -07:00
Saeed Maleki
86f79ef442 cleaner allgather_test 2023-03-23 23:13:00 +00:00
Madan Musuvathi
e6ee81e4fa fixed the order of remote rank and tag in mscclppConnect API 2023-03-23 21:09:04 +00:00
Madan Musuvathi
72edabe2a6 added GetDevConn api to retrieve a connection from remoteRank and tag 2023-03-23 21:03:30 +00:00
Madan Musuvathi
896539b236 Comm owns all state including devcons 2023-03-22 22:43:32 +00:00
Saeed Maleki
e1cd88ca0b bootstrap test with and without uniq_id 2023-03-22 21:11:27 +00:00