Commit Graph

72 Commits

Author SHA1 Message Date
Saeed Maleki
e7d5e652df Python bindings (#125)
Co-authored-by: Olli Saarikivi <olsaarik@microsoft.com>
Co-authored-by: Changho Hwang <changhohwang@microsoft.com>
Co-authored-by: Binyang Li <binyli@microsoft.com>
2023-07-19 15:35:54 +08:00
Saeed Maleki
df2f0c14ab bootstrap now takes interface (#113)
This PR fixes the issue regarding taking the interface as an input.
2023-06-29 00:16:06 +08:00
Changho Hwang
21eed722af Add license comments (#106) 2023-06-25 12:40:12 +08:00
Changho Hwang
c4a5958dfc Fix hanging bootstrap issues (#100)
* Renew socket interfaces and error handling into C++ style
* Fix bootstrap hanging bugs
* Misc code cleanup

---------

Co-authored-by: Binyang Li <binyli@microsoft.com>
Co-authored-by: Saeed Maleki <saemal@microsoft.com>
2023-06-15 11:29:49 +08:00
Olli Saarikivi
5d5e9a1805 Make bootstrap use persistent sockets (#98) 2023-06-12 15:13:30 +08:00
Changho Hwang
5a4885ccbb Misc updates (#95) 2023-06-12 13:53:43 +08:00
Changho Hwang
9cee6c4a74 Cleanup old files and functions (#86) 2023-06-01 17:34:57 +08:00
Olli Saarikivi
457c422791 Remove alloc.h and beef up cuda_utils.hpp (#82) 2023-05-24 08:34:18 +00:00
Olli Saarikivi
4e4d1972e3 Cuda smart pointers 2023-05-16 16:16:00 -07:00
Olli Saarikivi
9f6c48cbf9 Format all files 2023-05-11 00:23:14 +00:00
Olli Saarikivi
ccf45b33a2 Delete old init code and other C-style code 2023-05-10 22:03:42 +00:00
Olli Saarikivi
beaf2aea39 Move public headers under include/ 2023-05-10 20:46:49 +00:00
Saeed Maleki
1769138568 Host Epoch + Error code 2023-05-09 23:10:12 +00:00
Binyang2014
8650dbaff8 Add exception class for mscclpp (#67)
Add exception class for mscclpp
2023-05-06 16:27:25 +08:00
Saeed Maleki
82c27625e6 ipc uses a base ptr now 2023-04-27 21:33:15 +00:00
Saeed Maleki
8fc822c848 more tests for bootstrap 2023-04-25 22:26:48 +00:00
Saeed Maleki
b73b0132ba using find instead of searching 2023-04-25 21:27:23 +00:00
Saeed Maleki
8f2f053f2f more clean up 2023-04-25 21:08:49 +00:00
Changho Hwang
71b075e0d7 Rename 2023-04-25 12:29:32 +00:00
Changho Hwang
4115559c2f cleanup 2023-04-25 12:25:08 +00:00
Changho Hwang
bb195b2f29 PascalCase for type names 2023-04-25 11:57:02 +00:00
Changho Hwang
31f7897d5d integrate with new interfaces in mscclpp.hpp 2023-04-25 11:47:58 +00:00
Saeed Maleki
8428b49858 a few minor changes 2023-04-25 01:51:47 +00:00
Saeed Maleki
3546e80aa0 unique ptr for pimpl_ in bootstrap 2023-04-25 00:47:48 +00:00
Saeed Maleki
3fd95265fd Revert "lint"
This reverts commit 2c52ab37ce.
2023-04-24 23:22:56 +00:00
Saeed Maleki
2c52ab37ce lint 2023-04-24 23:09:12 +00:00
Saeed Maleki
d6e91338d4 bootstrap tests pass 2023-04-24 23:07:38 +00:00
Saeed Maleki
27114d91fb bootstrap tests pass 2023-04-24 21:50:03 +00:00
Saeed Maleki
f0f058410a working bootstrap initialization 2023-04-24 19:25:06 +00:00
Saeed Maleki
6f4dc57331 fixed 2023-04-24 07:45:01 +00:00
Saeed Maleki
a9cfb82fcb wip 2023-04-24 05:58:11 +00:00
Binyang Li
073460c341 fx compile issue 2023-04-23 14:25:56 +00:00
Binyang Li
7e1a77a132 make build pass 2023-04-21 09:41:52 +00:00
Binyang Li
7ac861b1e9 Refactor bootstrap 2023-04-21 08:41:33 +00:00
Binyang2014
804692f282 Binyli/bootstrap (#60)
Bootstrap refactor.
2023-04-21 13:59:42 +08:00
Saeed Maleki
9c8942f7ac wip 2023-04-19 22:09:53 +00:00
Saeed Maleki
ec9737db82 progress 2023-04-19 00:34:47 +00:00
Changho Hwang
d2c2ae72a7 Some cleanup 2023-04-11 08:45:22 +00:00
Saeed Maleki
f3f53a4148 lint 2023-04-08 06:32:57 +00:00
Saeed Maleki
ec83a27e83 wip 2023-04-08 01:57:22 +00:00
Saeed Maleki
e2cfd5ac83 a lot of documentation 2023-03-30 00:37:33 +00:00
Saeed Maleki
be5e422021 merged with main 2023-03-29 23:03:12 +00:00
Binyang2014
62279b0063 Add mscclppSetBootstrapConnTimeout (#34) 2023-03-28 14:01:56 +08:00
Saeed Maleki
fa26bdd9fc no gdr copy anywhere in the code except for the files that are not compiled 2023-03-28 05:40:40 +00:00
Saeed Maleki
19bf369dc1 link format correction 2023-03-27 20:40:15 +00:00
Saeed Maleki
35b8ebaf64 retry for almost 20 seconds 2023-03-24 19:42:00 +00:00
Changho Hwang
7a4c27778f 30 sec timeout for socket accept 2023-03-24 08:29:00 +00:00
Changho Hwang
9f2eef35d3 Init from a given mscclppUniqueId 2023-03-22 11:25:49 +00:00
Changho Hwang
ae01fa4958 Remove mscclpp_net.h and net.h 2023-03-14 08:32:19 +00:00
Saeed Maleki
e000eb9177 some compilation clean up 2023-03-14 05:26:54 +00:00