Commit Graph

293 Commits

Author SHA1 Message Date
Saeed Maleki
f8dcb59f76 Merge pull request #52 from microsoft/ziyyang/npkit-fix-numa
NPKit: remove timestamp update thread
2023-04-08 00:18:19 -07:00
Ziyue Yang
5f0b58abda fix lint 2023-04-08 07:16:32 +00:00
Ziyue Yang
09de60854e fix lint 2023-04-08 07:15:25 +00:00
Ziyue Yang
3796f5251e Merge branch 'ziyyang/npkit-fix-numa' of https://github.com/microsoft/mscclpp into ziyyang/npkit-fix-numa 2023-04-08 07:13:08 +00:00
Ziyue Yang
748d3d1596 separate flag and data 2023-04-08 07:12:46 +00:00
Saeed Maleki
3d2a3a3b3a Merge branch 'main' into ziyyang/npkit-fix-numa 2023-04-08 06:37:58 +00:00
Saeed Maleki
fc3b317f1d Merge pull request #42 from microsoft/binyli/mscclpp-test
Init version of mscclpp-test
2023-04-07 23:34:59 -07:00
Saeed Maleki
f3f53a4148 lint 2023-04-08 06:32:57 +00:00
Saeed Maleki
e336c93dc9 Merge branch 'main' into binyli/mscclpp-test 2023-04-08 06:30:48 +00:00
Saeed Maleki
5ca7212065 removed ip_port 2023-04-08 06:29:54 +00:00
Saeed Maleki
7e091d3806 all good now 2023-04-08 06:22:12 +00:00
Ziyue Yang
f68eeba2d4 change clock collection approach 2023-04-08 05:29:34 +00:00
Saeed Maleki
ec83a27e83 wip 2023-04-08 01:57:22 +00:00
Changho Hwang
b6ea0ca266 IB unit test (#47) 2023-04-07 21:45:14 +08:00
Changho Hwang
b7461facff Fix Makefile 2023-04-07 13:09:56 +00:00
Binyang Li
e303a532ff address comments 2023-04-07 05:42:21 +00:00
Changho Hwang
949a9cd0a3 Optional use of gdrcopy (#48)
Co-authored-by: Saeed Maleki <saemal@microsoft.com>
2023-04-07 13:36:59 +08:00
Binyang Li
bf472ff864 Fix bug & remove pthread related code 2023-04-07 03:37:47 +00:00
Saeed Maleki
1b2db68e93 filename changes 2023-04-06 23:55:11 +00:00
Saeed Maleki
6c1ebed569 combining ./python and ./ lint formats into makefile 2023-04-06 23:26:56 +00:00
Saeed Maleki
8c6e361044 Merge branch 'main' into binyli/mscclpp-test 2023-04-06 22:00:01 +00:00
Saeed Maleki
e82a75c132 typo fix 2023-04-06 21:58:48 +00:00
Ziyue Yang
352a10a33d NPKit: improve event collection for async requests (#45) 2023-04-06 16:21:34 +08:00
Saeed Maleki
cd3cd2c157 lint 2023-04-06 03:20:21 +00:00
Saeed Maleki
08275e93d7 added barrier API + pushed one after mscclppsetup 2023-04-06 03:15:54 +00:00
Saeed Maleki
ef851d2557 Merge pull request #44 from microsoft/crutcher-fixformat
Fixup formating for python
2023-04-05 16:28:37 -07:00
Crutcher Dunnavant
f7e330da21 Fixup formating for python 2023-04-05 21:47:08 +00:00
Saeed Maleki
fa7d2ad877 Merge pull request #43 from microsoft/crutcher-bootstrap
[python] Pull in bootstrap all gather, log callbacks.
2023-04-04 18:43:22 -07:00
Saeed Maleki
5fe87e5da6 lint fixes 2023-04-05 01:41:21 +00:00
Crutcher Dunnavant
151b29f70c docs and format 2023-04-04 18:55:08 +00:00
Crutcher Dunnavant
659a88a767 remove env hook 2023-04-04 18:31:08 +00:00
Crutcher Dunnavant
aaf93c858d extract level 2023-04-04 18:29:45 +00:00
Crutcher Dunnavant
0df50830e1 log callbacks 2023-04-04 17:57:46 +00:00
Binyang Li
674d30a813 minor fix 2023-04-04 08:37:16 +00:00
Binyang Li
5648ac5d7d fix lint 2023-04-04 07:04:12 +00:00
Binyang Li
69a49c189f Add correctness check 2023-04-04 07:01:21 +00:00
Crutcher Dunnavant
423affeaa6 all gather bytes, json, pickle 2023-04-03 23:39:06 +00:00
Crutcher Dunnavant
17e1885981 allocation fixes 2023-04-03 23:39:06 +00:00
Crutcher Dunnavant
8cac41c8ac [python] working on bootstrap all gather bug 2023-04-03 23:39:06 +00:00
Binyang Li
7d6dad226b fix 2023-04-03 09:02:58 +00:00
Binyang Li
4de3dba818 fix lint 2023-04-03 06:14:41 +00:00
Saeed Maleki
2c6460ce72 bug fix for allgather0 2023-04-03 04:36:20 +00:00
Binyang Li
d3c7cc721a fix lint 2023-04-03 04:04:16 +00:00
Binyang Li
617f39daf1 change makefile 2023-04-03 03:45:36 +00:00
Binyang Li
212f92cfba fix 2023-04-02 11:13:43 +00:00
Binyang Li
36c418a239 Merge branch 'main' into binyli/mscclpp-test 2023-04-02 08:13:48 +00:00
Saeed Maleki
bfbdaf6b05 Merge pull request #41 from microsoft/saemal/allgather_hier
Saemal/allgather hier
2023-04-01 20:37:43 -07:00
Saeed Maleki
5ff64d36f4 documents for allgather2 + refactoring local allgather 2023-04-02 03:36:22 +00:00
Saeed Maleki
0887cfe768 no need for remapping anymore 2023-04-02 02:35:08 +00:00
Saeed Maleki
5cf3f3c524 Merge pull request #39 from microsoft/binyli/numabindAPI
bind proxy threads and host allocation are now done automatically and closest to the device
2023-04-01 19:16:43 -07:00