Commit Graph

39 Commits

Author SHA1 Message Date
Changho Hwang
b6ea0ca266 IB unit test (#47) 2023-04-07 21:45:14 +08:00
Changho Hwang
b7461facff Fix Makefile 2023-04-07 13:09:56 +00:00
Changho Hwang
949a9cd0a3 Optional use of gdrcopy (#48)
Co-authored-by: Saeed Maleki <saemal@microsoft.com>
2023-04-07 13:36:59 +08:00
Saeed Maleki
6c1ebed569 combining ./python and ./ lint formats into makefile 2023-04-06 23:26:56 +00:00
Saeed Maleki
be5e422021 merged with main 2023-03-29 23:03:12 +00:00
Binyang2014
62279b0063 Add mscclppSetBootstrapConnTimeout (#34) 2023-03-28 14:01:56 +08:00
Saeed Maleki
33af4bfb67 no gdr copy anywhere in the code except for the files that are not compiled 2023-03-28 05:36:31 +00:00
Changho Hwang
72431957fd Use clang-format-12 2023-03-27 14:00:03 +00:00
Binyang Li
7ec6ae9d6a add cpplint and CI 2023-03-27 03:32:10 +00:00
Saeed Maleki
3fb9383621 Merge pull request #24 from microsoft/madanm-apipush
simplified API for CUDA level communication calls.
2023-03-24 10:42:07 -07:00
Ziyue Yang
f92b428cba Port NPKit 2023-03-24 06:41:16 +00:00
Changho Hwang
e7459032e0 Add patch version 2023-03-24 05:19:25 +00:00
Saeed Maleki
777e93ee47 merged with main 2023-03-24 02:35:15 +00:00
Madan Musuvathi
e569175832 added documentation 2023-03-23 00:39:45 +00:00
Changho Hwang
9a6ddfd244 Update makefile 2023-03-22 09:19:47 +00:00
Saeed Maleki
0a707d84ec new api works -- single node is not performant 2023-03-22 02:19:49 +00:00
Olli Saarikivi
0cfe2dcffb Add allpairs allreduce test
To support this include separate source and destination offsets in the trigger.
Add functions for getting the rank and world size from a communicator.
2023-03-21 19:00:13 +00:00
Saeed Maleki
93afed3e54 new allgather algorithm with both DMA and IB on a single node 2023-03-19 21:53:36 +00:00
Saeed Maleki
2061ea91f7 Add allgather_test (#14) 2023-03-17 12:55:20 +08:00
Saeed Maleki
2279a690d1 mscclpp_net.h is not required anywhere 2023-03-14 05:38:15 +00:00
Changho Hwang
29a430e7a8 NUMA binding 2023-02-23 08:18:12 +00:00
Changho Hwang
48b81edf6d Move some files 2023-02-22 11:07:22 +00:00
Changho Hwang
4cfb9b6727 GDRCopy support 2023-02-22 09:19:40 +00:00
Changho Hwang
7459a08699 Rename test code 2023-02-22 06:56:25 +00:00
Changho Hwang
368f8f4d24 Merge branch 'saemal/cleanup' into chhwang/p2p-simple 2023-02-22 06:54:51 +00:00
Changho Hwang
33e20aceb9 IB all-to-all works 2023-02-17 11:39:16 +00:00
Saeed Maleki
4f3418aa77 more clean up 2023-02-16 07:13:04 +00:00
Changho Hwang
8e57fd9896 p2p all-to-all works 2023-02-13 11:25:20 +00:00
Saeed Maleki
dfe1c4500a test without mpi 2023-02-07 22:48:37 +00:00
Saeed Maleki
b9a253f82a Revert "Add transport"
This reverts commit 692e9acd8f.
2023-02-07 22:28:14 +00:00
Changho Hwang
692e9acd8f Add transport 2023-02-07 13:20:48 +00:00
Changho Hwang
8f7ebe99e3 Build into a shared library 2023-02-07 07:36:37 +00:00
Changho Hwang
d3f46ebfb9 Add bootstrap_test 2023-02-06 11:00:01 +00:00
Changho Hwang
b4bd7489f0 Move bootstrap components to bootstrap/. 2023-02-06 08:02:13 +00:00
Saeed Maleki
f3e6f7fe8b prints the unique id 2023-02-06 06:11:29 +00:00
Saeed Maleki
0902ce89c6 compiles 2023-02-06 05:32:24 +00:00
Saeed Maleki
722786de59 even more bootstrap files 2023-02-06 04:00:00 +00:00
v-xiaoxshi
200f5637bb more bootstrap files 2023-02-04 05:07:48 +00:00
Changho Hwang
82fe0b667d Add a makefile and logging functions 2023-02-03 12:29:27 +00:00