Changho Hwang
|
b6ea0ca266
|
IB unit test (#47)
|
2023-04-07 21:45:14 +08:00 |
|
Changho Hwang
|
b7461facff
|
Fix Makefile
|
2023-04-07 13:09:56 +00:00 |
|
Changho Hwang
|
949a9cd0a3
|
Optional use of gdrcopy (#48)
Co-authored-by: Saeed Maleki <saemal@microsoft.com>
|
2023-04-07 13:36:59 +08:00 |
|
Saeed Maleki
|
6c1ebed569
|
combining ./python and ./ lint formats into makefile
|
2023-04-06 23:26:56 +00:00 |
|
Saeed Maleki
|
be5e422021
|
merged with main
|
2023-03-29 23:03:12 +00:00 |
|
Binyang2014
|
62279b0063
|
Add mscclppSetBootstrapConnTimeout (#34)
|
2023-03-28 14:01:56 +08:00 |
|
Saeed Maleki
|
33af4bfb67
|
no gdr copy anywhere in the code except for the files that are not compiled
|
2023-03-28 05:36:31 +00:00 |
|
Changho Hwang
|
72431957fd
|
Use clang-format-12
|
2023-03-27 14:00:03 +00:00 |
|
Binyang Li
|
7ec6ae9d6a
|
add cpplint and CI
|
2023-03-27 03:32:10 +00:00 |
|
Saeed Maleki
|
3fb9383621
|
Merge pull request #24 from microsoft/madanm-apipush
simplified API for CUDA level communication calls.
|
2023-03-24 10:42:07 -07:00 |
|
Ziyue Yang
|
f92b428cba
|
Port NPKit
|
2023-03-24 06:41:16 +00:00 |
|
Changho Hwang
|
e7459032e0
|
Add patch version
|
2023-03-24 05:19:25 +00:00 |
|
Saeed Maleki
|
777e93ee47
|
merged with main
|
2023-03-24 02:35:15 +00:00 |
|
Madan Musuvathi
|
e569175832
|
added documentation
|
2023-03-23 00:39:45 +00:00 |
|
Changho Hwang
|
9a6ddfd244
|
Update makefile
|
2023-03-22 09:19:47 +00:00 |
|
Saeed Maleki
|
0a707d84ec
|
new api works -- single node is not performant
|
2023-03-22 02:19:49 +00:00 |
|
Olli Saarikivi
|
0cfe2dcffb
|
Add allpairs allreduce test
To support this include separate source and destination offsets in the trigger.
Add functions for getting the rank and world size from a communicator.
|
2023-03-21 19:00:13 +00:00 |
|
Saeed Maleki
|
93afed3e54
|
new allgather algorithm with both DMA and IB on a single node
|
2023-03-19 21:53:36 +00:00 |
|
Saeed Maleki
|
2061ea91f7
|
Add allgather_test (#14)
|
2023-03-17 12:55:20 +08:00 |
|
Saeed Maleki
|
2279a690d1
|
mscclpp_net.h is not required anywhere
|
2023-03-14 05:38:15 +00:00 |
|
Changho Hwang
|
29a430e7a8
|
NUMA binding
|
2023-02-23 08:18:12 +00:00 |
|
Changho Hwang
|
48b81edf6d
|
Move some files
|
2023-02-22 11:07:22 +00:00 |
|
Changho Hwang
|
4cfb9b6727
|
GDRCopy support
|
2023-02-22 09:19:40 +00:00 |
|
Changho Hwang
|
7459a08699
|
Rename test code
|
2023-02-22 06:56:25 +00:00 |
|
Changho Hwang
|
368f8f4d24
|
Merge branch 'saemal/cleanup' into chhwang/p2p-simple
|
2023-02-22 06:54:51 +00:00 |
|
Changho Hwang
|
33e20aceb9
|
IB all-to-all works
|
2023-02-17 11:39:16 +00:00 |
|
Saeed Maleki
|
4f3418aa77
|
more clean up
|
2023-02-16 07:13:04 +00:00 |
|
Changho Hwang
|
8e57fd9896
|
p2p all-to-all works
|
2023-02-13 11:25:20 +00:00 |
|
Saeed Maleki
|
dfe1c4500a
|
test without mpi
|
2023-02-07 22:48:37 +00:00 |
|
Saeed Maleki
|
b9a253f82a
|
Revert "Add transport"
This reverts commit 692e9acd8f.
|
2023-02-07 22:28:14 +00:00 |
|
Changho Hwang
|
692e9acd8f
|
Add transport
|
2023-02-07 13:20:48 +00:00 |
|
Changho Hwang
|
8f7ebe99e3
|
Build into a shared library
|
2023-02-07 07:36:37 +00:00 |
|
Changho Hwang
|
d3f46ebfb9
|
Add bootstrap_test
|
2023-02-06 11:00:01 +00:00 |
|
Changho Hwang
|
b4bd7489f0
|
Move bootstrap components to bootstrap/.
|
2023-02-06 08:02:13 +00:00 |
|
Saeed Maleki
|
f3e6f7fe8b
|
prints the unique id
|
2023-02-06 06:11:29 +00:00 |
|
Saeed Maleki
|
0902ce89c6
|
compiles
|
2023-02-06 05:32:24 +00:00 |
|
Saeed Maleki
|
722786de59
|
even more bootstrap files
|
2023-02-06 04:00:00 +00:00 |
|
v-xiaoxshi
|
200f5637bb
|
more bootstrap files
|
2023-02-04 05:07:48 +00:00 |
|
Changho Hwang
|
82fe0b667d
|
Add a makefile and logging functions
|
2023-02-03 12:29:27 +00:00 |
|