Changho Hwang
|
dc41c58769
|
Alloc proxy states on demand
|
2023-03-14 10:05:56 +00:00 |
|
Changho Hwang
|
c2859d258c
|
Use aligned ld/st
|
2023-03-14 09:22:28 +00:00 |
|
Changho Hwang
|
135520a14a
|
cleanups
|
2023-03-14 09:21:52 +00:00 |
|
Changho Hwang
|
aacee9727b
|
trigger wrappers
|
2023-03-14 09:14:51 +00:00 |
|
Changho Hwang
|
75ec82d257
|
Store fifo tail in proxy state
|
2023-03-14 09:00:38 +00:00 |
|
Changho Hwang
|
e89d154503
|
Check run state periodically
|
2023-03-14 08:38:55 +00:00 |
|
Changho Hwang
|
ae01fa4958
|
Remove mscclpp_net.h and net.h
|
2023-03-14 08:32:19 +00:00 |
|
Saeed Maleki
|
2279a690d1
|
mscclpp_net.h is not required anywhere
|
2023-03-14 05:38:15 +00:00 |
|
Saeed Maleki
|
e000eb9177
|
some compilation clean up
|
2023-03-14 05:26:54 +00:00 |
|
Saeed Maleki
|
2bcf418b13
|
more comments
|
2023-03-14 03:36:38 +00:00 |
|
Saeed Maleki
|
ab9298d6e0
|
fixed the bits for trigger
|
2023-03-13 23:21:27 +00:00 |
|
Changho Hwang
|
9b124cabdb
|
cleanup
|
2023-03-13 14:27:29 +00:00 |
|
Changho Hwang
|
86dd8e2e17
|
Remove unused macro
|
2023-03-13 14:21:37 +00:00 |
|
Changho Hwang
|
e357beef00
|
One fifo per proxy
|
2023-03-13 14:19:36 +00:00 |
|
Saeed Maleki
|
ea7134549e
|
vector instructions for trigger
|
2023-03-13 07:02:26 +00:00 |
|
Changho Hwang
|
1be76d128d
|
128-bit trigger
|
2023-03-10 10:49:36 +00:00 |
|
Ubuntu
|
8f2831330c
|
a few todos + some clean up in the test
|
2023-03-10 04:34:14 +00:00 |
|
Changho Hwang
|
38a2adfe97
|
Single IB proxy per IB dev
|
2023-03-09 05:11:08 +00:00 |
|
Changho Hwang
|
e3fde881bc
|
Single P2P proxy per GPU
|
2023-03-09 04:43:56 +00:00 |
|
Saeed Maleki
|
8c4ab6dad6
|
merged with main
|
2023-03-09 02:06:04 +00:00 |
|
Saeed Maleki
|
6c3197a492
|
fifo for ib works as well
|
2023-03-09 02:02:14 +00:00 |
|
Saeed Maleki
|
160060ec77
|
fifo works now
|
2023-03-08 20:10:09 +00:00 |
|
Changho Hwang
|
5403508ac4
|
Type bug fix
|
2023-03-07 12:41:56 +00:00 |
|
Changho Hwang
|
d9f97e63ee
|
Merge branch 'main' into chhwang/ib-proxy-merge
|
2023-03-07 13:47:04 +08:00 |
|
Changho Hwang
|
881dfcd6f2
|
Add a TODO
|
2023-03-07 05:44:57 +00:00 |
|
Changho Hwang
|
1a382a8e1d
|
Fix fifo triggers
|
2023-03-07 03:31:26 +00:00 |
|
Changho Hwang
|
1c52143c90
|
Merge branch 'main' into chhwang/proxy-run-states
|
2023-03-07 10:09:47 +08:00 |
|
Saeed Maleki
|
3e4c45d73a
|
compiles
|
2023-03-06 20:36:54 +00:00 |
|
Saeed Maleki (saemal)
|
dced4c4c14
|
done with the design
|
2023-03-06 12:26:46 -08:00 |
|
Saeed Maleki (saemal)
|
b663469bcd
|
making a fifo for proxy threads
|
2023-03-06 10:58:12 -08:00 |
|
Saeed Maleki (saemal)
|
ae7407146b
|
removing unnecessary stat probe for cuda graph
|
2023-03-06 08:03:44 -08:00 |
|
Changho Hwang
|
0440096954
|
Add proxy run states
|
2023-03-06 08:34:24 +00:00 |
|
Changho Hwang
|
73b8d02089
|
Merge two IB proxy modes
|
2023-03-06 08:19:31 +00:00 |
|
Saeed Maleki
|
0216ceb34e
|
added todos
|
2023-03-06 08:04:33 +00:00 |
|
Changho Hwang
|
5ac2ea6e9f
|
IB more fixes
|
2023-03-06 07:01:03 +00:00 |
|
Saeed Maleki
|
7e4bacf20c
|
works
|
2023-03-03 23:10:41 +00:00 |
|
Saeed Maleki (saemal)
|
3def04e72d
|
separating dma data/flag/sync logic
|
2023-03-03 15:05:54 -08:00 |
|
Changho Hwang
|
2c10142c89
|
IB fixes
|
2023-03-03 08:37:23 +00:00 |
|
Changho Hwang
|
9e5573f16b
|
Misc changes and comments
|
2023-03-03 08:32:47 +00:00 |
|
Changho Hwang
|
9674830db2
|
Change flags into uint64_t
|
2023-03-03 07:41:34 +00:00 |
|
Changho Hwang
|
89b89fbae5
|
Erase duplicate TODO
|
2023-03-03 07:31:08 +00:00 |
|
Saeed Maleki
|
ac1bf6dc52
|
cudagraph now works with p2p proxy
|
2023-02-28 23:25:51 +00:00 |
|
Changho Hwang
|
3d051a985f
|
Add p2p proxy: doesn't work with cuda graph yet
|
2023-02-28 18:58:03 +00:00 |
|
madanm
|
e67024a9d7
|
initial interface proposal for zero-copy push communication
|
2023-02-27 14:16:13 -08:00 |
|
Changho Hwang
|
a78b78aa43
|
Erase unnecessary memsets
|
2023-02-23 09:42:55 +00:00 |
|
Changho Hwang
|
29a430e7a8
|
NUMA binding
|
2023-02-23 08:18:12 +00:00 |
|
Saeed Maleki
|
1a528a3aa3
|
merged with main
|
2023-02-22 23:16:26 +00:00 |
|
Saeed Maleki
|
bca3362c12
|
a few clean ups
|
2023-02-22 22:43:18 +00:00 |
|
Changho Hwang
|
48b81edf6d
|
Move some files
|
2023-02-22 11:07:22 +00:00 |
|
Changho Hwang
|
89ca0451a8
|
Support incremental flag & add perf test
|
2023-02-22 10:55:35 +00:00 |
|