Commit Graph

80 Commits

Author SHA1 Message Date
Changho Hwang
dc41c58769 Alloc proxy states on demand 2023-03-14 10:05:56 +00:00
Changho Hwang
c2859d258c Use aligned ld/st 2023-03-14 09:22:28 +00:00
Changho Hwang
135520a14a cleanups 2023-03-14 09:21:52 +00:00
Changho Hwang
aacee9727b trigger wrappers 2023-03-14 09:14:51 +00:00
Changho Hwang
75ec82d257 Store fifo tail in proxy state 2023-03-14 09:00:38 +00:00
Changho Hwang
e89d154503 Check run state periodically 2023-03-14 08:38:55 +00:00
Changho Hwang
ae01fa4958 Remove mscclpp_net.h and net.h 2023-03-14 08:32:19 +00:00
Saeed Maleki
2279a690d1 mscclpp_net.h is not required anywhere 2023-03-14 05:38:15 +00:00
Saeed Maleki
e000eb9177 some compilation clean up 2023-03-14 05:26:54 +00:00
Saeed Maleki
2bcf418b13 more comments 2023-03-14 03:36:38 +00:00
Saeed Maleki
ab9298d6e0 fixed the bits for trigger 2023-03-13 23:21:27 +00:00
Changho Hwang
9b124cabdb cleanup 2023-03-13 14:27:29 +00:00
Changho Hwang
86dd8e2e17 Remove unused macro 2023-03-13 14:21:37 +00:00
Changho Hwang
e357beef00 One fifo per proxy 2023-03-13 14:19:36 +00:00
Saeed Maleki
ea7134549e vector instructions for trigger 2023-03-13 07:02:26 +00:00
Changho Hwang
1be76d128d 128-bit trigger 2023-03-10 10:49:36 +00:00
Ubuntu
8f2831330c a few todos + some clean up in the test 2023-03-10 04:34:14 +00:00
Changho Hwang
38a2adfe97 Single IB proxy per IB dev 2023-03-09 05:11:08 +00:00
Changho Hwang
e3fde881bc Single P2P proxy per GPU 2023-03-09 04:43:56 +00:00
Saeed Maleki
8c4ab6dad6 merged with main 2023-03-09 02:06:04 +00:00
Saeed Maleki
6c3197a492 fifo for ib works as well 2023-03-09 02:02:14 +00:00
Saeed Maleki
160060ec77 fifo works now 2023-03-08 20:10:09 +00:00
Changho Hwang
5403508ac4 Type bug fix 2023-03-07 12:41:56 +00:00
Changho Hwang
d9f97e63ee Merge branch 'main' into chhwang/ib-proxy-merge 2023-03-07 13:47:04 +08:00
Changho Hwang
881dfcd6f2 Add a TODO 2023-03-07 05:44:57 +00:00
Changho Hwang
1a382a8e1d Fix fifo triggers 2023-03-07 03:31:26 +00:00
Changho Hwang
1c52143c90 Merge branch 'main' into chhwang/proxy-run-states 2023-03-07 10:09:47 +08:00
Saeed Maleki
3e4c45d73a compiles 2023-03-06 20:36:54 +00:00
Saeed Maleki (saemal)
dced4c4c14 done with the design 2023-03-06 12:26:46 -08:00
Saeed Maleki (saemal)
b663469bcd making a fifo for proxy threads 2023-03-06 10:58:12 -08:00
Saeed Maleki (saemal)
ae7407146b removing unnecessary stat probe for cuda graph 2023-03-06 08:03:44 -08:00
Changho Hwang
0440096954 Add proxy run states 2023-03-06 08:34:24 +00:00
Changho Hwang
73b8d02089 Merge two IB proxy modes 2023-03-06 08:19:31 +00:00
Saeed Maleki
0216ceb34e added todos 2023-03-06 08:04:33 +00:00
Changho Hwang
5ac2ea6e9f IB more fixes 2023-03-06 07:01:03 +00:00
Saeed Maleki
7e4bacf20c works 2023-03-03 23:10:41 +00:00
Saeed Maleki (saemal)
3def04e72d separating dma data/flag/sync logic 2023-03-03 15:05:54 -08:00
Changho Hwang
2c10142c89 IB fixes 2023-03-03 08:37:23 +00:00
Changho Hwang
9e5573f16b Misc changes and comments 2023-03-03 08:32:47 +00:00
Changho Hwang
9674830db2 Change flags into uint64_t 2023-03-03 07:41:34 +00:00
Changho Hwang
89b89fbae5 Erase duplicate TODO 2023-03-03 07:31:08 +00:00
Saeed Maleki
ac1bf6dc52 cudagraph now works with p2p proxy 2023-02-28 23:25:51 +00:00
Changho Hwang
3d051a985f Add p2p proxy: doesn't work with cuda graph yet 2023-02-28 18:58:03 +00:00
madanm
e67024a9d7 initial interface proposal for zero-copy push communication 2023-02-27 14:16:13 -08:00
Changho Hwang
a78b78aa43 Erase unnecessary memsets 2023-02-23 09:42:55 +00:00
Changho Hwang
29a430e7a8 NUMA binding 2023-02-23 08:18:12 +00:00
Saeed Maleki
1a528a3aa3 merged with main 2023-02-22 23:16:26 +00:00
Saeed Maleki
bca3362c12 a few clean ups 2023-02-22 22:43:18 +00:00
Changho Hwang
48b81edf6d Move some files 2023-02-22 11:07:22 +00:00
Changho Hwang
89ca0451a8 Support incremental flag & add perf test 2023-02-22 10:55:35 +00:00