Commit Graph

989 Commits

Author SHA1 Message Date
empyreus 8fb751470b add multi node 2026-04-07 17:15:05 +00:00
empyreus 88e1ac71c7 fix paths 2026-04-06 20:53:39 +00:00
empyreus ea97444a8d change path 2026-04-06 19:44:14 +00:00
empyreus 68cf67d24e unit test 2026-04-06 18:12:46 +00:00
empyreus 58c5234243 ignore version mismatch 2026-04-03 23:20:11 +00:00
empyreus e68125f270 change to h100 machine 2026-04-03 22:20:22 +00:00
empyreus e8266a1794 running on a100 2026-04-03 15:08:12 +00:00
empyreus 53d6f76a24 simplify container 2026-04-03 14:07:46 +00:00
empyreus 7b03ece609 add prints 2026-04-03 14:02:44 +00:00
empyreus 10648a42c5 add --priveldged 2026-04-02 20:35:15 +00:00
empyreus 149be8e828 fix - 2026-04-02 19:47:30 +00:00
empyreus 376a6a299d rmobe build 2026-04-02 19:27:09 +00:00
empyreus 61e0540cbc update for new cmake 2026-04-02 17:55:58 +00:00
empyreus 6fd8b18e83 change cmake version 2026-04-02 17:48:43 +00:00
empyreus 32e2b6464d try new msccl 2026-04-02 17:15:27 +00:00
empyreus faa600c3be Retry 2026-04-02 16:53:10 +00:00
empyreus dba58416b9 comment out tests 2026-04-02 16:32:41 +00:00
empyreus 7fddcaf89d reset msccl 2026-04-02 16:31:50 +00:00
empyreus 827ca0935c fully remove msccl isntall step 2026-04-02 14:23:03 +00:00
empyreus 8e4ddc15ff remove msccl tests 2026-04-02 14:20:40 +00:00
empyreus 566cf93349 remove mscl build 2026-04-02 13:56:15 +00:00
empyreus 7948682dfa fix 2026-04-01 21:40:15 +00:00
empyreus e4244c4466 fix clone 2026-04-01 21:19:04 +00:00
empyreus d6dd64f463 fix copy 2026-04-01 20:35:21 +00:00
empyreus 3196758efb remove container 2026-04-01 18:29:43 +00:00
empyreus ea8e6af959 fix missing container 2026-04-01 17:40:30 +00:00
empyreus 2080baad44 try new removal 2026-04-01 17:17:19 +00:00
empyreus 4f37637507 fixes 2026-04-01 16:09:53 +00:00
empyreus 131f128b6a comment out old docker pull 2026-04-01 15:37:28 +00:00
empyreus d7b0dd627e trying to rework image pull 2026-04-01 15:31:32 +00:00
empyreus f5159b0e16 check if pipeline needs creation 2026-04-01 14:45:59 +00:00
empyreus 503647c128 fix missing quote 2026-03-31 20:38:55 +00:00
empyreus 36c496dc98 readd tests 2026-03-31 19:05:37 +00:00
empyreus 49aeea0660 fix container deletion 2026-03-31 19:05:11 +00:00
empyreus 0c8f4fd583 find directory 2026-03-31 18:55:22 +00:00
empyreus 80194b2803 fix directory 2026-03-31 16:28:35 +00:00
empyreus 48a6a2e441 add sglang all_reduce 2026-03-31 15:47:36 +00:00
empyreus f938f60505 update sglang-test 2026-03-30 18:08:25 +00:00
empyreus a22104c391 add remaining tests 2026-03-30 17:00:18 +00:00
empyreus 83d9301e24 full run 2026-03-27 22:20:05 +00:00
empyreus 6ac12fa1d5 comment out to fix pipeline 2026-03-27 22:16:06 +00:00
empyreus a9d7bd8918 fix 2026-03-27 22:14:51 +00:00
empyreus f171663d4e fix batch size 2026-03-27 21:57:07 +00:00
empyreus 38552a6f9c fix remote run and clean up files 2026-03-27 21:13:07 +00:00
empyreus 324254d57c finish adding sglang steps 2026-03-27 20:28:45 +00:00
empyreus 4107fa9644 fix run remote 2026-03-27 20:05:17 +00:00
empyreus 35148991e8 fix cmake 2026-03-27 19:32:47 +00:00
empyreus fa30289415 update for new remote run 2026-03-26 23:51:07 +00:00
empyreus 0a6d329bb8 add sshke 2026-03-26 23:37:14 +00:00
empyreus e423ca8952 rename files 2026-03-26 23:34:42 +00:00