Binyang Li
|
88d28e07a7
|
Select algo according to json config (#396)
The way to run nccl-test over mscclpp:
mpirun -np 8 --bind-to numa --allow-run-as-root -x
LD_PRELOAD=$(pwd)/build/apps/nccl/libmscclpp_nccl.so -x NCCL_DEBUG=WARN
-x MSCCLPP_EXECUTION_PLAN_DIR=/execution-files
/root/nccl-tests/build/all_reduce_perf -b 1K -e 1G -f 2 -d half -G 20 -w
10 -n 20
|
2024-12-03 22:39:20 +00:00 |
|
Binyang Li
|
b30bb260e3
|
Tune threads per block for mscclpp executor (#345)
|
2024-09-18 17:21:47 -07:00 |
|
Ziyue Yang
|
76328fe623
|
Add NPKit GPU event support (#310)
|
2024-06-13 13:59:50 +08:00 |
|
Binyang Li
|
6226556ce2
|
Optimized the execution kernel (#294)
|
2024-05-03 11:54:50 -07:00 |
|
Binyang Li
|
64d837f9ab
|
Add executor to execute schedule-plan file (#283)
Add executor to execute the JSON schedule file generated by msccl-tools
---------
Co-authored-by: Changho Hwang <changhohwang@microsoft.com>
|
2024-04-18 19:10:41 +00:00 |
|