Changho Hwang
2eadbaf86f
python doc auto generation ( #605 )
...
Add Python API references
2025-08-11 10:34:29 -07:00
Changho Hwang
9650e5c37e
Update documentation ( #576 )
...
Documentation overhaul
2025-08-07 15:37:37 -07:00
Binyang Li
4136153a76
[Doc] mscclpp docs ( #348 )
...
Generate docs for mescclpp.
Setup github action to auto-deploy github-page
doc link here: https://microsoft.github.io/mscclpp
---------
Co-authored-by: Changho Hwang <changhohwang@microsoft.com >
Co-authored-by: Caio Rocha <caiorocha@microsoft.com >
2024-10-18 06:08:31 +00:00
caiomcbr
b1b9d0626c
Support NCCL APIs ( #319 )
...
Start supporting NCCL APIs with a few limitations.
---------
Co-authored-by: Caio Rocha <caio.rocha@microsoft.com >
Co-authored-by: Changho Hwang <changhohwang@microsoft.com >
2024-06-27 23:54:06 +00:00
Angelica Moreira
34f4d9d006
Update quickstart.md ( #314 )
...
Updating the docker image name tag and the python benchmark path.
2024-06-19 22:26:13 +00:00
Changho Hwang
1a7cb98e3a
v0.4.3 ( #279 )
2024-03-27 11:53:09 -07:00
Changho Hwang
cdaf3aea3d
New packet format & optimizations ( #256 )
...
Co-authored-by: Binyang Li <binyli@microsoft.com >
2024-02-20 20:01:37 -08:00
Binyang Li
f1b2c9df12
Fix performance downgrade issue & update doc ( #229 )
...
For push function, we only need to make sure the instruction `st.global`
will be executed after the while loop. Since there is a Write-After-Read
hazard for `trigger.fst` (Check `this->triggers[curFifoHead % size].fst
!= 0` first then write value to `triggers[curFifoHead % size]`), we can
expect the compiler and hardware can handle this situation correctly.
Remove the `release.sys` there.
BTW, `st.global.release.sys.v2.u64` will cause perf regression issue.
Previous we use `st.global.release.cta.v2.u64`, but seems not necessary.
2023-12-04 10:20:10 -08:00
Changho Hwang
351b95b926
Update documents ( #225 )
...
Adding AMD supports on the docs
2023-11-24 17:00:18 +08:00
Changho Hwang
15f6dcca49
Update documentation ( #217 )
...
Co-authored-by: Saeed Maleki <saemal@microsoft.com >
2023-11-22 12:58:04 -08:00
Changho Hwang
f68820436c
Explicit build dependency on nvidia_peermem ( #201 )
2023-10-23 04:29:30 +00:00
Changho Hwang
3df18d20a3
Update install guidelines ( #159 )
2023-08-30 10:40:40 -07:00
Changho Hwang
4114d65c60
Documents & minor updates ( #119 )
...
Co-authored-by: Saeed Maleki <saemal@microsoft.com >
Co-authored-by: Binyang Li <binyli@microsoft.com >
2023-07-07 17:35:05 +08:00
Changho Hwang
85e664c2f7
Update docs ( #88 )
2023-06-05 13:13:10 +08:00