Ziyue Yang
|
b5a48f836c
|
Separate NPKit CPU timestamp access from different blocks for AMD platform (#321)
Reference: https://github.com/ROCm/rccl/pull/1229
|
2024-07-02 19:36:48 +08:00 |
|
Ziyue Yang
|
f29095b3b1
|
Fix NPKit support for AMD (#312)
|
2024-06-14 16:22:14 +08:00 |
|
Ziyue Yang
|
76328fe623
|
Add NPKit GPU event support (#310)
|
2024-06-13 13:59:50 +08:00 |
|
Changho Hwang
|
544ff0c21d
|
ROCm support (#213)
Co-authored-by: Binyang Li <binyli@microsoft.com>
|
2023-11-24 16:41:56 +08:00 |
|
Changho Hwang
|
21eed722af
|
Add license comments (#106)
|
2023-06-25 12:40:12 +08:00 |
|
Changho Hwang
|
60b3dd5a61
|
Bug fixes & resolve warnings (#107)
* Fix a bug in host hashing
* Fix a bug in `HostEpoch::wait()`
* Remove misc warnings
|
2023-06-16 09:31:23 +00:00 |
|
Olli Saarikivi
|
457c422791
|
Remove alloc.h and beef up cuda_utils.hpp (#82)
|
2023-05-24 08:34:18 +00:00 |
|
Olli Saarikivi
|
9f6c48cbf9
|
Format all files
|
2023-05-11 00:23:14 +00:00 |
|
Olli Saarikivi
|
ccf45b33a2
|
Delete old init code and other C-style code
|
2023-05-10 22:03:42 +00:00 |
|