Files
mscclpp/python
Binyang Li a6e00cc449 remove unnecessary sync (#461)
`nop` instruction is only for synchronization within the same
threadblock. Cross threadblock synchronization is handled by `barrier`
instruction. So insert `nop` only if the dependency is within the same
threadblock.
2025-02-10 15:31:49 +08:00
..
2025-02-10 15:31:49 +08:00
2024-03-27 11:53:09 -07:00
2024-03-27 11:53:09 -07:00
2024-09-15 22:30:54 +00:00