Mirror of https://github.com/microsoft/mscclpp.git (synced 2026-06-06 07:52:00 +00:00)
The `nop` instruction synchronizes only within a single threadblock; cross-threadblock synchronization is handled by the `barrier` instruction. Insert a `nop` only when the dependency is within the same threadblock.
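
The rule above can be illustrated with a minimal Python sketch. The `Op` class and the `needs_nop` and `insert_nops` helpers are hypothetical names made up for this example and are not part of the MSCCL++ code base. Placing the `nop` directly before the dependent operation is also an assumption. Cross-threadblock dependencies are left untouched here, since those would be covered by `barrier`, which this sketch does not model.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Op:
    """Hypothetical stand-in for a scheduled instruction (not an MSCCL++ type)."""
    name: str
    threadblock: int
    depends_on: list[Op] = field(default_factory=list)


def needs_nop(op: Op) -> bool:
    """True if at least one dependency of `op` runs in the same threadblock."""
    return any(dep.threadblock == op.threadblock for dep in op.depends_on)


def insert_nops(ops: list[Op]) -> list[Op]:
    """Return a new list with a `nop` before each op that has a
    same-threadblock dependency. Cross-threadblock dependencies are skipped."""
    out: list[Op] = []
    for op in ops:
        if needs_nop(op):
            # Assumption: the nop is emitted in the dependent op's threadblock.
            out.append(Op("nop", op.threadblock))
        out.append(op)
    return out


put = Op("put", threadblock=0)
copy = Op("copy", threadblock=0, depends_on=[put])       # same threadblock
reduce_ = Op("reduce", threadblock=1, depends_on=[put])  # different threadblock

names = [(o.name, o.threadblock) for o in insert_nops([put, copy, reduce_])]
print(names)
```

In this example only `copy` gets a preceding `nop`, because its dependency `put` runs in threadblock 0 as well; `reduce` depends on an operation in another threadblock, so no `nop` is added for it.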