Files
Binyang Li 7c390fffd6 Expose NVLS multicast granularity option for GpuBuffer (#815)
Add a public Granularity enum (MultiCastMinimum, MultiCastRecommended)
and let GpuBuffer choose the NVLS multicast allocation granularity via a
constructor argument, defaulting to MultiCastMinimum to minimize memory
usage. Expose the same option through the C++ and Python (nanobind)
APIs.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-06-04 13:16:18 -07:00
..
2026-04-07 13:37:02 -07:00
2026-04-13 09:59:42 -07:00