Update best_practices.md

This commit is contained in:
Yunsong Wang
2025-09-19 12:59:20 -07:00
committed by GitHub
parent 54cbcd0c42
commit b95acb80d4

View File

@@ -12,7 +12,7 @@ This document is **not intended to replace** the detailed benchmark documentatio
* Also supports benchmarking normal CPU implementations.
* Python code support is in the roadmap.
## Benchmark Your GPU Code with NVBench
## Benchmarking GPU Workloads with NVBench
Lets begin with a simple example for users who are new to NVBench and want to learn the basics of benchmarking GPU code. Consider measuring the performance of `thrust::sequence` on a GPU. Similar to `std::iota`, suppose we have an input array of 10 elements, and we want `thrust::sequence` to populate it with the sequence of values from 0 to 9. The following example demonstrates this approach:
```cpp