Update best_practices.md

This commit is contained in:
Yunsong Wang
2025-09-19 12:58:37 -07:00
committed by GitHub
parent df27dcb5df
commit 54cbcd0c42

View File

@@ -6,8 +6,7 @@ This document is **not intended to replace** the detailed benchmark documentatio
## NVBench
* Purpose-built for CUDA GPU workloads.
* Provides GPU-aware features: warmup runs, synchronization, throughput/latency metrics, and parameter sweeps, etc.
* Provides GPU-aware features: runtime customization, throughput calculations, and parameter sweeps, etc.
* Produces machine-readable output (JSON, CSV) suitable for regression tracking and CI pipelines.
* The natural choice for benchmarking GPU-accelerated code.
* Also supports benchmarking normal CPU implementations.