nvbench

mirror of https://github.com/NVIDIA/nvbench.git synced 2026-06-29 18:57:44 +00:00

Files

Oleksandr Pavlyk d4283f77a5 Refactor nvbench-compare timing comparison state

Introduce GpuTimingData, SummaryComparison, ComparisonStats, and
ComparisonRunData to make timing extraction, classification, and run-level
state explicit.

Load sample-time and SM-frequency bulk data from JSON binary output into
GpuTimingData when available, preserving count validation between paired
sample and frequency arrays.

Move GPU timing comparison logic into compare_gpu_timings(), prefer robust
median/IQR data when available, and fall back to mean/stdev summaries otherwise.
Keep missing or invalid noise on the unknown path.

Replace module-level comparison counters and selected-device globals with
per-run data passed into compare_benches(). Update tests to validate timing
classification, bulk-data loading, device pairing, filtered duplicate matching,
and summary counters through the new structures.

2026-06-02 15:04:39 -05:00

nvbench_json

Build multi architecture cuda wheels (#302 )

2026-01-29 01:13:24 +00:00

__init__.py

Build multi architecture cuda wheels (#302 )

2026-01-29 01:13:24 +00:00

nvbench_compare.py

Refactor nvbench-compare timing comparison state

2026-06-02 15:04:39 -05:00

nvbench_histogram.py

Build multi architecture cuda wheels (#302 )