nvbench

mirror of https://github.com/NVIDIA/nvbench.git synced 2026-05-12 17:25:41 +00:00

Author	SHA1	Message	Date
Oleksandr Pavlyk	e53a1a2654	Use median and IR/relative as cmp_time/ref_time and cmp_noise/ref_noise These measures are less sensitive to outliers	2026-05-04 16:14:56 -05:00
Oleksandr Pavlyk	ea592b6444	Tweaks for nvbench_compare 1. For JSON files that contains repeated measurements of run-time axis values, make sure that scripts compares corresponding reference entries. If cmp had two states with the same name and ref had two, we would compare measurements for each state in cmp against the first state in ref. Change here introduces counters tracking how many times each particular axis value, and retrieve corresponding entry in ref. Previously, I had ``` \| BlockSize \| NumBlocks \| Ref Time \| Ref Noise \| Cmp Time \| Cmp Noise \| Diff \| %Diff \| Status \| \|-------------\|-------------\|------------\|-------------\|------------\|-------------\|-----------\|---------\|----------\| \| 2^8 \| 64 \| 1.776 ms \| 0.46% \| 1.777 ms \| 0.40% \| 1.024 us \| 0.06% \| SAME \| \| 2^8 \| 64 \| 1.776 ms \| 0.46% \| 1.774 ms \| 0.52% \| -2.048 us \| -0.12% \| SAME \| \| 2^8 \| 64 \| 1.776 ms \| 0.46% \| 1.773 ms \| 0.52% \| -3.072 us \| -0.17% \| SAME \| \| 2^8 \| 64 \| 1.776 ms \| 0.46% \| 1.774 ms \| 0.58% \| -2.048 us \| -0.12% \| SAME \| \| 2^8 \| 64 \| 1.776 ms \| 0.46% \| 1.773 ms \| 0.58% \| -3.072 us \| -0.17% \| SAME \| ``` and now it becomes ``` \| BlockSize \| NumBlocks \| Ref Time \| Ref Noise \| Cmp Time \| Cmp Noise \| Diff \| %Diff \| Status \| \|-------------\|-------------\|------------\|-------------\|------------\|-------------\|-----------\|---------\|----------\| \| 2^8 \| 64 \| 1.776 ms \| 0.46% \| 1.777 ms \| 0.40% \| 1.024 us \| 0.06% \| SAME \| \| 2^8 \| 64 \| 1.773 ms \| 0.64% \| 1.774 ms \| 0.52% \| 1.024 us \| 0.06% \| SAME \| \| 2^8 \| 64 \| 1.774 ms \| 0.46% \| 1.773 ms \| 0.52% \| -1.024 us \| -0.06% \| SAME \| \| 2^8 \| 64 \| 1.773 ms \| 0.46% \| 1.774 ms \| 0.58% \| 1.024 us \| 0.06% \| SAME \| \| 2^8 \| 64 \| 1.774 ms \| 0.52% \| 1.773 ms \| 0.58% \| -1.024 us \| -0.06% \| SAME \| ``` With the following raw data expected ``` (py313) opavlyk@NV-22T4X34:~/repos/nvbench$ jq '. \| .benchmarks[] \| .states[] \| .summaries[] \| select(.tag == "nv/cold/time/gpu/median") \| .data[] \| .value' base.json "0.0017756160497665405" "0.0017725440263748169" "0.001773568034172058" "0.0017725440263748169" "0.001773568034172058" (py313) opavlyk@NV-22T4X34:~/repos/nvbench$ jq '. \| .benchmarks[] \| .states[] \| .summaries[] \| select(.tag == "nv/cold/time/gpu/median") \| .data[] \| .value' test.json "0.0017766400575637818" "0.001773568034172058" "0.0017725440263748169" "0.001773568034172058" "0.0017725440263748169" ``` 2. nvbench_compare changes from using min_noise = min(ref_noise, cmp_noise) to using max_noise = max(ref_noise, cmp_noise) Using larger of ref and cmp noise level as a reference against which to gauge timing difference ratio makes more sense.	2026-05-04 16:14:56 -05:00
Oleksandr Pavlyk	b0a46f44c2	Modularize color handling (#336 ) * Introduce function colorize to modularize colorization/no-color handling * Use sns.set_theme instead of deprecated sns.set() * Use str.format instead of legacy % syntax * Simplified iteration over list Use f-string (supported since Python 3.6) instead of str.format for better readability and performance	2026-04-14 08:09:44 -05:00
Nader Al Awar	7a68e53df0	Rename flag from markdown to no-color	2026-04-01 17:01:29 -05:00
Nader Al Awar	7e5e784855	Add --markdown flag to nvbench_compare.py which can be use for github issues/prs	2026-04-01 14:53:13 -05:00
Bernhard Manfred Gruber	4164909c52	Feedback	2026-02-28 01:19:18 +01:00
Bernhard Manfred Gruber	0abc8ec82b	Extend nvbench_compare.py with `--plot`, axis/benchmark filtering, and dark mode Co-authored-by: Oleksandr Pavlyk <21087696+oleksandr-pavlyk@users.noreply.github.com>	2026-02-27 11:06:20 +01:00
Bernhard Manfred Gruber	800f640c20	Apply reviewer feedback	2026-02-26 19:23:51 +01:00
Bernhard Manfred Gruber	d3a0bec4a8	Feedback from review	2026-02-05 14:13:16 +01:00
Bernhard Manfred Gruber	28ed32bb47	Implement dark mode using style sheets	2026-02-05 14:00:33 +01:00
Bernhard Manfred Gruber	ec9759037d	I have no idea what I am doing	2026-02-05 11:15:27 +01:00
Bernhard Manfred Gruber	ccde9fc4d4	More	2026-02-05 10:56:36 +01:00
Bernhard Manfred Gruber	0be190b407	Add a script to plot benchmark results	2026-02-05 10:36:52 +01:00
Bernhard Manfred Gruber	c6ef87575c	Allow partial comparison in nvbench_compare.py Fixes: #295	2026-02-03 16:32:11 +01:00
Nader Al Awar	5e7adc5c3f	Build multi architecture cuda wheels (#302 ) * Add cuda architectures to build wheel for * Package scripts in wheel * Separate cuda major version extraction to fix architecutre selection logic * Add back statement printing cuda version * [pre-commit.ci] auto code formatting --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2026-01-29 01:13:24 +00:00

15 Commits