Allison Piper
|
c8909c7d1b
|
Refactoring / renaming.
|
2025-05-02 20:54:55 +00:00 |
|
Allison Piper
|
a2bf266e16
|
Rename some space -> spaces for clarity.
|
2025-05-02 18:53:09 +00:00 |
|
Allison Piper
|
d7989ddf1e
|
Docs update.
|
2025-05-02 17:59:03 +00:00 |
|
Allison Piper
|
0fae0058b6
|
Rename header to match class name
|
2025-05-02 17:43:35 +00:00 |
|
Allison Piper
|
498c73d4e8
|
Updated some docs.
|
2025-05-02 17:43:18 +00:00 |
|
Allison Piper
|
c9d92ebffa
|
Merge remote-tracking branch 'origin/main' into fea/axes_iteration_space
|
2025-05-02 17:08:04 +00:00 |
|
Allison Piper
|
249a74f73b
|
Bump CI to CTK 12.9, regen devcontainers. (#219)
|
2025-05-02 12:05:50 -04:00 |
|
Allison Piper
|
cca9308dd6
|
Update docs, whitespace.
|
2025-05-01 17:40:04 +00:00 |
|
Allison Piper
|
4bd5690cd3
|
Fix warnings on clang.
|
2025-05-01 17:27:41 +00:00 |
|
Allison Piper
|
4defa02f51
|
Precommit reformatting.
|
2025-05-01 16:53:52 +00:00 |
|
Allison Piper
|
250d755bd6
|
Update new test to support device-init changes.
|
2025-05-01 16:40:09 +00:00 |
|
Allison Piper
|
edefcd0f6a
|
Update fmt usage for new version.
|
2025-05-01 16:38:31 +00:00 |
|
Allison Piper
|
62cbdc5ff9
|
Reduce runtime of gaussian iteration example.
|
2025-05-01 16:37:53 +00:00 |
|
Allison Piper
|
5016bec99e
|
Merge remote-tracking branch 'origin/main' into fea/axes_iteration_space
|
2025-05-01 16:36:56 +00:00 |
|
Allison Piper
|
9d189280de
|
Fix get_config_count for CPU-only benchmarks. (#218)
|
2025-05-01 12:34:35 -04:00 |
|
Allison Piper
|
50e764a308
|
Merge remote-tracking branch 'origin/main' into fea/axes_iteration_space
|
2025-05-01 15:14:58 +00:00 |
|
Sergey Pavlov
|
433376fd83
|
Restrict stopping criterion parameter usage in command line (#174)
* restrict stopping criterion parameter usage in command line
* Update docs for stopping criterion.
* Add convenience benchmark_base API for criterion params.
* Add more test cases for stopping criterion parsing.
---------
Co-authored-by: Sergey Pavlov <psvvsp89@gmail.com>
Co-authored-by: Allison Piper <alliepiper16@gmail.com>
|
2025-04-30 15:53:45 -04:00 |
|
Elias Stehle
|
ca0e795b46
|
Merge pull request #113 from elstehle/fix/per-device-stream
Fixes cudaErrorInvalidValue when running on nvbench-created cuda stream
|
2025-04-30 15:40:33 -04:00 |
|
Allison Piper
|
4879607c70
|
Merge pull request #216 from alliepiper/disable_throttle_for_sync
Disable throttling when `sync` exec tag is used.
|
2025-04-24 19:02:39 -04:00 |
|
Allison Piper
|
e4057575c7
|
Disable throttling when sync exec tag is used.
|
2025-04-24 22:48:35 +00:00 |
|
Allison Piper
|
0573ffa9bd
|
Merge pull request #214 from PointKernel/fix-throttle-setters
Fix throttle setter return values and update customization example
|
2025-04-24 13:53:20 -04:00 |
|
Yunsong Wang
|
dbd12f61b8
|
Revert example change
|
2025-04-24 10:12:46 -07:00 |
|
Allison Piper
|
2938a94d49
|
Merge pull request #215 from alliepiper/dynamic_throttle_delay
Dynamically increase recovery delay for consecutive discards.
|
2025-04-24 10:32:45 -04:00 |
|
Allison Piper
|
d12614b5cb
|
Dynamically increase recovery delay for consecutive discards.
|
2025-04-24 14:11:31 +00:00 |
|
Yunsong Wang
|
797f91bc7e
|
Update example to show to customize throttle threshold
|
2025-04-23 14:10:16 -07:00 |
|
Yunsong Wang
|
31efce1ec8
|
Fix throttle setters
|
2025-04-23 14:01:56 -07:00 |
|
Allison Piper
|
89bec09b82
|
Merge pull request #207 from alliepiper/throttle_followup
Throttling followup
|
2025-04-18 08:48:41 -04:00 |
|
Allison Piper
|
46ab283d02
|
Merge pull request #213 from alliepiper/version_prefix_fix
Use the new(ish) PREFIX option of rapids-cmake version, git revision header utils.
|
2025-04-15 17:17:42 -04:00 |
|
Allison Piper
|
109449438b
|
Use the new(ish) PREFIX option of rapids-cmake version, git revision header utils.
Generate macros prefixed with NVBENCH instead of redefining them from NVBench.
|
2025-04-15 20:33:06 +00:00 |
|
Allison Piper
|
eadb913322
|
Merge pull request #211 from alliepiper/clock_api
Fetch clock rates using cudaDeviceGetAttribute.
|
2025-04-14 17:12:42 -04:00 |
|
Allison Piper
|
0c56311174
|
Fetch clock rates using cudaDeviceGetAttribute.
|
2025-04-14 16:59:54 -04:00 |
|
Allison Piper
|
9bf5e987cf
|
Merge branch 'main' into throttle_followup
|
2025-04-14 15:29:44 -04:00 |
|
Allison Piper
|
33fc77aabc
|
Merge pull request #210 from alliepiper/vdc_update
Update verify-devcontainers workflow to match CCCL.
|
2025-04-14 14:50:20 -04:00 |
|
Allison Piper
|
457b9f1064
|
Update verify-devcontainers workflow to match CCCL.
This prevents us from spawning a ton of jobs unless the devcontainers actually change.
|
2025-04-14 14:37:40 -04:00 |
|
Allison Piper
|
965a80f730
|
Formatting.
|
2025-04-14 18:07:27 +00:00 |
|
Allison Piper
|
931888116c
|
Merge branch 'main' into throttle_followup
|
2025-04-14 14:06:39 -04:00 |
|
Allison Piper
|
2c2f40a659
|
Merge pull request #209 from alliepiper/pre-commit-ci
Add pre-commit.ci configs, format.
|
2025-04-14 14:05:48 -04:00 |
|
Allison Piper
|
47bd2838da
|
Remove stale devcontainer.
|
2025-04-14 17:50:18 +00:00 |
|
Allison Piper
|
4c38b2d5f7
|
Clang-format doesn't like the 1'000'000 separators.
|
2025-04-14 17:44:31 +00:00 |
|
Allison Piper
|
a3a2337e04
|
Merge pull request #208 from alliepiper/drop-support-for-11.8
Remove coverage for 11.8.
|
2025-04-14 13:41:54 -04:00 |
|
Allison Piper
|
8cefac8463
|
Update blame-ignore file.
|
2025-04-14 17:31:13 +00:00 |
|
Allison Piper
|
3440855dbd
|
Formatting updates.
|
2025-04-14 17:26:12 +00:00 |
|
Allison Piper
|
de36f1a248
|
Add pre-commit.ci configs.
|
2025-04-14 12:23:44 -04:00 |
|
Allison Piper
|
b89c36a5c2
|
Remove coverage for 11.8.
We're going to be dropping these devcontainers soon in CCCL, and they're causing issues with our pre-commit hooks.
|
2025-04-14 16:03:49 +00:00 |
|
Allison Piper
|
7d5f04ec02
|
Show SM clock info in summaries example.
|
2025-04-14 11:37:48 -04:00 |
|
Allison Piper
|
f2011f2281
|
Add new hidden summary with percent sm clock scaling,
|
2025-04-14 11:37:20 -04:00 |
|
Allison Piper
|
e0a486b03b
|
Reduce memory usage of clock rate logging.
|
2025-04-14 11:35:27 -04:00 |
|
Allison Piper
|
18926ced87
|
Replace references to peak_sm_clock with default_sm_clock.
The actual measured clock speed can exceed this value, so default is less confusing than peak.
|
2025-04-14 11:33:04 -04:00 |
|
Allison Piper
|
87dd03254f
|
Merge pull request #206 from gevtushenko/throttle
Discard measurements while GPU is throttling
|
2025-04-14 10:57:33 -04:00 |
|
Georgy Evtushenko
|
254ac2517f
|
Remove discard on throttle option
|
2025-04-12 21:13:13 -07:00 |
|