chore(deps): upgrade tinybench to v6#7100

Open
jerome-benoit wants to merge 20 commits into vitest-dev:main from jerome-benoit:chore/tinybench-upgrade

Conversation

@jerome-benoit
Copy link

@jerome-benoit jerome-benoit commented Dec 18, 2024

Description

Upgrade tinybench from v2.9.0 to v6.0.0.

Breaking changes adapted:

  • TaskResult is now a discriminated union — BenchmarkResult redefined as standalone vitest-owned interface with BenchmarkStatistics type and createEmptyStatistics() helper for type-safe defaults
  • Deprecated top-level fields removed (hz, samples, mean, etc.) — using latency.*, throughput.* instead
  • retainSamples option (defaults to false) replaces manual samples.length = 0 clearing — mapped to vitest's includeSamples via centralized getBenchOptions
  • BenchEvent is now a typed class — event handlers use typed event API
  • BenchmarkResult.sampleCount renamed to samplesCount to align with Statistics.samplesCount
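The discriminated-union result shape described above can be sketched roughly as follows. This is an illustrative TypeScript sketch with simplified field sets, not the actual vitest or tinybench definitions; the real BenchmarkStatistics type and createEmptyStatistics() helper live in vitest's source.

```typescript
// Illustrative sketch only: field shapes are simplified, not the exact
// vitest/tinybench v6 definitions referenced in this PR.
interface Statistics {
  mean: number
  min: number
  max: number
  samplesCount: number
  samples: number[]
}

// tinybench v6 reports results as a discriminated union on task state,
// so errored tasks carry no latency/throughput statistics at all.
type TaskResult =
  | { state: 'success'; latency: Statistics; throughput: Statistics }
  | { state: 'errored'; error: Error }

// Type-safe defaults, mirroring the createEmptyStatistics() helper idea.
function createEmptyStatistics(): Statistics {
  return { mean: 0, min: 0, max: 0, samplesCount: 0, samples: [] }
}

// Narrowing on the discriminant replaces reads of the removed top-level
// fields like `hz` and `mean`.
function meanLatency(result: TaskResult): number {
  return result.state === 'success' ? result.latency.mean : 0
}
```

Narrowing on `state` is what makes reading `latency.*` type-safe: the compiler rejects any access to `latency` on an errored result.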

Bug fixes included:

  • Fix complete event handler crash when task errors — tinybench v6 fires complete after error, but errored tasks have no latency property. Added task state guard and error tracking
  • Fix errored benchmarks silently marked as pass — now correctly marked as fail
  • Replace Object.assign(result, task.result) with selective field extraction — prevents tinybench runtime metadata (runtime, runtimeVersion, timestampProviderName) from polluting BenchmarkResult and JSON output
  • Fix comparison display diffFixed check ('1.0.0' → '1.00') with a proper else if chain
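The selective field extraction mentioned above can be sketched as below. The interfaces here are hypothetical stand-ins for illustration; only the metadata field names (runtime, runtimeVersion, timestampProviderName) come from the PR description.

```typescript
// Hypothetical sketch of the "selective field extraction" fix; the real
// field list lives in vitest's benchmark runner.
interface RawTaskResult {
  latency: { mean: number }
  throughput: { mean: number }
  samplesCount: number
  // tinybench runtime metadata that must NOT end up in vitest's result/JSON:
  runtime: string
  runtimeVersion: string
  timestampProviderName: string
}

interface BenchmarkResult {
  latency: { mean: number }
  throughput: { mean: number }
  samplesCount: number
}

// Instead of Object.assign(result, raw), copy only the fields that
// BenchmarkResult owns, so metadata cannot leak into JSON output.
function extractResult(raw: RawTaskResult): BenchmarkResult {
  const { latency, throughput, samplesCount } = raw
  return { latency, throughput, samplesCount }
}
```

The design point is allow-listing: an explicit destructure keeps the output schema stable even if tinybench adds more metadata fields later.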

User-facing type change: BenchmarkResult.sampleCount → BenchmarkResult.samplesCount. The BenchTaskResult re-export is now the tinybench v6 TaskResult discriminated union.

Please don't delete this checklist! Before submitting the PR, please make sure you do the following:

  • It's really useful if your PR references an issue where it is discussed ahead of time. If the feature is substantial or introduces breaking changes without a discussion, the PR might be closed.
  • Ideally, include a test that fails without this PR but passes with it.
  • Please, don't make changes to pnpm-lock.yaml unless you introduce a new test example.
  • Please check Allow edits by maintainers to make review process faster.

Tests

  • Run the tests with pnpm test:ci.

Documentation

  • If you introduce new functionality, document it. You can run the documentation locally with the pnpm run docs command.

Changesets

  • The changelog entry is generated from the PR title. Please make sure it explains your changes in an understandable manner.

Signed-off-by: Jérôme Benoit <jerome.benoit@piment-noir.org>
@jerome-benoit jerome-benoit marked this pull request as draft December 18, 2024 12:01
@netlify
Copy link

netlify bot commented Dec 18, 2024

Deploy Preview for vitest-dev ready!

Built without sensitive environment variables

🔨 Latest commit: 05afd1d
🔍 Latest deploy log: https://app.netlify.com/projects/vitest-dev/deploys/69b5426b8777e20008eea95b
😎 Deploy Preview: https://deploy-preview-7100--vitest-dev.netlify.app

@gperdomor
Copy link

@jerome-benoit any progress with this?... Can you migrate this to v4? 🙏🏻

@jerome-benoit
Copy link
Author

It's still on my TODO list, but I haven't had time to finish the migration.

@gperdomor
Copy link

@jerome-benoit thank you for the quick response... I hope you find some time to finish 👍🏻

…rade

# Conflicts:
#	packages/vitest/src/runtime/runners/benchmark.ts
#	test/benchmark/test/reporter.test.ts
…stream merge

- Use latency.mean instead of deprecated mean in benchmark ranking sort
- Guard filter with benchmark?.latency to handle in-progress benchmarks
- Simplify error handler with null-safe access (e.task?.result?.error)
- Remove dead test:benchmark script referencing deleted workspace
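The null-safe ranking described in these notes can be sketched as follows. The types are illustrative, not vitest's actual reporter interfaces; only the latency.mean sort key and the `benchmark?.latency` guard come from the change notes above.

```typescript
// Hedged sketch of the ranking fix: sort by latency.mean while filtering
// out benchmarks whose latency statistics are not yet available
// (in-progress or errored tasks). Types are illustrative only.
interface Benchmark {
  name: string
  latency?: { mean: number }
}

function rankBenchmarks(benchmarks: Benchmark[]): Benchmark[] {
  return benchmarks
    .filter(b => b?.latency != null) // guard in-progress/errored tasks
    .sort((a, b) => a.latency!.mean - b.latency!.mean) // lowest mean latency first
}
```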
@jerome-benoit jerome-benoit changed the title from chore(deps): update tinybench to 3.x.x to chore(deps): upgrade tinybench to v6 on Mar 4, 2026
@jerome-benoit jerome-benoit marked this pull request as ready for review March 4, 2026 19:38
Copilot AI review requested due to automatic review settings March 4, 2026 19:38
Copy link

Copilot AI left a comment


Pull request overview

Upgrades Vitest’s benchmarking integration to work with tinybench@6, adapting internal result types and benchmark reporting/runner logic to match tinybench’s new discriminated-union results and latency/throughput statistics model.

Changes:

  • Bump tinybench dependency to ^6.0.0 (lockfile + package dependency).
  • Refactor runtime benchmark runner/types to use tinybench v6 task/results and map includeSamples to retainSamples.
  • Update benchmark reporters/tests to read from latency.* / throughput.* instead of deprecated top-level stats.
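The includeSamples-to-retainSamples mapping mentioned above can be sketched as below. This is a hedged sketch of the mapping idea under assumed option shapes; the real getBenchOptions signature and config types live in vitest's runtime.

```typescript
// Illustrative option shapes; not vitest's real configuration types.
interface RunnerBenchConfig {
  includeSamples?: boolean
  time?: number
  iterations?: number
}

interface BenchOptions {
  retainSamples: boolean
  time?: number
  iterations?: number
}

// Centralized translation from vitest's config to tinybench v6 options:
// vitest's includeSamples maps onto tinybench's retainSamples flag,
// which defaults to false in v6.
function getBenchOptions(config: RunnerBenchConfig): BenchOptions {
  const { includeSamples = false, ...rest } = config
  return { ...rest, retainSamples: includeSamples }
}
```

Centralizing the mapping in one helper keeps the runner call site free of tinybench-specific option names.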

Reviewed changes

Copilot reviewed 10 out of 11 changed files in this pull request and generated 5 comments.

Summary per file:

  • test/cli/test/benchmarking.test.ts: Updates include-samples assertions to match v6 latency.samples behavior.
  • pnpm-lock.yaml: Locks tinybench@6.0.0 and updates related dependency metadata.
  • packages/vitest/src/runtime/types/benchmark.ts: Redefines benchmark result/statistics types for v6 (latency/throughput, samples handling).
  • packages/vitest/src/runtime/runners/benchmark.ts: Refactors benchmark execution to Bench.run() and v6 events/results.
  • packages/vitest/src/runtime/benchmark.ts: Tightens benchmark options storage/types and adds the formatted name into options.
  • packages/vitest/src/public/index.ts: Updates public type re-exports to the new tinybench v6 aliases.
  • packages/vitest/src/node/reporters/benchmark/tableRender.ts: Updates table rendering to use latency.* and compare via throughput.mean.
  • packages/vitest/src/node/reporters/benchmark/reporter.ts: Updates sorting/ranking logic to use latency.mean.
  • packages/vitest/src/node/reporters/benchmark/json-formatter.ts: Stops forcibly overriding samples in JSON output.
  • packages/vitest/src/node/reporters/base.ts: Updates the benchmark summary ratio calculation to use latency.mean.
  • packages/vitest/package.json: Bumps the tinybench dependency to ^6.0.0.
Files not reviewed (1)
  • pnpm-lock.yaml: Language not supported


jerome-benoit and others added 7 commits March 4, 2026 21:08
Guard complete handler against errored/aborted tasks to prevent TypeError crash. Track error state to correctly mark failed benchmarks. Replace Object.assign with selective field extraction to prevent tinybench runtime metadata from leaking into BenchmarkResult and JSON output.

Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode)

Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>
Move retainSamples (includeSamples → retainSamples) mapping from the runner call site into getBenchOptions, which now takes the runner config and returns ready-to-use tinybench BenchOptions.

Rename numberOfSamples to samplesCount to match tinybench Statistics.samplesCount.

@jerome-benoit
Copy link
Author

Hey @sheremet-va @hi-ogawa @AriPerkkio 👋

This PR is finally ready for review! It's been sitting around since December 2024, but it's now fully up to date with main and covers the complete tinybench v2.9.0 → v6.0.0 upgrade.

Beyond the API migration, this includes a few correctness fixes found during an audit of the benchmarking code path (error handling in the complete event, Object.assign result pollution, diffFixed comparison bug). All details in the PR description.

CI is green (the browser test flake on the previous run was from the upstream merge and has since been fixed). Happy to address any feedback. Thanks!

@sheremet-va
Copy link
Member

Hey! Thank you for the PR, and sorry for the lack of communication, but we currently don't want to accept PRs targeting benchmarking features because we have a big rewrite on our roadmap that changes the whole architecture to not treat bench functions as tests: #7850

By accepting this PR we might introduce bugs that need to be fixed in an already kind of softly deprecated feature, so I would rather avoid any turbulence at this time.

@jerome-benoit
Copy link
Author

By accepting this PR we might introduce bugs that need to be fixed in an already kind of softly deprecated feature, so I would rather avoid any turbulence at this time.

Fair enough. But given the minimal code churn and the obviousness of the trivial fixes (data-structure field renames, proper conditions and code flow, grouped steps, ... all of it relying on a library version that has not had a single bug report since), I do not see how it could be worse with this PR. The current implementation is buggy. Making it just work better will not introduce any maintenance overhead at all, unless the revamp has already started and this PR would introduce conflicts with another PR.

@sheremet-va
Copy link
Member

Fair enough. But given the minimal code churn and the obviousness of the trivial fixes (data-structure field renames, proper conditions and code flow, grouped steps, ... all of it relying on a library version that has not had a single bug report since), I do not see how it could be worse with this PR. The current implementation is buggy. Making it just work better will not introduce any maintenance overhead at all, unless the revamp has already started and this PR would introduce conflicts with another PR.

It could be worse because it could break existing integrations, like codespeed, or it might not! There are almost no tests for benchmarking, so regressions are hard to catch. The changes in interfaces are already breaking enough, for example.

Basically, I don't want to break the ecosystem multiple times.

@jerome-benoit
Copy link
Author

It could be worse because it could break existing integrations, like codespeed, or it might not! There are almost no tests for benchmarking, so regressions are hard to catch.

It can't be worse; the code already shows it.

The changes in interfaces are already breaking enough, for example.

It can be reverted to keep the exact same API if that's an issue for a minor-revision beta cycle.

Basically, I don't want to break the ecosystem multiple times.

Understood, but these are basically trivial bug fixes that can be made totally transparent if needed: vitest already abstracts away tinybench internals (API, tunables, input and output data structures, ...).

I can add simplified tests ported from tinybench to the vitest API in another PR, to ensure this PR passes a more comprehensive real-world benchmark test suite.

@sheremet-va
Copy link
Member

All of this just increases maintenance: someone needs to write this, then review it, then triage issues to see whether it needs to be reverted. Benchmarking has a very low priority at the moment, so it won't be merged anyway. And when the priority increases, the work will be focused on the new API.

@jerome-benoit
Copy link
Author

jerome-benoit commented Mar 8, 2026

All of this just increases maintenance: someone needs to write this, then review it, then triage issues to see whether it needs to be reverted.

Discouraging external contributions from people who have a clue about what they are doing is a very bad move from a project-maintenance point of view, especially for a low-priority, under-maintained component orthogonal to the project's main use case. It is even worse when the proposal offers a baseline unit-test integration that would establish solid TDD ground for any future enhancements.
If your goal was to make potentially valuable contributions to the project go away, it's a complete success :)

@sheremet-va
Copy link
Member

All of this just increases maintenance: someone needs to write this, then review it, then triage issues to see whether it needs to be reverted.

Discouraging external contributions from people who have a clue about what they are doing is a very bad move from a project-maintenance point of view, especially for a low-priority, under-maintained component orthogonal to the project's main use case. It is even worse when the proposal offers a baseline unit-test integration that would establish solid TDD ground for any future enhancements. If your goal was to make potentially valuable contributions to the project go away, it's a complete success :)

Anything added to benchmarking will be removed and rewritten within half a year, the feature in its current state is deprecated. So yes, we discourage any contribution to it as I said in my initial comment. Please, do not take this personally.

As a person who has a clue about what they are doing and whose opinion on benchmarking I respect, you should leave feedback on the next iteration of benchmarking here: #7850
