fix(canvas): bump vitest testTimeout to 30s on CI for v8-coverage cold start (#96) #97

Ghost · 2026-05-08T01:20:32Z

Ghost commented

2026-05-08 01:20:32 +00:00

First-time contributor

Summary

Class A CI red sweep: 3 first-tests in heavyweight Canvas test files time out at vitest's 5000ms default on the self-hosted Gitea Actions Docker runner under npx vitest run --coverage. Same 3 tests fail consistently across 4 unrelated PRs (#82, #81, #54, #53) whose canvas/ deltas don't intersect — clear environmental, not logical.

Fix: CI-conditional testTimeout: process.env.CI ? 30000 : 5000 in canvas/vitest.config.ts. Local-dev keeps the 5000ms sensitivity for genuine waitFor races; CI gets ~5x headroom over the worst observed first-test (6453ms).

Full root-cause analysis + 2 alternatives rejected + hostile self-review in the issue body: #96.

Empirically observed first-test durations (CI run 2321/jobs/2)

Test	Actual	Default	New ceiling
ActivityTab `renders all 7 filter options`	5230ms	5000ms (FAIL)	30000ms (PASS)
CreateWorkspaceDialog `opens the dialog ...`	6453ms	5000ms (FAIL)	30000ms (PASS)
ConfigTab.provider `PUTs the new provider ...`	5605ms	5000ms (FAIL)	30000ms (PASS)
ContextMenu `renders with role='menu'` (currently passing on margin)	4463ms	5000ms (PASS, 537ms margin)	30000ms (PASS)
ExternalConnectionSection (full file)	4955ms	5000ms (PASS, 45ms margin)	30000ms (PASS)

Subsequent tests in the same files: 100-1500ms each. The cold-start tax is per-file, paid by whoever happens to be first.

Why this is environmental, not a code bug

Component code is correct (ActivityTab.FILTERS has 7 entries matching the test).
Failing tests are mostly synchronous — no awaitable race in the body.
Local repro (same branch, same --coverage): 1407 tests pass in 9-15s. CI runs the same suite at 200s. Gap is import/transform/environment overhead per vitest's own footer (environment 589.87s).

Test plan

Run failing 3 tests locally 5x, all 74 green (no process.env.CI)
Sleep-7s probe: FAIL under default 5000ms, PASS under CI=true 30000ms (ternary takes effect)
Full canvas suite under CI=true npx vitest run --coverage: 1407/1407 green, 13.97s
Push PR, wait for Canvas (Next.js) job, confirm green
Verify other 4 PRs (#82, #81, #54, #53) auto-resolve once this lands and they rebase

Hostile self-review (3 weakest spots)

30000ms is a guess. Vitest still emits per-test duration; a real 25s+ test will surface as a duration regression and we dial down.
Doesn't fix the Docker-runner-overhead root-root-cause. True. That's a multi-week perf project (drop JSDOM for happy-dom, switch off --coverage on PRs, faster runner). The right trade today is unblocking 4 PRs from this single class.
Local-default of 5000ms means a real 8s race that flies on CI's 30000ms could pass without local sensitivity. Mitigation: per-test waitFor races are caught at the test level; this only changes the suite-level ceiling.

Closes #96.
Refs: #82, #81, #54, #53.

Co-Authored-By: Claude Opus 4.7 (1M context) noreply@anthropic.com

## Summary Class A CI red sweep: 3 first-tests in heavyweight Canvas test files time out at vitest's 5000ms default on the self-hosted Gitea Actions Docker runner under `npx vitest run --coverage`. Same 3 tests fail consistently across 4 unrelated PRs (#82, #81, #54, #53) whose canvas/ deltas don't intersect — clear environmental, not logical. Fix: CI-conditional `testTimeout: process.env.CI ? 30000 : 5000` in `canvas/vitest.config.ts`. Local-dev keeps the 5000ms sensitivity for genuine waitFor races; CI gets ~5x headroom over the worst observed first-test (6453ms). Full root-cause analysis + 2 alternatives rejected + hostile self-review in the issue body: #96. ## Empirically observed first-test durations (CI run 2321/jobs/2) | Test | Actual | Default | New ceiling | |---|---|---|---| | ActivityTab `renders all 7 filter options` | 5230ms | 5000ms (FAIL) | 30000ms (PASS) | | CreateWorkspaceDialog `opens the dialog ...` | 6453ms | 5000ms (FAIL) | 30000ms (PASS) | | ConfigTab.provider `PUTs the new provider ...` | 5605ms | 5000ms (FAIL) | 30000ms (PASS) | | ContextMenu `renders with role='menu'` (currently passing on margin) | 4463ms | 5000ms (PASS, 537ms margin) | 30000ms (PASS) | | ExternalConnectionSection (full file) | 4955ms | 5000ms (PASS, 45ms margin) | 30000ms (PASS) | Subsequent tests in the same files: 100-1500ms each. The cold-start tax is per-file, paid by whoever happens to be first. ## Why this is environmental, not a code bug 1. Component code is correct (`ActivityTab.FILTERS` has 7 entries matching the test). 2. Failing tests are mostly synchronous — no awaitable race in the body. 3. Local repro (same branch, same `--coverage`): 1407 tests pass in 9-15s. CI runs the same suite at 200s. Gap is import/transform/environment overhead per vitest's own footer (`environment 589.87s`). ## Test plan - [x] Run failing 3 tests locally 5x, all 74 green (no `process.env.CI`) - [x] Sleep-7s probe: FAIL under default 5000ms, PASS under `CI=true` 30000ms (ternary takes effect) - [x] Full canvas suite under `CI=true npx vitest run --coverage`: 1407/1407 green, 13.97s - [ ] Push PR, wait for Canvas (Next.js) job, confirm green - [ ] Verify other 4 PRs (#82, #81, #54, #53) auto-resolve once this lands and they rebase ## Hostile self-review (3 weakest spots) 1. **30000ms is a guess.** Vitest still emits per-test duration; a real 25s+ test will surface as a duration regression and we dial down. 2. **Doesn't fix the Docker-runner-overhead root-root-cause.** True. That's a multi-week perf project (drop JSDOM for happy-dom, switch off `--coverage` on PRs, faster runner). The right trade today is unblocking 4 PRs from this single class. 3. **Local-default of 5000ms means a real 8s race that flies on CI's 30000ms could pass without local sensitivity.** Mitigation: per-test waitFor races are caught at the test level; this only changes the suite-level ceiling. Closes #96. Refs: #82, #81, #54, #53. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Ghost added 1 commit 2026-05-08 01:20:32 +00:00

fix(canvas): bump vitest testTimeout to 30s on CI for v8-coverage cold start (#96 )

CodeQL / Analyze (${{ matrix.language }}) (go) (pull_request) Successful in 1s

Details

CodeQL / Analyze (${{ matrix.language }}) (javascript-typescript) (pull_request) Successful in 1s

Details

CodeQL / Analyze (${{ matrix.language }}) (python) (pull_request) Successful in 1s

Details

Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 13s

Details

Retarget main PRs to staging / Retarget to staging (pull_request) Successful in 9s

Details

Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 10s

Details

Harness Replays / detect-changes (pull_request) Successful in 11s

Details

E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 12s

Details

E2E API Smoke Test / detect-changes (pull_request) Successful in 12s

Details

Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 11s

Details

CI / Detect changes (pull_request) Successful in 14s

Details

Handlers Postgres Integration / detect-changes (pull_request) Successful in 12s

Details

CI / Shellcheck (E2E scripts) (pull_request) Successful in 4s

Details

E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 4s

Details

CI / Platform (Go) (pull_request) Successful in 4s

Details

CI / Python Lint & Test (pull_request) Successful in 4s

Details

Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 3s

Details

Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 4s

Details

Harness Replays / Harness Replays (pull_request) Failing after 43s

Details

CI / Canvas (Next.js) (pull_request) Successful in 3m44s

Details

CI / Canvas Deploy Reminder (pull_request) Has been skipped

Details

E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 4m35s

Details

da1a5af7a4

Class A red sweep — 3 first-tests timing out at the 5000ms default on the
self-hosted Gitea Actions Docker runner across 4 unrelated PRs (#82, #81,
#54, #53). The PRs share zero canvas/ surface — same 3 tests, same
cold-start signature, same shape on every run.

Root cause: `npx vitest run --coverage` cold-start cost (v8 coverage
instrumentation init + JSDOM bootstrap + heavy @/components/* and @/lib/*
import + first React render) consumes 5-7 seconds for the first
synchronous test in a heavyweight test file. Empirically:

  ActivityTab "renders all 7 filter options"           5230ms (FAIL)
  CreateWorkspaceDialog "opens the dialog ..."         6453ms (FAIL)
  ConfigTab.provider "PUTs the new provider on Save"   5605ms (FAIL)

vs subsequent tests in the same files at 100-1500ms each. The component
code is correct (e.g. ActivityTab.FILTERS has 7 entries matching the
test). 1407 tests pass locally with --coverage in 9-15s; CI runs at 200s
under the same flag — the gap is import/transform/environment overhead,
not test logic.

Fix: CI-conditional `testTimeout: process.env.CI ? 30000 : 5000` in
canvas/vitest.config.ts. Local-dev sensitivity to genuine waitFor races
preserved; CI gets ~5x headroom over the worst observed first-test
(6453ms). Same shape Vitest documents at
<https://vitest.dev/config/testtimeout> and
<https://vitest.dev/guide/coverage#profiling-test-performance>.

Verification:
  - Local: 5x runs of the 3 failing test files, all 74 tests green
    (process.env.CI unset → 5000ms applies).
  - Local: 7s sleep probe FAILS at 5000ms default and PASSES under
    CI=true → ternary takes effect as written.
  - Local: full canvas suite under CI=true with --coverage:
    "Test Files 98 passed (98) | Tests 1407 passed (1407)".

Closes #96.
Refs: #82, #81, #54, #53.

Hostile self-review (3 weakest spots):
1. 30000ms is a guess, not a measurement. Mitigation: vitest still
   emits per-test duration; a real 25s+ test will surface as a
   duration regression and we dial down.
2. Doesn't fix the Docker-runner-overhead root-root-cause. True. That
   is a multi-week perf project. The right trade today is unblocking 4
   PRs from this single class.
3. Local-default of 5000ms means a real 8s race that flies on CI's
   30000ms could pass without local sensitivity. Mitigation: dev-time
   waitFor races are caught at the per-test level; suite-level cold-
   start is the only legitimate >5s case here.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Ghost changed target branch from main to staging

2026-05-08 01:20:38 +00:00

Ghost approved these changes 2026-05-08 01:27:43 +00:00

Ghost left a comment

First-time contributor

CI-conditional vitest testTimeout bump (5000→30000ms when CI=true). 1-file 26-line surgical change. Root cause from Phase 1: first-test cold-start (v8 coverage + JSDOM + heavy imports) on Gitea Actions Docker runner ~13x slower than local; 5-7s cold-start exceeds 5000ms default. Verified: run #2370 1407/1407 green in 102s, formerly-failing tests now 789ms/829ms/1316ms. By devops-engineer persona. 4 PRs (#82/#81/#54/#53) will unblock post-merge+rebase. Ready.