docs(runbooks): #1780 compensating-status recovery for stale CI umbrellas #1782

Merged
hongming merged 1 commits from docs/issue-1780-compensating-status-runbook into main 2026-05-24 06:14:19 +00:00

1 Commits

Author SHA1 Message Date
hongming 0ea86df071 docs(runbooks): document compensating-status recovery for stale CI umbrellas (#1780)
ci-arm64-advisory / fast-checks (pull_request) Waiting to run
Lint shellcheck (arm64 pilot) / shellcheck-arm64 (pilot) (pull_request) Successful in 8s
Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 6s
CI / Detect changes (pull_request) Successful in 12s
CI / Python Lint & Test (pull_request) Successful in 4s
E2E API Smoke Test / detect-changes (pull_request) Successful in 9s
E2E Chat / detect-changes (pull_request) Successful in 8s
E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 11s
Handlers Postgres Integration / detect-changes (pull_request) Successful in 10s
Lint forbidden tenant-env keys / Scan workspace_secrets writers for forbidden env keys (pull_request) Successful in 4s
Harness Replays / detect-changes (pull_request) Successful in 9s
Lint no tenant GITEA or GITHUB token write / Scan for repo-host token write into tenant workspace surface (pull_request) Successful in 5s
gate-check-v3 / gate-check (pull_request) Successful in 7s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 8s
qa-review / approved (pull_request) Failing after 6s
security-review / approved (pull_request) Failing after 6s
sop-checklist / review-refire (pull_request) Has been skipped
sop-checklist / na-declarations (pull_request) N/A: (none)
sop-checklist / all-items-acked (pull_request) Successful in 5s
sop-tier-check / tier-check (pull_request) Successful in 5s
lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m4s
CI / Platform (Go) (pull_request) Successful in 4s
CI / Canvas (Next.js) (pull_request) Successful in 4s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 8s
E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 9s
E2E Chat / E2E Chat (pull_request) Successful in 15s
E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 13s
Harness Replays / Harness Replays (pull_request) Successful in 4s
Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 4s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
CI / all-required (pull_request) Successful in 12s
audit-force-merge / audit (pull_request) Successful in 8s
Adds docs/runbooks/ci-umbrella-stale-compensating-status.md documenting
the recovery pattern: when CI/all-required is failure but all 5
required sub-jobs are success in action_run_job, POST a corrected
success status via the Gitea API to unblock the merge gate.

Used twice in the 2026-05-24 CTO-bypass session (PRs #1737 and #1759);
the pattern parallels status-reaper.yml's compensating-status approach
for default-branch (push) drift.

The runbook is explicit about when NOT to use it (any required sub-job
actually failed) and requires WHO+WHY in the description field so the
audit trail stays honest.

Closes #1780 (the docs sub-task). The auto-recovery sub-task tracked in
#1780 stays open pending decision on whether to build umbrella-reaper or
let #1779's runner-pool fix make it unnecessary.
2026-05-23 22:33:27 -07:00