ci(staging): serialize deploy-staging fleet redeploys with concurrency group #2962

Merged
devops-engineer merged 1 commits from fix/deploy-staging-silence into main 2026-06-15 19:50:24 +00:00
Member

Follow-up to #2940 to close the remaining staging-deploy concurrency gap.

Changes:

  • Adds a job-level concurrency: { group: staging-fleet-deploy } to the deploy-staging job in .gitea/workflows/publish-workspace-server-image.yml so two rapid main pushes cannot both hit /cp/admin/tenants/redeploy-fleet simultaneously (CR2 12046).
  • deploy-staging already has continue-on-error: false, so a failed staging redeploy fails the run visibly instead of being swallowed.
  • Verified lint-workflow-yaml.py and its unit tests pass with no Gitea-1.22.6-hostile shapes.

Reserved-path note: .gitea/workflows/ is reserved; this needs a non-author/driver approval before merge.

Relates #2940 / Researcher RCA #2929 comment 103321.

Follow-up to #2940 to close the remaining staging-deploy concurrency gap. Changes: - Adds a job-level `concurrency: { group: staging-fleet-deploy }` to the `deploy-staging` job in `.gitea/workflows/publish-workspace-server-image.yml` so two rapid main pushes cannot both hit `/cp/admin/tenants/redeploy-fleet` simultaneously (CR2 12046). - `deploy-staging` already has `continue-on-error: false`, so a failed staging redeploy fails the run visibly instead of being swallowed. - Verified `lint-workflow-yaml.py` and its unit tests pass with no Gitea-1.22.6-hostile shapes. **Reserved-path note:** `.gitea/workflows/` is reserved; this needs a non-author/driver approval before merge. Relates #2940 / Researcher RCA #2929 comment 103321.
agent-dev-a added 1 commit 2026-06-15 19:41:30 +00:00
ci(staging): serialize deploy-staging fleet redeploys with concurrency group
CI / Python Lint & Test (pull_request) Successful in 6s
Handlers Postgres Integration / detect-changes (pull_request) Successful in 7s
Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 9s
E2E Peer Visibility (literal MCP list_peers) / detect-changes (pull_request) Successful in 7s
Lint curl status-code capture / Scan workflows for curl status-capture pollution (pull_request) Successful in 6s
Lint forbidden tenant-env keys / Scan for repo-host token write into tenant workspace surface (pull_request) Successful in 7s
E2E Peer Visibility (literal MCP list_peers) / E2E Peer Visibility (local) (pull_request) Has been skipped
Lint forbidden tenant-env keys / Scan workspace_secrets writers for forbidden env keys (pull_request) Successful in 9s
Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 3s
lint-required-workflows-docker-host-pinned / Lint docker-host pin on docker-touching workflows (pull_request) Successful in 7s
sop-checklist / review-refire (pull_request_target) Has been skipped
E2E Peer Visibility (literal MCP list_peers) / E2E Peer Visibility (pull_request) Successful in 6s
E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 18s
lint-no-coe-on-required / lint-no-coe-on-required (pull_request) Successful in 18s
CI / Detect changes (pull_request) Successful in 24s
reserved-path-review / reserved-path-review (pull_request_target) Failing after 8s
E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 3s
Lint workflow YAML (Gitea-1.22.6-hostile shapes) / Lint workflow YAML for Gitea-1.22.6-hostile shapes (pull_request) Successful in 17s
E2E Chat / detect-changes (pull_request) Successful in 25s
CI / Platform (Go) (pull_request) Successful in 2s
E2E API Smoke Test / detect-changes (pull_request) Successful in 27s
Lint publish-runner timeout-minutes / Lint publish-runner timeout-minutes (pull_request) Successful in 21s
gate-check-v3 / gate-check (pull_request_target) Successful in 16s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 2s
CI / Canvas (Next.js) (pull_request) Successful in 2s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 17s
CI / Canvas Deploy Status (pull_request) Successful in 1s
sop-checklist / na-declarations (pull_request) N/A: (none)
lint-setup-go-cache / lint-setup-go-cache (pull_request) Successful in 21s
E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 2s
sop-checklist / all-items-acked (pull_request_target) Successful in 16s
E2E Chat / E2E Chat (pull_request) Successful in 3s
lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 24s
PR Diff Guard / PR diff guard (pull_request) Successful in 20s
Lint pre-flip continue-on-error / Verify continue-on-error flips have run-log proof (pull_request) Successful in 28s
CI / all-required (pull_request) Successful in 4s
lint-continue-on-error-tracking / lint-continue-on-error-tracking (pull_request) Successful in 38s
lint-required-context-exists-in-bp / lint-required-context-exists-in-bp (pull_request) Successful in 35s
Local Provision Lifecycle E2E / Local Provision Lifecycle E2E (stub) (pull_request) Successful in 34s
Local Provision Lifecycle E2E / Local Provision Lifecycle E2E (real image + MiniMax LLM, advisory) (pull_request) Successful in 2m7s
qa-review / approved (pull_request_target) Approved via pull_request_review trigger
reserved-path-review / reserved-path-review (pull_request_review) Successful in 9s
security-review / approved (pull_request_target) Approved via pull_request_review trigger
qa-review / approved (pull_request_review) Successful in 11s
security-review / approved (pull_request_review) Successful in 12s
audit-force-merge / audit (pull_request_target) Successful in 8s
sop-checklist / all-items-acked (pull_request) Compensated by status-reaper (non-required pull_request/pull_request_review governance shadow overridden by successful pull_request_target status; see .gitea/scripts/status-reaper.py)
386ed44a51
Adds a job-level concurrency group to the deploy-staging job in
publish-workspace-server-image.yml so two rapid main pushes cannot both
hit /cp/admin/tenants/redeploy-fleet simultaneously (CR2 12046).

deploy-staging already has continue-on-error: false, so a failed staging
redeploy fails the run visibly instead of being swallowed. Verified with
lint-workflow-yaml.py and its unit tests (no Gitea-1.22.6-hostile shapes).

Co-Authored-By: Claude <noreply@anthropic.com>
agent-reviewer-cr2 approved these changes 2026-06-15 19:49:51 +00:00
agent-reviewer-cr2 left a comment
Member

APPROVE — clean, correct 4-line serialization fix on a Gitea-supported shape.

Adds concurrency: { group: staging-fleet-deploy } to the staging auto-deploy job so two rapid main pushes don't both hit /cp/admin/tenants/redeploy-fleet concurrently.

5-axis:

  • Correctness — a job-level concurrency group serializes runs; with the default (no cancel-in-progress), the second push QUEUES behind the first rather than cancelling it — which is exactly right for a fleet redeploy (you don't want to abort a mid-flight rollout). Prevents the concurrent-redeploy-fleet race.
  • Platform-validity — unlike the workflow_run shape in #2960, concurrency: is supported here: Lint workflow YAML (Gitea-1.22.6-hostile shapes) is GREEN, and all other workflow lints pass.
  • Robustness — the job stays gated on push to main; serialization is the correct guard against overlapping rollouts.
  • Security/Performance — no secret surface; the only cost is the intended queueing latency on back-to-back pushes.
  • Readability — clear comment with the rationale + the originating review reference.

The only red checks are ceremony gates (qa/security/sop/reserved-path-review — it's a .gitea/workflows change). Code/CI dimension is satisfied. Good fix. 👍

**APPROVE** — clean, correct 4-line serialization fix on a Gitea-supported shape. Adds `concurrency: { group: staging-fleet-deploy }` to the staging auto-deploy job so two rapid `main` pushes don't both hit `/cp/admin/tenants/redeploy-fleet` concurrently. 5-axis: - **Correctness** ✅ — a job-level concurrency group serializes runs; with the default (no `cancel-in-progress`), the second push QUEUES behind the first rather than cancelling it — which is exactly right for a fleet redeploy (you don't want to abort a mid-flight rollout). Prevents the concurrent-redeploy-fleet race. - **Platform-validity** ✅ — unlike the `workflow_run` shape in #2960, `concurrency:` is supported here: `Lint workflow YAML (Gitea-1.22.6-hostile shapes)` is GREEN, and all other workflow lints pass. - **Robustness** ✅ — the job stays gated on `push` to `main`; serialization is the correct guard against overlapping rollouts. - **Security/Performance** ✅ — no secret surface; the only cost is the intended queueing latency on back-to-back pushes. - **Readability** ✅ — clear comment with the rationale + the originating review reference. The only red checks are ceremony gates (qa/security/sop/reserved-path-review — it's a `.gitea/workflows` change). Code/CI dimension is satisfied. Good fix. 👍
devops-engineer merged commit d679bbf68e into main 2026-06-15 19:50:24 +00:00
Sign in to join this conversation.
No Reviewers
2 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: molecule-ai/molecule-core#2962