molecule-core

History

Hongming Wang 3105e87cf7 ci: gate PRs on tests/harness/run-all-replays.sh Closes the gap between "the harness exists" and "the harness blocks bugs." Phase 2 of the harness roadmap (per tests/harness/README.md): make harness-based E2E a required CI check on every PR touching the tenant binary or the harness itself. Trigger: push + pull_request to staging+main, paths-filtered to workspace-server/, canvas/, tests/harness/**, and this workflow. merge_group support included so this becomes branch-protectable. Single-job-with-conditional-steps pattern (matches e2e-api.yml). One check run regardless of paths-filter outcome; satisfies branch protection cleanly per the PR #2264 SKIPPED-in-set finding. Why this exists: 2026-04-30 we shipped a TenantGuard allowlist gap (/buildinfo added to router.go in #2398, never added to the allowlist) that the existing buildinfo-stale-image.sh replay would have caught. The harness was wired correctly; nobody ran it. Replays as a discipline beat replays as a memory item. The CI pipeline: detect-changes (paths filter) └ harness-replays (always) ├ no-op pass when paths-filter says no relevant change └ otherwise: checkout + sibling plugin checkout + /etc/hosts entry + run-all-replays.sh + compose-logs-on-failure + force-teardown Compose logs from tenant/cp-stub/cf-proxy/postgres are dumped on failure so a CI red is debuggable without re-reproducing locally. The trap in run-all-replays.sh handles teardown; the always-run down.sh step is a belt-and-suspenders against trap-bypass kills. Follow-ups (not in this PR): - Add this check to staging branch protection once it's been green for a few PRs (the new-workflow-instability hedge that other gates followed). - Eventually wire the buildx GHA cache to speed up tenant image builds — currently every PR rebuilds the full Dockerfile.tenant (Go + Next.js + template clones) from scratch. Acceptable for now; optimize when the timeout-minutes:30 ceiling becomes painful. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>		2026-04-30 13:04:53 -07:00
..
auto-promote-on-e2e.yml	fix(ci): handle empty E2E lookup in auto-promote-on-e2e gate	2026-04-30 10:07:52 -07:00
auto-promote-staging.yml	ci(auto-promote): dispatch publish via molecule-ai App token to unblock workflow_run chain	2026-04-30 08:55:49 -07:00
auto-sync-main-to-staging.yml	fix(ci): auto-sync opens a PR + uses merge queue, not direct push	2026-04-28 15:59:26 -07:00
auto-tag-runtime.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
block-internal-paths.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
canary-staging.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
canary-verify.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
check-merge-group-trigger.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
check-migration-collisions.yml	fix(ci): drop --depth=1 from migration collision check fetch	2026-04-30 05:28:03 -07:00
ci.yml	ci: collapse all 4 path-filtered required checks to single-job-with-conditional-steps	2026-04-29 16:09:22 -07:00
codeql.yml	chore(deps): batch dep bumps — 6 safe upgrades (4 actions majors + 2 npm dev deps)	2026-04-28 17:44:55 -07:00
continuous-synth-e2e.yml	ci: continuous synthetic E2E against staging (#2342 )	2026-04-29 22:04:57 -07:00
e2e-api.yml	test(e2e): poll-mode + since_id cursor round-trip (#2339 PR 4)	2026-04-29 23:07:10 -07:00
e2e-staging-canvas.yml	fix(e2e-canvas): kill teardown race that poisons concurrent runs	2026-04-29 19:23:56 -07:00
e2e-staging-saas.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
e2e-staging-sanity.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
harness-replays.yml	ci: gate PRs on tests/harness/run-all-replays.sh	2026-04-30 13:04:53 -07:00
pr-guards.yml	ci: add pr-guards caller that disables auto-merge on push	2026-04-27 06:39:31 -07:00
promote-latest.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
publish-canvas-image.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
publish-runtime.yml	refactor(ci): extract wheel smoke into shared script	2026-04-30 11:52:07 -07:00
publish-workspace-server-image.yml	feat(deploy): verify each tenant /buildinfo matches published SHA after redeploy	2026-04-30 10:55:08 -07:00
railway-pin-audit.yml	ci: daily Railway pin-audit cron + issue-on-failure (#2169 )	2026-04-29 17:43:01 -07:00
redeploy-tenants-on-main.yml	fix(ci): gate 50%-floor on TOTAL_VERIFIED >= 4	2026-04-30 11:40:31 -07:00
redeploy-tenants-on-staging.yml	fix(ci): gate 50%-floor on TOTAL_VERIFIED >= 4	2026-04-30 11:40:31 -07:00
retarget-main-to-staging.yml	ci(retarget): handle 422 'duplicate PR' by closing redundant main-PR (closes #1884 )	2026-04-26 00:53:55 -07:00
runtime-pin-compat.yml	chore(deps): batch dep bumps — 6 safe upgrades (4 actions majors + 2 npm dev deps)	2026-04-28 17:44:55 -07:00
runtime-prbuild-compat.yml	refactor(ci): extract wheel smoke into shared script	2026-04-30 11:52:07 -07:00
secret-pattern-drift.yml	chore(deps): batch dep bumps — 6 safe upgrades (4 actions majors + 2 npm dev deps)	2026-04-28 17:44:55 -07:00
secret-scan.yml	chore(security): pin Actions to SHAs + enable Dependabot auto-bumps	2026-04-28 15:37:06 -07:00
sweep-cf-orphans.yml	Merge pull request #2248 from Molecule-AI/fix/sweep-cf-orphans-hard-fail-on-schedule	2026-04-29 01:16:22 +00:00
sweep-cf-tunnels.yml	feat(ops): add sweep-cf-tunnels janitor — orphan Cloudflare Tunnels accumulate	2026-04-29 19:42:47 -07:00
sweep-stale-e2e-orgs.yml	ci: hourly sweep of stale e2e-* orgs on staging	2026-04-24 23:07:57 -07:00
test-ops-scripts.yml	chore(deps): batch dep bumps — 6 safe upgrades (4 actions majors + 2 npm dev deps)	2026-04-28 17:44:55 -07:00