
Molecule AI Core-DevOps 9153a2e464	2026-05-14 02:17:42 +00:00
All checks were successful.

fix: add slug validation to prevent SSRF (OFFSEC-006)

OFFSEC-006 (HIGH): promote-tenant-image.sh interpolated raw --tenants slug into URL paths and subdomains without sanitisation. Four injection points were vulnerable:
• cp_redeploy_tenant (line 193): /cp/admin/tenants/$slug/redeploy
• tenant_buildinfo (line 209): https://${slug}.moleculesai.app/buildinfo
• tenant_health (line 217): https://${slug}.moleculesai.app/health
• resolve_tenant_instance_id (line 263): /cp/admin/tenants/$slug

Attack vectors:
--tenants 'a?url=https://evil.com' → curl splits on ? as query separator
--tenants 'evil.com@legitimate' → subdomain takeover via @

Fix:
• Add validate_slug() function with regex ^[a-z0-9]([a-z0-9-]{0,61}[a-z0-9])?$ before any URL interpolation. Exit 64 on invalid slug.
• Call validate_slug() in main() before any operations (up-front guard).
• Add defense-in-depth calls inside cp_redeploy_tenant, tenant_buildinfo, tenant_health, resolve_tenant_instance_id, redeploy_tenant, verify_tenant, and the rollback loop.
• Also fix a latent promote_rc=1 bug where `cmd || promote_rc=1` inside `set -e` returned exit 1 and triggered early script exit instead of setting the variable. Replaced with `if ! cmd; then promote_rc=1; fi`.

Test additions (test-promote-tenant-image.sh):
• Test 9: 8 invalid slug variants rejected with exit 64 (?, &, @, /, \, space, etc.)
• Test 10: 6 valid slugs accepted (chloe-dong, ab, a, etc.)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
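The slug guard described in the commit message can be sketched roughly as follows. Only the regex and the exit code 64 come from the commit message; the function body, the error wording, and the demo loop are assumptions for illustration and are not copied from promote-tenant-image.sh.

```shell
#!/usr/bin/env bash
# Sketch only: the regex and exit code 64 are quoted from the commit message;
# everything else here is an illustrative assumption.

validate_slug() {
  local slug="$1"
  # Keep the pattern in a variable so bash treats it as a regex, not a literal.
  local re='^[a-z0-9]([a-z0-9-]{0,61}[a-z0-9])?$'
  if [[ ! "$slug" =~ $re ]]; then
    echo "invalid tenant slug: '$slug'" >&2
    exit 64
  fi
}

# Run each check in a subshell so a rejection does not end this demo.
for candidate in 'chloe-dong' 'ab' 'a' 'a?url=https://evil.com' 'evil.com@legitimate'; do
  if ( validate_slug "$candidate" ) 2>/dev/null; then
    echo "accepted: $candidate"
  else
    echo "rejected: $candidate"
  fi
done
```

Because the pattern is anchored at both ends and allows only lowercase letters, digits, and inner hyphens, characters such as ?, @, /, and . cannot reach the URL path or the subdomain position.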
demo-freeze-snapshots	ops: demo-day freeze + rollback runbook	2026-05-01 12:04:30 -07:00
ops	fix(ci): harden Cloudflare sweep API errors	2026-05-13 00:35:15 -07:00
build_runtime_package.py	fix(ci): add _sanitize_a2a to TOP_LEVEL_MODULES allowlist (third workflow defect)	2026-05-10 19:32:58 -07:00
build-images.sh
bundle-compile.sh
check-cascade-list-vs-manifest.sh	feat(ci): structural drift gate for cascade list vs manifest (RFC #388 PR-3)	2026-05-03 03:52:39 -07:00
check-stale-promote-pr.sh	fix(ci): replace gh pr CLI with Gitea v1 REST in workflows + scripts (#75 class A)	2026-05-07 15:29:26 -07:00
cleanup-rogue-workspaces.sh
clone-manifest.sh	fix(ci): strip JSON5 comments from manifest.json before jq parse	2026-05-11 22:02:02 +00:00
demo-day-runbook.md	ops: demo-day freeze + rollback runbook	2026-05-01 12:04:30 -07:00
demo-freeze.sh	fix(scripts): migrate ghcr.io→ECR + raw.githubusercontent.com→Gitea (#46 )	2026-05-07 00:56:23 -07:00
demo-thaw.sh	ops: demo-day freeze + rollback runbook	2026-05-01 12:04:30 -07:00
dev-start.sh	fix(dev-start): detect missing Go and fall back to docker-compose platform	2026-04-29 20:04:37 -07:00
edge-429-probe.sh	chore(observability): edge-429 probe + ratelimit observability runbook	2026-05-07 15:48:34 -07:00
import-agent.sh
lockdown-tenant-sg.sh
measure-coordinator-task-bounds-runner.sh	fix(harness-runner): switch from non-existent /heartbeat-history to /activity	2026-04-28 23:12:51 -07:00
measure-coordinator-task-bounds.sh	docs: registry pattern + harness scripts READMEs	2026-04-28 22:19:40 -07:00
nuke-and-rebuild.sh	tech-debt: rename molecule-monorepo-net -> molecule-core-net	2026-05-09 20:51:48 +00:00
post-rebuild-setup.sh
promote-tenant-image.sh	fix: add slug validation to prevent SSRF (OFFSEC-006)	2026-05-14 02:17:42 +00:00
README.md	refactor(ci): drop "canary-" prefix → staging-smoke/staging-verify (Hongming directive 2026-05-11) (#443 )	2026-05-11 11:25:29 +00:00
refresh-workspace-images.sh	fix(scripts): migrate ghcr.io→ECR + raw.githubusercontent.com→Gitea (#46 )	2026-05-07 00:56:23 -07:00
rollback-latest.sh	fix(scripts): migrate ghcr.io→ECR + raw.githubusercontent.com→Gitea (#46 )	2026-05-07 00:56:23 -07:00
staging-smoke.sh	refactor(ci): drop "canary-" prefix → staging-smoke/staging-verify (Hongming directive 2026-05-11) (#443 )	2026-05-11 11:25:29 +00:00
test_build_runtime_package.py	chore: rewriter unit tests + drop misleading noqa on `import inbox`	2026-04-30 20:45:32 -07:00
test-a2a-cross-runtime.sh
test-all-adapters.sh
test-all-runtimes-a2a-e2e.sh	test(e2e): wire SaaS auth headers (TENANT_ADMIN_TOKEN + TENANT_ORG_ID)	2026-05-02 04:36:23 -07:00
test-all.sh
test-check-stale-promote-pr.sh	feat(ops): hourly alarm for auto-promote PR stuck on REVIEW_REQUIRED (#2975 )	2026-05-05 17:55:27 -07:00
test-cross-agent-chat.sh
test-hermes-plugin-e2e.sh	test(e2e): unified A2A round-trip parity harness across all 4 runtimes	2026-05-02 04:36:23 -07:00
test-nuke-and-rebuild.sh	fix(scripts): nuke-and-rebuild self-bootstraps templates; add E2E test	2026-04-26 14:37:04 -07:00
test-promote-tenant-image.sh	fix: add slug validation to prevent SSRF (OFFSEC-006)	2026-05-14 02:17:42 +00:00
test-team-e2e.sh
wheel_smoke.py	feat(mcp): notifications/claude/channel for push-feel inbox UX	2026-04-30 20:10:01 -07:00

README.md

scripts/

Operational and one-off scripts for molecule-core. Most are self-documenting — see the header comments in each file.

RFC #2251 coordinator task-bound harnesses

There are three related scripts; pick the right one:

| Script | Purpose | Targets |
| --- | --- | --- |
| `measure-coordinator-task-bounds.sh` | Canonical v1 harness for the RFC #2251 / Issue 4 reproduction. Provisions a PM coordinator + Researcher child via `claude-code-default` + `langgraph` templates, sends a synthesis-heavy A2A kickoff, observes elapsed time + activity trace. | OSS-shape platform: localhost or any `/workspaces`-shaped endpoint. Has tenant/admin-token guards for non-localhost runs. |
| `measure-coordinator-task-bounds-runner.sh` | Generalised runner for the same measurement contract but with arbitrary template + secret + model combinations (Hermes/MiniMax, etc.). Useful for cross-runtime variants without modifying the canonical harness. | Same as above (local or SaaS via `MODE=saas`). |
| `measure-coordinator-task-bounds.sh` (in molecule-controlplane) | Production-shape variant that bootstraps a real staging tenant via `POST /cp/admin/orgs`, then runs the same measurement against `<slug>.staging.moleculesai.app`. | Staging controlplane only; refuses to run against production. |

See reference_harness_pair_pattern (auto-memory) for when to use which and the cross-repo design rationale.

Common safety pattern across all three

Cleanup trap on EXIT/INT/TERM auto-deletes provisioned resources.
DRY_RUN=1 prints plan + auth fingerprint, exits before any state mutation. Run this before pointing at staging or any shared infrastructure.
Non-target guard refuses arbitrary endpoints (the controlplane variant is locked to staging-api.moleculesai.app; the OSS variant requires explicit auth + tenant scoping for non-localhost PLATFORM).
Cleanup failures emit cleanup_*_failed events with remediation hints; no silenced curl. ADMIN_TOKEN expiring mid-run surfaces as a structured event rather than a silent leak.
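As a rough illustration of the first two points and the failure-event point above, a harness might be structured like the sketch below. The helpers delete_workspace and emit_event, the event name cleanup_workspace_failed, and the ws-demo-1 identifier are hypothetical stand-ins and are not taken from the actual scripts; the auth-fingerprint output is omitted.

```shell
#!/usr/bin/env bash
# Sketch only: delete_workspace, emit_event, the event name and the resource
# id are hypothetical stand-ins for whatever the real harnesses use.

CREATED_IDS=()

emit_event() {  # hypothetical structured-event helper
  printf '{"event":"%s","hint":"%s"}\n' "$1" "$2"
}

delete_workspace() {  # hypothetical; a real harness would call the platform API
  echo "deleting workspace $1"
}

cleanup() {
  local id
  for id in "${CREATED_IDS[@]}"; do
    # Report failures instead of silencing them, so leaks stay visible.
    if ! delete_workspace "$id"; then
      emit_event "cleanup_workspace_failed" "delete $id manually"
    fi
  done
  CREATED_IDS=()
}

# INT and TERM exit with the conventional codes; the EXIT trap then runs once.
trap cleanup EXIT
trap 'exit 130' INT
trap 'exit 143' TERM

main() {
  echo "plan: provision coordinator and child, measure, then clean up"
  if [[ "${DRY_RUN:-0}" == "1" ]]; then
    echo "DRY_RUN=1: stopping before any state mutation"
    return 0
  fi
  CREATED_IDS+=("ws-demo-1")  # record each resource as soon as it exists
  echo "provisioned ws-demo-1"
}

main "$@"
```

Registering each resource in the array immediately after creation means an interrupt between provisioning steps still cleans up whatever already exists.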

Activity trace caveat

If activity_trace.raw == "<endpoint_unavailable>", the per-workspace /activity endpoint isn't wired on the target build, so the bound measurement is INCONCLUSIVE on the platform-ceiling question. Either wire the endpoint or substitute the equivalent Datadog query for the missing trace. Note that /activity accepts a since_secs query parameter; see the endpoint handler for the supported range.
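A post-run check for this caveat could look like the sketch below. It assumes the raw value has already been extracted from the harness output (for example with `jq -r '.activity_trace.raw'`, if the output is JSON); the function name and the TRACE_AVAILABLE label are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: classify_activity_trace and the TRACE_AVAILABLE label are
# hypothetical; only the "<endpoint_unavailable>" sentinel is from the README.

classify_activity_trace() {
  local raw="$1"
  if [[ "$raw" == "<endpoint_unavailable>" ]]; then
    # No trace was captured, so the platform-ceiling question stays open.
    echo "INCONCLUSIVE"
  else
    echo "TRACE_AVAILABLE"
  fi
}

classify_activity_trace '<endpoint_unavailable>'
classify_activity_trace '[{"type":"a2a"}]'
```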

Other scripts

cleanup-rogue-workspaces.sh — emergency teardown for leaked workspaces. Prompts for confirmation. Pair with the harnesses if a cleanup trap fails (see cleanup_*_failed events).
staging-smoke.sh — quick smoke test for the staging canary fleet (formerly canary-smoke.sh).
dev-start.sh — local-dev platform bring-up.

The rest are self-documenting in their header comments.