fix(e2e): wait for routable workspace before A2A #1666

Merged
hongming merged 1 commits from fix/e2e-a2a-readiness-body into main 2026-05-22 04:37:56 +00:00
Owner

Summary\n- require each staged workspace to be online with a non-empty routable URL before continuing\n- capture A2A HTTP status/body on failure and redact token-like fields before printing\n\n## Evidence\n- live probe against staging synth run 92556 returned HTTP 503 with body: workspace has no URL / status provisioning\n- bash -n tests/e2e/test_staging_full_saas.sh\n- shellcheck -x tests/e2e/test_staging_full_saas.sh\n- git diff --check

## Summary\n- require each staged workspace to be online with a non-empty routable URL before continuing\n- capture A2A HTTP status/body on failure and redact token-like fields before printing\n\n## Evidence\n- live probe against staging synth run 92556 returned HTTP 503 with body: workspace has no URL / status provisioning\n- bash -n tests/e2e/test_staging_full_saas.sh\n- shellcheck -x tests/e2e/test_staging_full_saas.sh\n- git diff --check
hongming added 1 commit 2026-05-22 03:41:44 +00:00
fix(e2e): wait for routable workspace before A2A
Lint shellcheck (arm64 pilot) / shellcheck-arm64 (pilot) (pull_request) Waiting to run
Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 4s
CI / Python Lint & Test (pull_request) Successful in 4s
CI / Detect changes (pull_request) Successful in 6s
E2E API Smoke Test / detect-changes (pull_request) Successful in 7s
E2E Chat / detect-changes (pull_request) Successful in 6s
E2E Staging SaaS (full lifecycle) / E2E Staging SaaS (pull_request) Has been skipped
E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 7s
Handlers Postgres Integration / detect-changes (pull_request) Successful in 7s
Lint forbidden tenant-env keys / Scan workspace_secrets writers for forbidden env keys (pull_request) Successful in 7s
Lint no tenant GITEA or GITHUB token write / Scan for repo-host token write into tenant workspace surface (pull_request) Successful in 6s
gate-check-v3 / gate-check (pull_request) Successful in 15s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 15s
qa-review / approved (pull_request) Failing after 15s
sop-checklist / na-declarations (pull_request) N/A: (none)
security-review / approved (pull_request) Failing after 4s
sop-checklist / all-items-acked (pull_request) Successful in 4s
sop-checklist / review-refire (pull_request) Has been skipped
E2E Staging SaaS (full lifecycle) / pr-validate (pull_request) Successful in 30s
sop-tier-check / tier-check (pull_request) Successful in 6s
CI / Platform (Go) (pull_request) Successful in 4s
CI / Canvas (Next.js) (pull_request) Successful in 4s
E2E Chat / E2E Chat (pull_request) Successful in 3s
E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 2s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 2s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 12s
CI / all-required (pull_request) Successful in 1m6s
lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m5s
E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 1m4s
e348f6a93c
app-fe force-pushed fix/e2e-a2a-readiness-body from e348f6a93c to 6f98ac062e 2026-05-22 04:32:05 +00:00 Compare
Author
Owner

Investigation update:

  • Live staging smoke after the E2E model-selection patch still failed A2A, but the failure is now pinned.
  • It is not SG/EIC drift: the live workspace SG had tcp/8000 from 10.10.0.0/16, terminal diagnose passed, and the container served /.well-known/agent-card.json locally.
  • SSM docker logs showed adapter setup failed because config emitted provider='MiniMax-M2'. That came from stale MODEL_PROVIDER=model-id being treated as provider when MODEL was also set.
  • Follow-up controlplane fix is in molecule-controlplane#239: ignore duplicate/non-registry MODEL_PROVIDER when MODEL already carries the model id.

This PR still keeps the E2E-side improvements: wait/retry with sanitized 503 bodies, log MODEL_SLUG, and select claude-code model ids that match the injected credential path.

Investigation update: - Live staging smoke after the E2E model-selection patch still failed A2A, but the failure is now pinned. - It is not SG/EIC drift: the live workspace SG had tcp/8000 from 10.10.0.0/16, terminal diagnose passed, and the container served /.well-known/agent-card.json locally. - SSM docker logs showed adapter setup failed because config emitted provider='MiniMax-M2'. That came from stale MODEL_PROVIDER=model-id being treated as provider when MODEL was also set. - Follow-up controlplane fix is in molecule-controlplane#239: ignore duplicate/non-registry MODEL_PROVIDER when MODEL already carries the model id. This PR still keeps the E2E-side improvements: wait/retry with sanitized 503 bodies, log MODEL_SLUG, and select claude-code model ids that match the injected credential path.
hongming merged commit 65f4ffb0ac into main 2026-05-22 04:37:56 +00:00
Sign in to join this conversation.
No Reviewers
2 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: molecule-ai/molecule-core#1666