molecule-core/tests/e2e
core-devops 2e8603f940
Some checks failed
qa-review / approved (pull_request) Waiting to run
Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 15s
E2E Peer Visibility (literal MCP list_peers) / E2E Peer Visibility (pull_request) Successful in 30s
Handlers Postgres Integration / detect-changes (pull_request) Successful in 22s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 35s
Lint curl status-code capture / Scan workflows for curl status-capture pollution (pull_request) Successful in 20s
CI / Detect changes (pull_request) Successful in 49s
E2E API Smoke Test / detect-changes (pull_request) Successful in 50s
E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 49s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 40s
security-review / approved (pull_request) Failing after 41s
lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m33s
gate-check-v3 / gate-check (pull_request) Successful in 1m4s
sop-checklist / all-items-acked (pull_request) Successful in 25s
Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 1m30s
Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 10s
sop-tier-check / tier-check (pull_request) Successful in 25s
Lint workflow YAML (Gitea-1.22.6-hostile shapes) / Lint workflow YAML for Gitea-1.22.6-hostile shapes (pull_request) Successful in 1m50s
E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 11s
Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 7s
lint-continue-on-error-tracking / lint-continue-on-error-tracking (pull_request) Successful in 2m51s
Lint pre-flip continue-on-error / Verify continue-on-error flips have run-log proof (pull_request) Successful in 2m48s
lint-required-context-exists-in-bp / lint-required-context-exists-in-bp (pull_request) Failing after 2m50s
E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 1m46s
CI / Python Lint & Test (pull_request) Successful in 7m36s
CI / Canvas (Next.js) (pull_request) Successful in 19m17s
CI / Canvas Deploy Reminder (pull_request) Successful in 14s
CI / Platform (Go) (pull_request) Failing after 31m20s
CI / all-required (pull_request) Failing after 41m2s
test(e2e): gate fresh-provision peer-visibility via the literal MCP list_peers call
Hermes and OpenClaw were reported "fleet-verified / cascade-complete" off
proxy signals (registry registration + heartbeat; model round-trip 200)
while a freshly-provisioned workspace asked "can you see your peers" on
canvas actually FAILS (Hermes: 401 on the molecule MCP list_peers call;
OpenClaw: native sessions_list fallback, no platform peers). Tasks
#142/#159 were even marked "completed" under this proxy-verification flaw.

This adds a dedicated staging-E2E gate that codifies the LITERAL
user-facing path so it can never silently regress:

- New e2e-peer-visibility.yml + tests/e2e/test_peer_visibility_mcp_staging.sh.
- Provisions a brand-new throwaway org via the real CP provisioning path
  + one sibling workspace per runtime under test (hermes, openclaw,
  claude-code) under a shared parent.
- For each runtime, drives the byte-for-byte JSON-RPC tools/call
  name=list_peers envelope to POST /workspaces/:id/mcp using that
  workspace's OWN bearer token, through the real WorkspaceAuth +
  MCPRateLimiter chain. NOT a proxy: not GET /registry/:id/peers, not
  /health, not the heartbeat table.
- Asserts HTTP 200 + JSON-RPC result (not error) + the returned peer set
  literally contains the other provisioned sibling IDs (not empty, not a
  native-sessions fallback).
- Scoped teardown only of the e2e-pv-<run_id> org this run created
  (script EXIT trap + workflow always() net + sweep-stale-e2e-orgs as the
  final 'e2e-' prefix net) — never a cluster-wide cleanup.

Honest gate, NO continue-on-error: it is RED on today's broken behavior
by design and goes green only when the in-flight Hermes-401 +
OpenClaw-MCP-wiring root-cause fixes actually land. Landed NON-required
(not in branch_protections) so it does not wedge unrelated merges while
red; flip-to-required checklist tracked in molecule-core#1296.

Gitea-1.22.6 / act_runner hardening honored: mirrored actions/checkout
SHA (the one e2e-staging-canvas.yml uses successfully), per-SHA
concurrency, workflow-level GITHUB_SERVER_URL, no cross-repo uses.
Passes lint-workflow-yaml, lint-continue-on-error-tracking,
lint-required-no-paths locally.

Refs: molecule-core#1296

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-15 23:08:56 -07:00
..
lib test(e2e): pin pick_model_slug behavior with bash unit tests 2026-05-03 12:04:12 -07:00
_extract_token.py chore: apply round-7 review nits 2026-04-13 17:08:45 -07:00
_lib.sh feat(platform): GET /admin/workspaces/:id/test-token for E2E (#6) 2026-04-14 09:35:26 -07:00
lint_cleanup_traps.sh test(e2e): plug /tmp scratch leaks in 3 shell E2E tests + add CI lint gate (RFC #2873 iter 2) 2026-05-05 04:21:26 -07:00
STAGING_SAAS_E2E.md fix(ci): canonicalize MOLECULE_STAGING_ADMIN_TOKEN -> CP_STAGING_ADMIN_API_TOKEN (post-#443 rebase) + drop staging-smoke continue-on-error 2026-05-11 04:33:56 -07:00
test_2307_peer_visibility_staging.sh test(e2e): add staging peer-visibility harness for #2307 2026-04-29 13:26:24 -07:00
test_a2a_e2e.sh initial commit — Molecule AI platform 2026-04-13 11:55:37 -07:00
test_activity_e2e.sh chore: apply code-review round-6 suggestions 2026-04-13 17:08:45 -07:00
test_api.sh fix(e2e): stop asserting current_task on public workspace GET (#966) 2026-04-19 02:19:15 -07:00
test_chat_attachments_e2e.sh test(e2e): plug /tmp scratch leaks in 3 shell E2E tests + add CI lint gate (RFC #2873 iter 2) 2026-05-05 04:21:26 -07:00
test_chat_attachments_multiruntime_e2e.sh test(e2e): plug /tmp scratch leaks in 3 shell E2E tests + add CI lint gate (RFC #2873 iter 2) 2026-05-05 04:21:26 -07:00
test_chat_upload_e2e.sh feat(chat_files): rewrite Upload as HTTP-forward to workspace (RFC #2312, PR-C) 2026-04-29 14:26:37 -07:00
test_claude_code_e2e.sh chore: final open-source cleanup — binary, stale paths, private refs 2026-04-18 00:38:55 -07:00
test_comprehensive_e2e.sh fix(e2e): make provisioning-status assertions robust to CI environment 2026-04-13 17:31:07 -07:00
test_dev_mode.sh fix(quickstart): hotfixes discovered during live testing session 2026-04-23 14:57:18 -07:00
test_harness_rc_normalization.sh fix(e2e-sanity): normalize unexpected curl exit codes in cleanup trap (#2159) 2026-04-27 02:55:44 -07:00
test_mcp_stdio_staging.sh fix(e2e): suppress shellcheck SC2034 on intentionally-unused vars in test_mcp_stdio_staging.sh 2026-05-13 12:26:37 +00:00
test_model_slug.sh test(e2e): pin pick_model_slug behavior with bash unit tests 2026-05-03 12:04:12 -07:00
test_notify_attachments_e2e.sh test(e2e): plug /tmp scratch leaks in 3 shell E2E tests + add CI lint gate (RFC #2873 iter 2) 2026-05-05 04:21:26 -07:00
test_peer_visibility_mcp_staging.sh test(e2e): gate fresh-provision peer-visibility via the literal MCP list_peers call 2026-05-15 23:08:56 -07:00
test_poll_mode_chat_upload_e2e.sh test(e2e): add poll-mode chat upload E2E and wire into e2e-api.yml 2026-05-05 13:08:55 -07:00
test_poll_mode_e2e.sh test(e2e): Phase 3.5 — wheel parser classifies real server response (#2967) 2026-05-05 17:31:45 -07:00
test_priority_runtimes_e2e.sh feat(e2e): extend priority-runtimes test to cover all 8 templates 2026-04-27 05:57:59 -07:00
test_saas_tenant.sh chore: final open-source cleanup — binary, stale paths, private refs 2026-04-18 00:38:55 -07:00
test_secrets_dispatch.sh feat(synth-e2e): switch canary to claude-code + MiniMax-M2.7-highspeed 2026-05-03 15:35:14 -07:00
test_staging_external_runtime.sh test(e2e): read delivery_mode from register response, not GET 2026-04-30 10:35:21 -07:00
test_staging_full_saas.sh fix(e2e): surface terminal diagnose detail 2026-05-12 23:43:03 -07:00
test_workspace_abilities_e2e.sh test(e2e): workspace broadcast and talk-to-user abilities 2026-05-15 12:03:08 -07:00