molecule-core/tests
core-devops 2e8603f940
Some checks failed
qa-review / approved (pull_request) Waiting to run
Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 15s
E2E Peer Visibility (literal MCP list_peers) / E2E Peer Visibility (pull_request) Successful in 30s
Handlers Postgres Integration / detect-changes (pull_request) Successful in 22s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 35s
Lint curl status-code capture / Scan workflows for curl status-capture pollution (pull_request) Successful in 20s
CI / Detect changes (pull_request) Successful in 49s
E2E API Smoke Test / detect-changes (pull_request) Successful in 50s
E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 49s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 40s
security-review / approved (pull_request) Failing after 41s
lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m33s
gate-check-v3 / gate-check (pull_request) Successful in 1m4s
sop-checklist / all-items-acked (pull_request) Successful in 25s
Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 1m30s
Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 10s
sop-tier-check / tier-check (pull_request) Successful in 25s
Lint workflow YAML (Gitea-1.22.6-hostile shapes) / Lint workflow YAML for Gitea-1.22.6-hostile shapes (pull_request) Successful in 1m50s
E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 11s
Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 7s
lint-continue-on-error-tracking / lint-continue-on-error-tracking (pull_request) Successful in 2m51s
Lint pre-flip continue-on-error / Verify continue-on-error flips have run-log proof (pull_request) Successful in 2m48s
lint-required-context-exists-in-bp / lint-required-context-exists-in-bp (pull_request) Failing after 2m50s
E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 1m46s
CI / Python Lint & Test (pull_request) Successful in 7m36s
CI / Canvas (Next.js) (pull_request) Successful in 19m17s
CI / Canvas Deploy Reminder (pull_request) Successful in 14s
CI / Platform (Go) (pull_request) Failing after 31m20s
CI / all-required (pull_request) Failing after 41m2s
test(e2e): gate fresh-provision peer-visibility via the literal MCP list_peers call
Hermes and OpenClaw were reported "fleet-verified / cascade-complete" off
proxy signals (registry registration + heartbeat; model round-trip 200)
while a freshly-provisioned workspace asked "can you see your peers" on
canvas actually FAILS (Hermes: 401 on the molecule MCP list_peers call;
OpenClaw: native sessions_list fallback, no platform peers). Tasks
#142/#159 were even marked "completed" under this proxy-verification flaw.

This adds a dedicated staging-E2E gate that codifies the LITERAL
user-facing path so it can never silently regress:

- New e2e-peer-visibility.yml + tests/e2e/test_peer_visibility_mcp_staging.sh.
- Provisions a brand-new throwaway org via the real CP provisioning path
  + one sibling workspace per runtime under test (hermes, openclaw,
  claude-code) under a shared parent.
- For each runtime, drives the byte-for-byte JSON-RPC tools/call
  name=list_peers envelope to POST /workspaces/:id/mcp using that
  workspace's OWN bearer token, through the real WorkspaceAuth +
  MCPRateLimiter chain. NOT a proxy: not GET /registry/:id/peers, not
  /health, not the heartbeat table.
- Asserts HTTP 200 + JSON-RPC result (not error) + the returned peer set
  literally contains the other provisioned sibling IDs (not empty, not a
  native-sessions fallback).
- Scoped teardown only of the e2e-pv-<run_id> org this run created
  (script EXIT trap + workflow always() net + sweep-stale-e2e-orgs as the
  final 'e2e-' prefix net) — never a cluster-wide cleanup.

Honest gate, NO continue-on-error: it is RED on today's broken behavior
by design and goes green only when the in-flight Hermes-401 +
OpenClaw-MCP-wiring root-cause fixes actually land. Landed NON-required
(not in branch_protections) so it does not wedge unrelated merges while
red; flip-to-required checklist tracked in molecule-core#1296.

Gitea-1.22.6 / act_runner hardening honored: mirrored actions/checkout
SHA (the one e2e-staging-canvas.yml uses successfully), per-SHA
concurrency, workflow-level GITHUB_SERVER_URL, no cross-repo uses.
Passes lint-workflow-yaml, lint-continue-on-error-tracking,
lint-required-no-paths locally.

Refs: molecule-core#1296

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-15 23:08:56 -07:00
..
e2e test(e2e): gate fresh-provision peer-visibility via the literal MCP list_peers call 2026-05-15 23:08:56 -07:00
harness ci(docker): pin base image digests in all Dockerfiles 2026-05-09 23:56:39 +00:00
ops ops: add Railway SHA-pin drift audit script + regression test (#2001) 2026-04-27 05:01:23 -07:00
README.md chore: final open-source cleanup — binary, stale paths, private refs 2026-04-18 00:38:55 -07:00
test_ci_required_drift.py feat(internal#219 §4+§6): port ci-required-drift + audit-force-merge sidecar from CP 2026-05-11 00:35:25 -07:00
test_lint_bp_context_emit_match.py feat(ci)(hard-gate): lint-bp-context-emit-match (Tier 2f) 2026-05-12 14:37:43 +00:00
test_lint_continue_on_error_tracking.py feat(ci)(hard-gate): lint-continue-on-error-tracking (Tier 2e) 2026-05-12 07:05:07 +00:00
test_lint_curl_status_capture.py test curl status capture workflow lint 2026-05-12 13:40:31 -07:00
test_lint_mask_pr_atomicity.py feat(ci)(hard-gate): lint-mask-pr-atomicity (Tier 2d) 2026-05-11 23:06:18 -07:00
test_lint_required_context_exists_in_bp.py feat(ci)(hard-gate): lint-required-context-exists-in-bp (Tier 2g) 2026-05-12 14:37:29 +00:00
test_lint_required_no_paths.py feat(ci)(hard-gate): lint-required-workflows-no-paths-filter (structural enforcement of feedback_path_filtered_workflow_cant_be_required) 2026-05-12 05:48:22 +00:00
test_lint_workflow_yaml.py ci: fix publish docker healthcheck pipefail 2026-05-14 04:11:40 +00:00
test_main_red_watchdog.py fix(ci): status-reaper rev4 reads per-context "status" key not "state" (compensation was unreachable since rev1) 2026-05-11 20:44:20 -07:00
test_status_reaper.py ci: narrow status reaper soft skip to commit listing 2026-05-13 21:41:58 -07:00

Tests

This repo uses the standard monorepo testing convention: unit tests live with their package, cross-component E2E tests live here.

Where to find tests

Scope Location
Go unit + integration (platform, CLI, handlers) workspace-server/**/*_test.go — run with cd workspace-server && go test -race ./...
TypeScript unit (canvas components, hooks, store) canvas/src/**/__tests__/ — run with cd canvas && npm test -- --run
TypeScript unit (MCP server handlers) mcp-server/src/__tests__/ — run with cd mcp-server && npx jest
Python unit (workspace runtime, adapters) workspace/tests/ — run with cd workspace && python3 -m pytest
Python unit (SDK: plugin + remote agent) sdk/python/tests/ — run with cd sdk/python && python3 -m pytest
Cross-component E2E (spans platform + runtime + HTTP) tests/e2e/you are here

Why split this way

  • Go requires co-located _test.go files to access unexported symbols.
  • Per-package test commands keep the inner loop fast — changing canvas doesn't re-run Go tests.
  • tests/e2e/ covers scenarios that no single package owns: a full workspace lifecycle, A2A across two provisioned agents, delegation chains, bundle round-trips.

Running E2E

Every E2E script here assumes the platform is running at localhost:8080 and (where noted) provisioned agents are online. See the header comment of each .sh for specifics.

Cleaning up rogue test workspaces

If an E2E run aborts before its teardown runs (Ctrl-C, crash, CI timeout), the platform can be left with workspaces whose config volume is stale or empty — Docker's unless-stopped restart policy then spins those containers in a FileNotFoundError loop. The platform's pre-flight check (#17) marks such workspaces failed on the next restart, but a manual cleanup is useful:

bash scripts/cleanup-rogue-workspaces.sh               # deletes ws with id/name starting aaaaaaaa-, bbbbbbbb-, cccccccc-, test-ws-
MOLECULE_URL=http://host:8080 bash scripts/cleanup-rogue-workspaces.sh

The script DELETEs each matching workspace via the API and force-removes the ws-<id[:12]> container as a belt-and-suspenders fallback.