molecule-ai/molecule-core

Fork 2

Files

T

History

cp-be 680434a8e6

Lint shellcheck (arm64 pilot) / shellcheck-arm64 (pilot) (pull_request) Waiting to run

Details

Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 4s

Details

CI / Python Lint & Test (pull_request) Successful in 4s

Details

CI / Detect changes (pull_request) Successful in 9s

Details

E2E Peer Visibility (literal MCP list_peers) / E2E Peer Visibility (pull_request) Has been skipped

Details

E2E API Smoke Test / detect-changes (pull_request) Successful in 9s

Details

E2E Chat / detect-changes (pull_request) Successful in 9s

Details

E2E Staging SaaS (full lifecycle) / E2E Staging SaaS (pull_request) Has been skipped

Details

E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 8s

Details

Handlers Postgres Integration / detect-changes (pull_request) Successful in 11s

Details

Harness Replays / detect-changes (pull_request) Successful in 11s

Details

Lint forbidden tenant-env keys / Scan workspace_secrets writers for forbidden env keys (pull_request) Successful in 9s

Details

Lint no tenant GITEA or GITHUB token write / Scan for repo-host token write into tenant workspace surface (pull_request) Successful in 4s

Details

Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 6s

Details

E2E Staging SaaS (full lifecycle) / pr-validate (pull_request) Successful in 34s

Details

gate-check-v3 / gate-check (pull_request) Successful in 6s

Details

qa-review / approved (pull_request) Failing after 4s

Details

security-review / approved (pull_request) Failing after 5s

Details

sop-checklist / na-declarations (pull_request) N/A: (none)

Details

sop-checklist / review-refire (pull_request) Has been skipped

Details

E2E Peer Visibility (literal MCP list_peers) / E2E Peer Visibility (local) (pull_request) Successful in 50s

Details

sop-checklist / all-items-acked (pull_request) Successful in 7s

Details

sop-tier-check / tier-check (pull_request) Successful in 6s

Details

CI / Canvas (Next.js) (pull_request) Successful in 3s

Details

CI / Shellcheck (E2E scripts) (pull_request) Successful in 14s

Details

E2E Chat / E2E Chat (pull_request) Successful in 3s

Details

E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 12s

Details

lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 59s

Details

Harness Replays / Harness Replays (pull_request) Successful in 5s

Details

E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 1m28s

Details

CI / Canvas Deploy Reminder (pull_request) Has been skipped

Details

Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 2m17s

Details

E2E Staging External Runtime / E2E Staging External Runtime (pull_request) Successful in 5m28s

Details

CI / Platform (Go) (pull_request) Successful in 5m4s

Details

CI / all-required (pull_request) Compensating status: required jobs SUCCESS individually; aggregate hit 40m timeout.

audit-force-merge / audit (pull_request) Successful in 5s

Details

test(e2e): patch 3 more non-external POST /workspaces sites for MODEL_REQUIRED contract

Follow-up to the prior commits in this PR — E2E API Smoke run on commit
a3c15bc9 surfaced 3 remaining E2E scripts that POST /workspaces without
a model field AND without the external-runtime exemption, so they 422
under the new MODEL_REQUIRED gate.

- tests/e2e/test_notify_attachments_e2e.sh:96 — bare {"name":"Notify E2E","tier":1}
  (no runtime → defaults to langgraph). Adds "model":"anthropic:claude-opus-4-7"
  to match the deleted DefaultModel("") return value.

- tests/e2e/test_priority_runtimes_e2e.sh:192 — runtime:claude-code without
  model. Adds "model":"sonnet" to match the deleted DefaultModel("claude-code")
  return value.

- tests/e2e/test_priority_runtimes_e2e.sh:384 — runtime:gemini-cli without
  model. Adds "model":"gemini-2.0-flash" — gemini-cli routes via the gemini
  provider (per derive-provider.sh), so a gemini:* slug picks the right
  provider chain.

Scripts inspected but NOT patched (already external-exempt):
- test_api.sh — both POSTs use runtime:external + external:true
- test_today_pr_coverage_e2e.sh — both POSTs use runtime:external + external:true
- test_priority_runtimes_e2e.sh:255 — runtime:hermes ALREADY had "model":"openai/gpt-4o"
- test_priority_runtimes_e2e.sh:326 — already had "model":"openai/gpt-4o-mini"

Other scripts that POST without model (test_a2a_e2e.sh:133,
test_activity_e2e.sh:218, test_dev_mode.sh:72, test_workspace_abilities_e2e.sh,
test_comprehensive_e2e.sh, test_mcp_stdio_staging.sh, test_chat_upload_e2e.sh,
tests/harness/seed.sh) are NOT triggered by the e2e-api.yml workflow that
this PR's CI runs — they're tracked for a follow-up sweep once #1667 lands.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-21 22:16:11 -07:00

e2e

test(e2e): patch 3 more non-external POST /workspaces sites for MODEL_REQUIRED contract

2026-05-21 22:16:11 -07:00

harness

feat(uploads): bump cap to 100MB + correct-reason error messages

2026-05-19 20:23:04 -07:00

ops

ops: add Railway SHA-pin drift audit script + regression test (#2001 )

2026-04-27 05:01:23 -07:00

README.md

chore(runtime): delete core workspace copy

2026-05-20 14:47:55 -07:00

test_ci_required_drift.py

feat(internal#219 §4+§6): port ci-required-drift + audit-force-merge sidecar from CP

2026-05-11 00:35:25 -07:00

test_detect_changes.py

ci: fix PR path filter base diff

2026-05-20 23:12:27 -07:00

test_heavy_e2e_pr_gating.py

ci: keep browser e2e out of normal pr path

2026-05-20 21:02:33 -07:00

test_lint_bp_context_emit_match.py

feat(ci)(hard-gate): lint-bp-context-emit-match (Tier 2f)

2026-05-12 14:37:43 +00:00

test_lint_continue_on_error_tracking.py

feat(ci)(hard-gate): lint-continue-on-error-tracking (Tier 2e)

2026-05-12 07:05:07 +00:00

test_lint_curl_status_capture.py

test curl status capture workflow lint

2026-05-12 13:40:31 -07:00

test_lint_mask_pr_atomicity.py

feat(ci)(hard-gate): lint-mask-pr-atomicity (Tier 2d)

2026-05-11 23:06:18 -07:00

test_lint_required_context_exists_in_bp.py

feat(ci)(hard-gate): lint-required-context-exists-in-bp (Tier 2g)

2026-05-12 14:37:29 +00:00

test_lint_required_no_paths.py

feat(ci)(hard-gate): lint-required-workflows-no-paths-filter (structural enforcement of feedback_path_filtered_workflow_cant_be_required)

2026-05-12 05:48:22 +00:00

test_lint_workflow_yaml.py

ci: share path filter helper

2026-05-20 22:32:17 -07:00

test_main_red_watchdog.py

fix(watchdog): add HEAD-recheck + settling delay to suppress cancel-cascade false-positives (#1635 )

2026-05-21 06:08:40 +00:00

test_status_reaper.py

ci: compensate cancelled push status noise

2026-05-21 00:19:56 -07:00

README.md

Tests

This repo uses the standard monorepo testing convention: unit tests live with their package, cross-component E2E tests live here.

Where to find tests

Scope	Location
Go unit + integration (platform, CLI, handlers)	`workspace-server/*/_test.go` — run with `cd workspace-server && go test -race ./...`
TypeScript unit (canvas components, hooks, store)	`canvas/src/**/__tests__/` — run with `cd canvas && npm test -- --run`
TypeScript unit (MCP server handlers)	`mcp-server/src/__tests__/` — run with `cd mcp-server && npx jest`
Python unit (workspace runtime, adapters)	`molecule-ai-workspace-runtime/tests/` in the standalone runtime repo
Python unit (SDK: plugin + remote agent)	`sdk/python/tests/` — run with `cd sdk/python && python3 -m pytest`
Cross-component E2E (spans platform + runtime + HTTP)	`tests/e2e/` ← you are here

Why split this way

Go requires co-located _test.go files to access unexported symbols.
Per-package test commands keep the inner loop fast — changing canvas doesn't re-run Go tests.
tests/e2e/ covers scenarios that no single package owns: a full workspace lifecycle, A2A across two provisioned agents, delegation chains, bundle round-trips.

Running E2E

Every E2E script here assumes the platform is running at localhost:8080 and (where noted) provisioned agents are online. See the header comment of each .sh for specifics.

Cleaning up rogue test workspaces

If an E2E run aborts before its teardown runs (Ctrl-C, crash, CI timeout), the platform can be left with workspaces whose config volume is stale or empty — Docker's unless-stopped restart policy then spins those containers in a FileNotFoundError loop. The platform's pre-flight check (#17) marks such workspaces failed on the next restart, but a manual cleanup is useful:

bash scripts/cleanup-rogue-workspaces.sh               # deletes ws with id/name starting aaaaaaaa-, bbbbbbbb-, cccccccc-, test-ws-
MOLECULE_URL=http://host:8080 bash scripts/cleanup-rogue-workspaces.sh

The script DELETEs each matching workspace via the API and force-removes the ws-<id[:12]> container as a belt-and-suspenders fallback.