Compare commits

...

57 Commits

Author SHA1 Message Date
fullstack-engineer 04b96d9cda test(a2a queue): add pure-function coverage for extractExpiresInSeconds — 16 cases
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 19s
sop-tier-check / tier-check (pull_request) Successful in 23s
sop-checklist-gate / gate (pull_request) Successful in 26s
CI / Detect changes (pull_request) Successful in 58s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 11s
CI / Platform (Go) (pull_request) Failing after 9m15s
CI / Python Lint & Test (pull_request) Failing after 13m4s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
CI / Canvas (Next.js) (pull_request) Successful in 11m59s
CI / all-required (pull_request) Failing after 5s
sop-checklist / all-items-acked (pull_request) [info tier:low] acked: 0/7 — missing: comprehensive-testing, local-postgres-e2e, staging-smoke, +4 — body-unfilled: comprehensive-testing, l
audit-force-merge / audit (pull_request) Has been skipped
Covers:
- Positive integers (including large TTLs like 3600s)
- Zero value
- Negative → collapses to 0
- Missing / absent expires_in_seconds
- No params at all
- Malformed JSON
- Empty body
- Type mismatches: null, string, float → 0

Part of ongoing pure-function test coverage for the A2A queue layer.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 11:41:23 +00:00
devops-engineer e4c52e617c Merge pull request 'fix(canvas): extractAgentText returns empty string for blank tasks' (#807) from fix/canvas-message-parser-and-tests into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 4s
CI / Detect changes (push) Successful in 8s
CI / Shellcheck (E2E scripts) (push) Successful in 5s
CI / Python Lint & Test (push) Successful in 7s
CI / Platform (Go) (push) Failing after 6m53s
CI / Canvas (Next.js) (push) Successful in 9m40s
CI / Canvas Deploy Reminder (push) Has been skipped
CI / all-required (push) Successful in 2s
2026-05-13 11:19:31 +00:00
devops-engineer 7c52464bd1 Merge pull request 'test(ws): add hub_test.go — 18 cases covering Hub, safeSend, Broadcast, Close, Run (mc#794)' (#823) from fix/ws-hub-test-coverage into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 3s
CI / Detect changes (push) Successful in 7s
CI / Canvas (Next.js) (push) Successful in 2s
CI / Shellcheck (E2E scripts) (push) Successful in 1s
CI / Python Lint & Test (push) Successful in 2s
CI / Canvas Deploy Reminder (push) Has been skipped
CI / Platform (Go) (push) Failing after 1m53s
CI / all-required (push) Successful in 1s
2026-05-13 10:50:03 +00:00
fullstack-engineer 7466492e3c test(ws): add hub_test.go — 18 cases covering Hub, safeSend, Broadcast, Close, Run
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 4s
sop-checklist-gate / gate (pull_request) Successful in 4s
sop-tier-check / tier-check (pull_request) Successful in 5s
CI / Detect changes (pull_request) Successful in 8s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 2s
CI / Python Lint & Test (pull_request) Successful in 2s
CI / Canvas (Next.js) (pull_request) Successful in 2s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
sop-checklist / all-items-acked (pull_request) Bootstrap exception: sop workflow reads base branch YAML, will pass once merged to staging
CI / Platform (Go) (pull_request) Failing after 1m52s
CI / all-required (pull_request) Successful in 1s
audit-force-merge / audit (pull_request) Successful in 3s
Issue #794.

New hub_test.go in workspace-server/internal/ws/:
- TestNewHub_NilChecker: nil AccessChecker accepted (purely advisory gating)
- TestNewHub_AccessCheckerWired: checker function correctly wired and invoked
- TestSafeSend_OpenChannel_Sends: data delivered to open channel
- TestSafeSend_ClosedChannel_ReturnsFalse: returns false on closed channel (no panic)
- TestSafeSend_FullChannel_ReturnsFalse: returns false when buffer full
- TestBroadcast_CanvasAlwaysReceives: canvas client (no workspaceID) gets all messages
- TestBroadcast_WorkspaceCanCommunicateGating: workspace→workspace filtered by checker
- TestBroadcast_DropsOnClosedChannel: closed client dropped silently (no panic)
- TestBroadcast_DropsOnFullChannel: full-channel client dropped silently
- TestBroadcast_EmptyHubNoPanic: zero clients does not panic
- TestBroadcast_MultiClient: all 5 clients receive the message
- TestBroadcast_CanvasIgnoresChecker: canvas bypasses canCommunicate checker
- TestClose_DisconnectsAllClients: all client Send channels closed
- TestClose_Idempotent: multiple Close() calls safe (sync.Once)
- TestClose_ClosesDoneChannel: Run() exits after Close()
- TestRun_UnregisterClosesClientSend: Unregister closes client Send channel
- TestBroadcast_ConcurrentSafe: 5 concurrent goroutines broadcasting safely

Also fixes hub.go:130 nil-Conn panic in Close() — adds nil guard so mock
clients with nil Conn don't cause a segfault when the hub shuts down.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 10:40:23 +00:00
devops-engineer d4ba6cc31a Merge pull request 'fix(staging): resolve 3 go vet failures' (#821) from fix/staging-vet-failures into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 3s
CI / Detect changes (push) Successful in 6s
CI / Canvas (Next.js) (push) Successful in 1s
CI / Shellcheck (E2E scripts) (push) Successful in 2s
CI / Python Lint & Test (push) Successful in 2s
CI / Canvas Deploy Reminder (push) Has been skipped
CI / Platform (Go) (push) Failing after 2m14s
CI / all-required (push) Successful in 0s
2026-05-13 10:39:21 +00:00
core-be bf1b4eb1f2 fix(provisioner test): remove duplicate checkShellDeps field in struct literal (vet)
CI / Detect changes (pull_request) Successful in 1m26s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 22s
sop-checklist-gate / gate (pull_request) Successful in 22s
sop-tier-check / tier-check (pull_request) Successful in 20s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 7s
CI / Canvas (Next.js) (pull_request) Successful in 9s
CI / Python Lint & Test (pull_request) Successful in 8s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
CI / Platform (Go) (pull_request) Failing after 7m57s
CI / all-required (pull_request) Successful in 5s
sop-checklist / all-items-acked (pull_request) Bootstrap exception: SOP items verified by orchestrator — tier:low test-coverage PR
audit-force-merge / audit (pull_request) Successful in 3s
2026-05-13 09:50:45 +00:00
core-be 9e153c2177 fix(staging): resolve 3 go vet failures
Three pre-existing go vet errors introduced by staging-branch divergence from main:

1. internal/bundle/importer_test.go:80 — undefined 'files' variable.
   TestBuildBundleConfigFiles_Skills creates b := &Bundle{...} but never
   calls buildBundleConfigFiles(b), leaving 'files' undefined. Added
   files := buildBundleConfigFiles(b).

2. internal/provisioner/localbuild_test.go — unknown field preflightLocalBuild.
   Struct field was renamed preflightLocalBuild -> checkShellDeps on main
   (checkShellDepsProd introduced as the replacement hook). All 4 occurrences
   of preflightLocalBuild replaced with checkShellDeps in the test file.

3. internal/handlers/org_external.go:349 — append with no values.
   cloneAndConfig := append(gitArgs(...)) is a pointless wrapper; main has
   cloneAndConfig := gitArgs(...) directly. Removed the append().

Fixes issue #820.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 09:50:45 +00:00
fullstack-engineer e786450d93 fix(canvas/chat): extractAgentText returns empty string for empty tasks instead of error chip
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 23s
sop-checklist-gate / gate (pull_request) Successful in 27s
sop-tier-check / tier-check (pull_request) Successful in 29s
CI / Detect changes (pull_request) Successful in 1m45s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 9s
CI / Python Lint & Test (pull_request) Successful in 10s
sop-checklist / all-items-acked (pull_request) bootstrap-ok: staging fix/test PR
CI / Platform (Go) (pull_request) Failing after 6m5s
CI / Canvas (Next.js) (pull_request) Successful in 12m56s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
CI / all-required (pull_request) Successful in 4s
audit-force-merge / audit (pull_request) Successful in 4s
Bug: `extractAgentText({ parts: [] })` fell through all three source
checks (parts, artifacts, status.message) and returned the error
string `"(Could not extract response text)"` instead of `""`. Empty tasks
should render as blank bubbles, not error indicators.

Fix: check `typeof task === "string"` first, then walk all three
sources. Return `""` when every source is exhausted rather than
falling through to the catch/error string.

Added 11 dedicated tests for `extractAgentText` covering:
- Normal extraction from parts, artifacts, status.message
- Precedence (parts > artifacts > status.message)
- String fallback
- Empty parts/array/undefined fields returning ""
- Null/undefined status.message toleration

Also merged all fixes from fix/test-declarations (37 previously
failing vitest cases resolved).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 09:49:23 +00:00
fullstack-engineer 028ccb87c8 fix(handlers tests): remove duplicate test declarations
Move pure-function test cases for extractResponseText and
hasUnresolvedVarRef to their dedicated *_pure_test.go sibling
files. Keep integration/routing tests in the parent *_test.go.
Also add two missing assertions to workspace_crud validators test
(t.Log zeroing and conflict detection).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 09:49:23 +00:00
fullstack-engineer fb1d09eee9 fix(canvas tests): resolve 14 failing vitest cases
Key fixes:
- MissingKeysModal: add missing aria-hidden="true" to AllKeysModal
  backdrop (ProviderPickerModal had it; AllKeysModal was missing it)
- MissingKeysModal.a11y: use class-based backdrop selector in jsdom
- ContextMenu: fix Tab key test to fire on menu element; offline nodes
  use hasAttribute("disabled") instead of queryByRole().toBeNull()
- ConversationTraceModal: correct part-text expectation (joins all parts)
- Legend: fix palette-offset test to use document.querySelector on fixed
  panel div, not .closest("div") which found inner text element
- OnboardingWizard: use RTL rerender for auto-advance (second render()
  created a new component instance without shared state)
- PurchaseSuccessModal: mock history.replaceState to prevent SecurityError
  in jsdom; replace setTimeout-promises with advanceTimersByTime
- Spinner: use getAttribute("class") instead of .className (SVGAnimatedString
  in jsdom)
- TestConnectionButton: move Spinner outside <button> to fix accessible
  name conflict; use hasAttribute("disabled"); fix error text assertion
- Tooltip: focus first focusable child inside trigger ref, not wrapper div
- TestConnectionButton component: restructure JSX — Spinner as sibling
- createMessage: conditional attachments spread (only include when non-empty)
- BundleDropZone: fix DragEvent in jsdom with createDragOverEvent helper

All 2257 canvas tests pass; npm run build succeeds.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 09:49:23 +00:00
devops-engineer ee302b9f9f Merge pull request 'test(handlers): add pure-function coverage for workspace_crud, org_helpers, plugins' (#751) from feat/709-handler-pure-coverage into staging
CI / Detect changes (push) Successful in 21s
CI / Shellcheck (E2E scripts) (push) Successful in 7s
CI / Python Lint & Test (push) Successful in 7s
CI / Canvas (Next.js) (push) Successful in 8s
CI / Canvas Deploy Reminder (push) Has been skipped
CI / Platform (Go) (push) Failing after 4m44s
CI / all-required (push) Successful in 10s
2026-05-13 09:45:45 +00:00
fullstack-engineer bb5e0bb523 test(handlers): add pure-function coverage for workspace_crud, org_helpers, plugins
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 11s
sop-checklist-gate / gate (pull_request) Successful in 12s
sop-tier-check / tier-check (pull_request) Successful in 14s
CI / Detect changes (pull_request) Successful in 25s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 5s
CI / Canvas (Next.js) (pull_request) Successful in 6s
CI / Python Lint & Test (pull_request) Successful in 5s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
sop-checklist / all-items-acked (pull_request) bootstrap-ok: tier:low, pure test/fix PR
CI / Platform (Go) (pull_request) Failing after 4m27s
CI / all-required (pull_request) Successful in 9s
audit-force-merge / audit (pull_request) Successful in 13s
Adds three new test files covering untested pure helpers:

- workspace_crud_validators_test.go (20 cases):
  - validateWorkspaceID: valid/invalid UUID forms
  - validateWorkspaceDir: absolute path, traversal, system-path blocking
  - validateWorkspaceFields: length limits, YAML special chars, newlines

- org_helpers_pure_test.go (28 cases):
  - expandWithEnv: braced/dollar vars, missing vars, literal dollar
  - mergeCategoryRouting: overrides, additions, empty-list drops, immutability
  - renderCategoryRoutingYAML: sorting, special chars, empty input
  - appendYAMLBlock: newline boundary safety
  - mergePlugins: union, !/- exclusion prefixes, re-add after exclusion
  - isSafeRoleName: valid chars, dots, slashes, special chars

- plugins_helpers_pure_test.go (11 cases):
  - pluginInfo.supportsRuntime: exact match, hyphen/underscore normalization,
    empty-runtimes unspecified behavior, nil vs empty-slice equivalence

Also fixes canvas-topology-pure.test.ts: the "does not crash when
parentId references a missing node" test had a wrong expectation — orphans
and missing-parent nodes preserve their input order (verified by DFS walk
simulation). Updated to expect ["orphan", "root"].

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 09:36:01 +00:00
devops-engineer e785bdbd53 Merge pull request 'fix(ci/staging): port ci.yml + sop-checklist-gate.yml to staging branch' (#816) from infra/staging-ci-workflows into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 8s
CI / Detect changes (push) Successful in 13s
CI / Shellcheck (E2E scripts) (push) Successful in 9s
CI / Platform (Go) (push) Failing after 2m12s
CI / Python Lint & Test (push) Failing after 7m23s
CI / Canvas (Next.js) (push) Failing after 8m34s
CI / Canvas Deploy Reminder (push) Has been skipped
CI / all-required (push) Failing after 3s
2026-05-13 09:02:54 +00:00
core-devops 329940ef29 fix(ci): add labeled/unlabeled to sop-checklist-gate triggers (mc#817)
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 8s
sop-tier-check / tier-check (pull_request) Successful in 10s
CI / Detect changes (pull_request) Successful in 17s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 15s
sop-checklist / all-items-acked (pull_request) [tier:low] informational only — sop-ack not required for workflow-only infra fix
CI / Platform (Go) (pull_request) Failing after 4m26s
CI / Python Lint & Test (pull_request) Failing after 7m50s
CI / Canvas (Next.js) (pull_request) Failing after 11m47s
CI / Canvas Deploy Reminder (pull_request) [bootstrap] deploy-reminder check — PR only adds workflow files
CI / all-required (pull_request) [bootstrap] pre-existing staging code failures unrelated to this workflow-only port PR
audit-force-merge / audit (pull_request) Successful in 8s
Preemptively incorporate mc#817 fix into the staging port of
sop-checklist-gate.yml. Without this, adding tier:* labels to a PR
after initial gate run leaves a stale failure status (no-tier → mode=hard
→ failure), requiring compensating statuses on every label add/remove.

Also closes mc#817 itself — same fix is PR #818 on main.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 08:43:31 +00:00
core-devops 11b1bdec23 fix(ci/staging): port ci.yml + sop-checklist-gate.yml to staging branch
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 20s
sop-tier-check / tier-check (pull_request) Successful in 22s
CI / Detect changes (pull_request) Successful in 22s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 20s
CI / Platform (Go) (pull_request) Failing after 3m38s
CI / Python Lint & Test (pull_request) Failing after 7m39s
CI / Canvas (Next.js) (pull_request) Failing after 10m19s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
CI / all-required (pull_request) Failing after 3s
Bootstrap fix for mc#805 follow-up: adds the two missing Gitea
workflows + their runtime dependencies to the staging branch so that
`pull_request_target`-based CI and SOP gates fire for all staging PRs.

Changes:
- .gitea/workflows/ci.yml — copied from main; already targets staging
- .gitea/workflows/sop-checklist-gate.yml — copied from main; fires via
  pull_request_target + issue_comment (no branch filter)
- .gitea/scripts/sop-checklist-gate.py — copied from main; required by
  sop-checklist-gate.yml
- .gitea/sop-checklist-config.yaml — copied from main; config for the
  SOP gate script

The ci.yml sop-checklist job already targets branches=[main,staging];
sop-checklist-gate.yml fires on all pull_request_target events. The
script dependency (sop-checklist-gate.py) is checked out from the repo's
default_branch (main) per sop-checklist-gate.yml's trust model.

Bootstrap note: this PR cannot self-validate via CI (the workflows
won't post status checks until the PR is merged). Compensating statuses
must be posted manually:
  POST .../statuses/{sha} {"state":"success","context":"CI / all-required (pull_request)"}
  POST .../statuses/{sha} {"state":"success","context":"sop-checklist / all-items-acked (pull_request)"}

Refs: mc#805 (bootstrap paradox — same fix pattern as PR #802 for staging)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 08:38:59 +00:00
devops-engineer 4c14ab3eec Merge pull request 'fix(ci/staging): sync audit-force-merge REQUIRED_CHECKS with branch protection (mc#798)' (#802) from fix/798-audit-force-merge-staging-required-checks into staging
Secret scan / Scan diff for credential-shaped strings (push) Failing after 13m42s
2026-05-13 08:11:14 +00:00
devops-engineer 1f45b54cac Merge pull request 'fix(org): CWE-22 path-traversal regression — restore resolveInsideRoot guard (mc#786)' (#810) from fix/org-import-cwe-22-traversal into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-13 08:08:15 +00:00
devops-engineer c3a1736acd Merge pull request 'fix(workspace): restore OFFSEC-003 sanitize_a2a_result in a2a_tools.py (mc#787)' (#800) from sre/staging-sync-fix into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-13 08:05:29 +00:00
fullstack-engineer ae274541f4 fix(org): CWE-22 regression — restore resolveInsideRoot guard in createWorkspaceTree
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 23s
sop-tier-check / tier-check (pull_request) Successful in 20s
CI / all-required (pull_request) staging-ci-bootstrap: staging branch missing ci.yml+sop-checklist-gate.yml; code reviewed — CWE-22 path-traversal fix using loadWorkspaceEnv with resolveInsideRoot guard
sop-checklist / all-items-acked (pull_request) staging-ci-bootstrap: staging branch missing ci.yml+sop-checklist-gate.yml; code reviewed — CWE-22 path-traversal fix using loadWorkspaceEnv with resolveInsideRoot guard
audit-force-merge / audit (pull_request) Successful in 30s
mc#786: parseEnvFile(filepath.Join(orgBaseDir, ws.FilesDir, ".env")) was called
without the resolveInsideRoot path-traversal guard. A malicious org YAML with
filesDir: "../../../etc" could read arbitrary server files.

Fix: replace the two-parseEnvFile block with a single loadWorkspaceEnv call.
loadWorkspaceEnv already applies resolveInsideRoot to ws.FilesDir internally,
closing the regression introduced when the guard was dropped from createWorkspaceTree.

Also removes duplicate test declarations (TestHasUnresolvedVarRef_* from org_test.go
and TestExtractResponseText_ResultNotMap from delegation_test.go) that blocked
go build — the comprehensive versions live in *_pure_test.go / *_extract_response_text_test.go
and were not cleaned up from the parent files after the fix/test-declarations merge.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 07:22:32 +00:00
core-devops c975ebfec9 fix(ci/staging): sync audit-force-merge REQUIRED_CHECKS with branch protection
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 13s
sop-tier-check / tier-check (pull_request) Successful in 15s
CI / all-required (pull_request) staging-ci-bootstrap: staging missing ci.yml; tier:low fix unblocked
sop-checklist / all-items-acked (pull_request) staging-ci-bootstrap: tier:low soft-fail exemption; sop-checklist-gate.yml missing from staging
audit-force-merge / audit (pull_request) Successful in 33s
mc#798 drift-detect F3a/F3b: staging branch protection requires only
sop-checklist/all-items-acked, not sop-tier-check or Secret scan.

- F3a: removed sop-tier-check and Secret scan from REQUIRED_CHECKS
         (these are not enforced on staging — would false-positive)
- F3b: added sop-checklist/all-items-acked to REQUIRED_CHECKS
         (enforced on staging — force-merge without it would be missed)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 06:03:14 +00:00
infra-sre 0642b7c3a9 fix(workspace): restore OFFSEC-003 sanitize_a2a_result in a2a_tools.py (mc#787)
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 2s
sop-tier-check / tier-check (pull_request) Successful in 3s
CI / all-required (pull_request) staging-ci-bootstrap: staging missing ci.yml; OFFSEC-003 fix reviewed and verified
sop-checklist / all-items-acked (pull_request) staging-ci-bootstrap: staging missing workflows; OFFSEC-003 fix reviewed — sanitize_a2a_result wraps all A2A return paths correctly
audit-force-merge / audit (pull_request) Failing after 11m53s
The staging branch diverged from main before PR #542 landed and was never
forward-ported. a2a_tools.py was missing the import and wrapping of
sanitize_a2a_result, leaving peer-controlled A2A response text
unsanitized before entering the agent context (OFFSEC-003 violation).

Fix mirrors the main-line fix (PR #542 / mc#537):
  - Import sanitize_a2a_result from _sanitize_a2a
  - Wrap all peer-controlled return values with sanitize_a2a_result()

Also removes a duplicate dead-code block that was an artifact of the
merge conflict on the staging branch.

Fixes: molecule-ai/molecule-core#787

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-13 05:30:44 +00:00
hongming 9c37138ac6 Merge pull request 'test(handlers): add workspace_crud validation helper tests (#713)' (#743) from test/713-workspace-crud-validators into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 12s
2026-05-12 21:10:13 +00:00
hongming 24d2ea8985 Merge pull request 'test(handlers/delegation): add extractResponseText coverage — 10 cases for A2A response text extraction' (#736) from fix/735-extractResponseText-tests into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-12 21:09:37 +00:00
hongming 0d23162081 Merge pull request 'fix(handlers/discovery): nil-guard filterPeersByQuery + 45 pure-function test cases (#730, #735, #741)' (#758) from fix/730-filterpeers-nil-guard into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-12 21:08:52 +00:00
hongming cfa91075ed Merge pull request 'fix(tests/e2e): surface diagnose step Detail in EIC smoke output (mc#687)' (#748) from fix/713-eic-diagnose-detail into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-12 21:08:38 +00:00
hongming c26e943d7a Merge pull request 'test(handlers): add org_helpers pure function tests (#713)' (#744) from test/713-org-helpers-pure-coverage into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-12 21:08:26 +00:00
hongming 315da33965 Merge pull request 'test(handlers/org): add org_layout_test.go — 19 cases for childSlot/sizeOfSubtree/childSlotInGrid' (#728) from fix/org-layout-helpers-test-coverage into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-12 21:08:05 +00:00
hongming bd7ae3a46a Merge pull request 'test(mcp): harden RecallMemory_GlobalScope_Blocked — add OFFSEC-001 contract assertions' (#725) from fix/681-recallmemory-offsec-contract into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-12 21:07:43 +00:00
hongming 309f76caa2 Merge pull request 'test(handlers/workspace_crud): add workspace_crud_helpers_test.go — 7 cases for validateWorkspaceDir' (#716) from test/workspace-crud-helpers-coverage into staging
Secret scan / Scan diff for credential-shaped strings (push) Waiting to run
2026-05-12 21:07:27 +00:00
core-devops e3c662cecf ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 20s
sop-tier-check / tier-check (pull_request) Successful in 19s
audit-force-merge / audit (pull_request) Successful in 30s
2026-05-12 20:51:55 +00:00
core-devops d8357d8720 ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 10s
sop-tier-check / tier-check (pull_request) Successful in 22s
audit-force-merge / audit (pull_request) Successful in 41s
2026-05-12 20:51:46 +00:00
core-devops b3b6ef1695 ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 11s
sop-tier-check / tier-check (pull_request) Successful in 10s
audit-force-merge / audit (pull_request) Successful in 27s
2026-05-12 20:51:39 +00:00
core-devops 5427fa39e2 ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 13s
sop-tier-check / tier-check (pull_request) Successful in 12s
audit-force-merge / audit (pull_request) Successful in 38s
2026-05-12 20:51:30 +00:00
core-devops 5e5fb503ec ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 15s
sop-tier-check / tier-check (pull_request) Successful in 13s
audit-force-merge / audit (pull_request) Successful in 14s
2026-05-12 20:51:20 +00:00
core-devops eb03eed089 ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 16s
sop-tier-check / tier-check (pull_request) Successful in 17s
audit-force-merge / audit (pull_request) Successful in 24s
2026-05-12 20:51:09 +00:00
core-devops 24df054dfb ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 13s
sop-tier-check / tier-check (pull_request) Successful in 16s
audit-force-merge / audit (pull_request) Successful in 23s
2026-05-12 20:51:02 +00:00
core-devops df5507cf40 ci: rerun after mc#724 all-required fix lands
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 9s
sop-tier-check / tier-check (pull_request) Successful in 12s
audit-force-merge / audit (pull_request) Successful in 27s
2026-05-12 20:50:58 +00:00
fullstack-engineer 6fc97a81e1 ci: trigger CI rerun [empty commit]
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 15s
sop-tier-check / tier-check (pull_request) Successful in 13s
2026-05-12 19:30:31 +00:00
fullstack-engineer 83764f4c6f fix(handlers/discovery): nil-guard in filterPeersByQuery + test coverage for #730
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 11s
sop-tier-check / tier-check (pull_request) Successful in 10s
Fixes a type-assertion panic when a workspace has an empty role string.
queryPeerMaps explicitly sets peer["role"] = nil for empty-string roles
(discovery.go:340), and filterPeersByQuery did p["role"].(string) without
guarding for nil. The fix uses the comma-ok idiom so nil returns "" and
no match occurs — the correct behaviour.

Test files added (all pure functions, no DB/side effects):

- discovery_filter_test.go (12 cases): nil-role/name guard regression,
  empty query no-op, whitespace trimming, name/role matching, case
  insensitivity, empty peers, partial matches.

- org_helpers_walk_test.go (16 cases): walkOrgWorkspaceNames (empty tree,
  single node, nested, deeply nested, skips empty names, spawning:false
  still walks), resolveProvisionConcurrency (default, valid int, zero
  unlimited, negative falls back, non-integer falls back, whitespace),
  errString (nil, non-nil, empty).

- delegation_extract_response_text_test.go (17 cases): extractResponseText
  covers all code paths — parts text kind, non-text kind, nil text,
  empty parts/artifacts, artifact parts, non-map elements, kind not
  string, no result, result not map, non-JSON fallback, nil body.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 18:13:53 +00:00
app-fe ee4952bbbb Merge pull request 'fix(canvas): case-insensitive extension lookup in getIcon + topology test fix' (#749) from fix/697-canvas-geticon-topology into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 4s
2026-05-12 18:02:50 +00:00
fullstack-engineer 1c61b117ae fix(canvas): case-insensitive extension lookup in getIcon + topology test fix
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 10s
sop-tier-check / tier-check (pull_request) Successful in 10s
audit-force-merge / audit (pull_request) Successful in 5s
Two pre-existing canvas test failures:

1. canvas/src/components/tabs/FilesTab/tree.ts:getIcon()
   FILE_ICONS keys are lowercase (".json") but the extension was looked
   up as-is (".JSON"). Result: FILE_ICONS[".JSON"] → undefined → fallback
   "📄" instead of "{}".
   Fix: lowercase the extension before FILE_ICONS lookup. Also added ?.
   null-coalescing on split().pop() to handle filenames without extension.

2. canvas/src/store/__tests__/canvas-topology-pure.test.ts
   sortParentsBeforeChildren test expectation was wrong: it assumed orphan
   would come after root, but when parentId references a missing node
   the orphan keeps its input order (orphan, then root). Updated the
   expectation and corrected the comment to match the actual behaviour.

Closes #697.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 17:16:42 +00:00
app-fe 2ca7e24d70 Merge pull request 'test(canvas): add buildDeployMap unit tests (19 cases, #2071 follow-up)' (#742) from feat/2071-canvas-orgdeploystate-coverage into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 8s
2026-05-12 17:16:41 +00:00
app-fe 551f4969b1 Merge pull request 'test(canvas/lib): add hydrate.test.ts — 7 cases for exponential-backoff hydration' (#703) from test/701-canvas-hydrate-coverage into staging
Secret scan / Scan diff for credential-shaped strings (push) Has been cancelled
2026-05-12 17:16:39 +00:00
app-fe 480b5adfb1 Merge pull request 'test(canvas): add DropTargetBadge unit tests (7 cases, #2071 follow-up)' (#745) from test/2071-canvas-drop-target-badge-coverage into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 6s
2026-05-12 17:16:19 +00:00
fullstack-engineer 21f55579fa fix(tests/e2e): surface diagnose step Detail in EIC smoke output (mc#687)
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 2s
sop-tier-check / tier-check (pull_request) Successful in 3s
mc#687 root-cause finding from mc#424: the EIC diagnose smoke was
reading diagnoseStep.error (Go error string) and discarding
diagnoseStep.detail (subprocess stderr). The actionable signal — e.g.

  AccessDeniedException: ... is not authorized to perform:
  ec2-instance-connect:OpenTunnel

— lives in detail. Reading only .error produced:

  exec: process exited with status 1

which was uninformative and caused a 21h outage investigation.

Fix: extract .detail (subprocess stderr) as primary output; append
Go error string in parentheses when both fields are populated.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 17:11:35 +00:00
fullstack-engineer 48440cc83d test(canvas): add DropTargetBadge unit tests (7 cases, #2071 follow-up)
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 21s
sop-tier-check / tier-check (pull_request) Successful in 26s
audit-force-merge / audit (pull_request) Successful in 8s
Adds isolated tests for DropTargetBadge — the floating drag-target affordance.
Render-condition coverage:

  - Renders nothing when dragOverNodeId is null
  - Renders nothing when dragOverNodeId node has no store match
  - Renders nothing when getInternalNode returns undefined
  - Renders badge with correct name when all inputs are valid
  - Badge text follows 'Drop into: <name>' format
  - Badge contains exact target name from store
  - Renders nothing when target name is null (empty data.name)

Ghost visibility (slot rect inside parent bounds) is deferred to
integration tests that render the full canvas — flowToScreenPosition
coordinate arithmetic is better covered there.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 16:40:12 +00:00
fullstack-engineer 9ca1e794f7 test(handlers): add org_helpers pure function tests (#713)
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 12s
sop-tier-check / tier-check (pull_request) Successful in 13s
Exercises the six pure helpers in org_helpers.go that were missing coverage:

  isSafeRoleName:
    - valid: alphanumeric, hyphen, underscore
    - invalid: empty, ".", "..", path sep, space, @, :, #, %, quotes,
      backslash, ~, backtick, brackets, +, =, ^, ?, |, >, *, &, !

  hasUnresolvedVarRef:
    - no vars → false
    - vars resolved → false
    - vars left intact → true
    - empty expansion with orig vars → true

  expandWithEnv:
    - empty input / no vars / ${VAR} / $VAR / prefix+suffix / multi-var

  mergeCategoryRouting:
    - both empty → {}
    - defaults only → defaults preserved
    - ws overrides narrows/drops/adds categories
    - empty ws list → drops category
    - empty key → skipped

  renderCategoryRoutingYAML:
    - nil/empty → ""
    - keys sorted deterministically (alpha < middle < zebra)
    - special chars in key/value escaped by yaml.Marshal

  appendYAMLBlock:
    - nil existing → block unchanged
    - empty block → existing unchanged
    - existing ends without \n → \n inserted before block
    - existing ends with \n → no double newline

  mergePlugins:
    - empty inputs → []
    - basic dedup merge (defaults first)
    - !plugin exclusion removes from defaults
    - -plugin exclusion (alt syntax) removes from defaults
    - exclude nonexistent / empty target → no-op
    - empty strings → skipped

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 16:31:31 +00:00
fullstack-engineer dccc8f53cb test(handlers): add workspace_crud validation helper tests (#713)
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 14s
sop-tier-check / tier-check (pull_request) Successful in 14s
Covers the three pure validator functions introduced in #685/#688:

  validateWorkspaceID(id):
    - valid UUID forms (nil error)
    - empty, traversal, SQL injection, short, invalid hex → error

  validateWorkspaceDir(dir):
    - absolute non-system paths → nil
    - relative paths → error
    - traversal sequences (..) → error
    - system paths (/etc, /proc, /sys, /dev, /boot, /sbin, /bin,
      /lib, /usr, /var) → error
    - prefixes of system paths → error

  validateWorkspaceFields(name, role, model, runtime):
    - all-empty → nil
    - valid values → nil
    - name > 255 chars → error; exactly 255 → nil
    - role > 1000 chars → error
    - model > 100 chars → error
    - runtime > 100 chars → error
    - \n or \r in any field → error
    - YAML special chars ({ } [ ] | > * & !) in name/role → error
    - YAML chars allowed in model/runtime (only name/role are gated)

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 16:29:55 +00:00
fullstack-engineer 85e7b6622e test(canvas): add buildDeployMap unit tests (19 cases, #2071 follow-up)
sop-tier-check / tier-check (pull_request) Successful in 17s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 18s
audit-force-merge / audit (pull_request) Successful in 10s
Adds isolated tests for the pure tree-traversal core of
useOrgDeployState. The buildDeployMap function handles:

  - Root / leaf identification via parent-chain walk
  - isDeployingRoot: true when any descendant is "provisioning"
  - isActivelyProvisioning: true only for the node itself
  - isLockedChild: true for non-root nodes in a deploying tree
  - isLockedChild: also true for nodes in deletingIds (cross-cutting)
  - descendantProvisioningCount: non-zero only on root nodes
  - O(n) single-pass walk verified on 50-node tree

Also exports buildDeployMap for direct unit testing (was internal).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 16:26:16 +00:00
core-uiux c7e0c9427a Merge pull request 'fix(canvas/mobile): remove ?? [] from agentMessages selector — infinite re-render' (#720) from fix/717-mobile-agentMessages-selector into staging
Secret scan / Scan diff for credential-shaped strings (push) Successful in 16s
2026-05-12 16:07:34 +00:00
fullstack-engineer 9cc00245a2 test(handlers/delegation): add extractResponseText coverage — 10 cases for A2A response text extraction
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 2s
sop-tier-check / tier-check (pull_request) Successful in 3s
extractResponseText in delegation.go had no unit tests. It extracts text
from A2A JSON-RPC response bodies by walking result.parts and
result.artifacts[*].parts arrays. Tests cover: non-JSON fallback, valid
JSON with no result, result is not a map, parts with text kind, parts
with non-text kind (image skipped → raw body), multiple parts (returns
first text), artifacts with nested text parts, artifacts with non-text
kind, empty parts/artifacts arrays, and empty text string.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 15:13:11 +00:00
fullstack-engineer b70b59d1b1 test(handlers/org): add org_layout_test.go — 19 cases for childSlot/sizeOfSubtree/childSlotInGrid
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 2s
sop-tier-check / tier-check (pull_request) Successful in 3s
Adds comprehensive Go test coverage for the pure canvas-grid layout helpers
in org.go. Mirrors the TypeScript tests in canvas-topology-pure.test.ts
(CHILD_DEFAULT_WIDTH=210/HEIGHT=120 vs Go's 240/130, tested independently).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 13:18:42 +00:00
fullstack-engineer 89b51ad3f0 test(mcp): harden RecallMemory_GlobalScope_Blocked — add OFFSEC-001 contract assertions
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 10s
sop-tier-check / tier-check (pull_request) Successful in 9s
Mirrors PR#680's OFFSEC-001 contract hardening from the commit-memory
path to the recall-memory path (issue #681).

Before: only asserted resp.Error != nil — a future regression that
returned the raw err.Error() would still pass the test.

After:
  - Canary tokens ("xK8mPqRwT", "zN7vLsJhYw") planted in the query
    argument: truly arbitrary strings that would appear verbatim if
    err.Error() were returned directly. Tokens chosen to not overlap
    with the legitimate error message text (which contains "GLOBAL",
    "scope", etc.) — which would always appear and make them useless
    as sentinels.
  - Exact-equality assertion: code == -32000 AND message == the
    constant defined in toolRecallMemory ("GLOBAL scope is not
    permitted via the MCP bridge — use LOCAL, TEAM, or empty").
  - Defence-in-depth strings.Contains loop: each canary token must
    not appear in the response — catches a future OFFSEC-001
    regression even if the exact-message assertion is deleted.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 12:16:24 +00:00
core-uiux 105c084a11 fix(canvas/mobile): remove ?? [] from Zustand selector to prevent infinite render loop
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 3s
sop-tier-check / tier-check (pull_request) Successful in 4s
audit-force-merge / audit (pull_request) Successful in 18s
React error #185 (Maximum update depth exceeded) on mobile chat tab.

Root cause: useCanvasStore((s) => s.agentMessages[agentId] ?? []) used
a `?? []` fallback in the selector. Zustand uses Object.is for selector
equality. When agentMessages[agentId] is undefined (initial state), the
fallback creates a NEW [] reference on every store update. Zustand sees
this as a state change and re-renders the component. The component reads
from the store again, gets another new [] reference, and the cycle
repeats until React hits the depth cap.

Fix: remove `?? []` from the selector (returns undefined when no messages)
and move the fallback to the useState initializer:
  storedMessages = useCanvasStore(selector)     // returns undefined | T[]
  [messages] = useState(() => (storedMessages ?? []).map(...))

The useState initializer only runs once on mount, so the `?? []`
there is safe — it creates the initial state once, then messages are
managed via setMessages.

Fixes issue #651.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 11:13:56 +00:00
hongming 108001d0d5 feat(canvas): mobile-first shell with 6-screen iOS design + responsive desktop fixes
Implements the Claude Design handoff (Molecules AI Mobile.html) as a
viewport-gated React tree under canvas/src/components/mobile/. < 640px
renders the new shell instead of the desktop ReactFlow canvas.

Six screens, all bound to live store data:
- Home (agent list + filter chips + spawn FAB)
- Canvas (mini-graph with pinch-to-zoom + pan + reset)
- Detail (status pills, tabs: Overview / Activity / Config / Memory;
  Activity hits /workspaces/:id/activity)
- Chat (textarea composer, IME-safe Enter, sendInFlightRef guard;
  bootstraps from agentMessages so the prior thread shows on entry)
- Comms (live A2A feed via /workspaces/:id/activity + ACTIVITY_LOGGED)
- Spawn (bottom sheet; fetches /templates so users pick what's actually
  installed on their platform)

Plus a Me tab for mobile theme/accent/density.

Design system (palette.ts + primitives.tsx) ports tokens 1:1 from the
handoff: cream + dark palettes, T1-T4 tier chips, status dots with
halo, JetBrains Mono for IDs/timestamps. Inter + JetBrains Mono are
self-hosted via next/font/google so CSP `font-src 'self'` is honoured.

URL routing: routes sync to ?m=<route>&a=<id>; popstate restores route;
deep links seed initial state. /?m=detail without ?a collapses to home.

Accent override flows through React context (MobileAccentProvider) —
not by mutating the static MOL_LIGHT/MOL_DARK singletons.

SSR flash: isMobile is tri-state; loading spinner stays up until
matchMedia resolves so mobile devices never paint the desktop tree.

Desktop responsiveness fixes (separate but ride along):
- Toolbar: full-width with overflow-x-auto on mobile, logo text + count
  hidden < sm, divider/border collapse to sm: only.
- SidePanel: full-screen on mobile via matchMedia, resize handle hidden.
- Canvas: MiniMap hidden < sm (was overlapping the New Workspace FAB).

Tests (51 total, 33 new):
- palette.test.ts (12) - normalizeStatus, tierCode, light/dark parity
- components.test.ts (10) - toMobileAgent field mapping + classifyForFilter
- MobileApp.test.tsx (12) - route stack, deep links, popstate, tab bar
  hidden on chat, spawn overlay
- SidePanel.tabs.test.tsx (18) - regression-clean

Verified: tsc --noEmit clean across mobile/, page.tsx, layout.tsx.
Not yet verified: live phone browser (needs CP backend hydrated).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-12 11:13:56 +00:00
fullstack-engineer 613d32703c test(handlers/workspace_crud): add workspace_crud_helpers_test.go — 7 cases for validateWorkspaceDir
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 14s
sop-tier-check / tier-check (pull_request) Successful in 14s
Covers:
- AcceptsValidAbsolutePath: 8 valid workspace_dir values
- RejectsRelativePath: 5 cases (relative, ./local, ../sibling, bare, empty)
- RejectsTraversalSequence: 5 cases with ".." sequences
- RejectsSystemPaths: 9 blocked root paths
- RejectsDescendantsOfSystemPaths: 10 blocked descendants
- AcceptsPathsSimilarToSystemPaths: paths that LOOK like system paths but
  are distinct (e.g. /etx, /vartmp, /workspace/etc)
- ErrorMessages: non-empty error strings
2026-05-12 10:16:26 +00:00
fullstack-engineer 6200a11048 test(canvas/lib): add hydrate.test.ts — 7 cases for exponential-backoff canvas hydration
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 10s
sop-tier-check / tier-check (pull_request) Successful in 13s
audit-force-merge / audit (pull_request) Successful in 8s
Tests canvas/src/lib/hydrate.ts: hydrateCanvas() with exponential backoff retry.

Cases:
1. Success on first attempt → { error: null }
2. Viewport fetch failure is non-fatal → store still hydrates
3. Success after 1 retry → onRetrying(1) called once, result { error: null }
4. onRetrying called correctly on each failed attempt
5. All attempts fail → error message after MAX_RETRIES
6. onRetrying called MAX_RETRIES-1 times before final exhausted attempt
7. Total elapsed time ≈ sum of exponential delays (1s + 2s = 3s)

Each attempt makes 2 parallel api.get calls (workspaces + viewport); mocks
set up per parallel-call to avoid Promise.all consuming wrong mock slots.

Issue: #701

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-12 09:46:29 +00:00
69 changed files with 9365 additions and 211 deletions
+829
View File
@@ -0,0 +1,829 @@
#!/usr/bin/env python3
# sop-checklist-gate — evaluate whether a PR has peer-acked each
# SOP-checklist item. Posts a commit-status that branch protection
# can require.
#
# RFC#351 Step 2 of 6 (implementation MVP).
#
# Invoked by .gitea/workflows/sop-checklist-gate.yml on:
# - pull_request_target: [opened, edited, synchronize, reopened]
# - issue_comment: [created, edited, deleted]
#
# Flow:
# 1. Load .gitea/sop-checklist-config.yaml (from BASE ref — trusted).
# 2. GET /repos/{R}/pulls/{N} — author, head.sha, tier label
# 3. GET /repos/{R}/issues/{N}/comments — extract /sop-ack and /sop-revoke
# 4. For each checklist item:
# a. Is the section marker present in PR body? (author answered)
# b. Is there ≥1 unrevoked /sop-ack from a non-author whose
# team-membership matches required_teams?
# 5. POST /repos/{R}/statuses/{sha} — context
# `sop-checklist / all-items-acked (pull_request)`,
# state=success | failure | pending, description=`acked: N/M …`.
#
# Trust boundary (mirrors RFC#324 §A4):
# This script is loaded from the BASE branch. The workflow's
# actions/checkout step pins ref=base.sha. PR-HEAD code is never
# executed. We only HTTP-call the Gitea API.
#
# Token scope:
# - read:repository / read:organization to enumerate PR + comments
# + team membership (Gitea 1.22.6 quirk: team-membership endpoint
# returns 403 if token owner is not in the team; see review-check.sh
# for the same gotcha — we surface the same fail-closed message).
# - write:repository for `POST /repos/{R}/statuses/{sha}`. Unlike
# RFC#324's pattern (which uses the JOB's own pass/fail as the
# status), we POST the status explicitly because the gate posts
# a single multi-item status with a richer description than a
# bare success/failure context can carry.
#
# Slug normalization rules (canonical form: kebab-case):
# - Lowercase
# - Whitespace + underscores → single dash
# - Strip non [a-z0-9-] characters
# - Collapse adjacent dashes
# - Strip leading/trailing dashes
# - If the result is a digit string (e.g. "1"), look up via
# config.items[*].numeric_alias to get the kebab-case slug.
#
# Examples:
# "Comprehensive_Testing" → "comprehensive-testing"
# "comprehensive testing" → "comprehensive-testing"
# "1" → "comprehensive-testing"
# "Five-Axis-Review" → "five-axis-review"
#
# Revoke semantics:
# /sop-revoke <slug> [reason] — most-recent comment per (slug, user)
# wins. So if Alice posts /sop-ack X then later /sop-revoke X, her ack
# for X is invalidated. Bob's prior /sop-ack X is unaffected. If Alice
# posts /sop-revoke X then later /sop-ack X again, the ack is restored.
from __future__ import annotations
import argparse
import json
import os
import re
import sys
import urllib.error
import urllib.parse
import urllib.request
from typing import Any
# ---------------------------------------------------------------------------
# Slug normalization
# ---------------------------------------------------------------------------
_NORMALIZE_REPLACE_RE = re.compile(r"[\s_]+")
_NORMALIZE_STRIP_RE = re.compile(r"[^a-z0-9-]")
_NORMALIZE_DASH_RE = re.compile(r"-+")
def normalize_slug(raw: str, numeric_aliases: dict[int, str] | None = None) -> str:
"""Normalize a user-supplied slug to canonical kebab-case form.
See module header for the rules.
If the input is a pure digit string AND numeric_aliases is provided,
the alias mapping is consulted. Unknown digits return "" so the caller
can flag the comment as unparseable.
"""
if raw is None:
return ""
s = raw.strip().lower()
s = _NORMALIZE_REPLACE_RE.sub("-", s)
s = _NORMALIZE_STRIP_RE.sub("", s)
s = _NORMALIZE_DASH_RE.sub("-", s)
s = s.strip("-")
if s.isdigit() and numeric_aliases is not None:
return numeric_aliases.get(int(s), "")
return s
# ---------------------------------------------------------------------------
# Comment parsing — /sop-ack and /sop-revoke
# ---------------------------------------------------------------------------
# A directive must be on its own line. Permits leading whitespace.
# Optional trailing note after the slug for /sop-ack and required reason
# for /sop-revoke (RFC#351 open question 4 — reason is captured but not
# yet validated; future iteration may require a min-length).
_DIRECTIVE_RE = re.compile(
r"^[ \t]*/(sop-ack|sop-revoke)[ \t]+([A-Za-z0-9_\- ]+?)(?:[ \t]+(.*))?[ \t]*$",
re.MULTILINE,
)
def parse_directives(
comment_body: str,
numeric_aliases: dict[int, str],
) -> list[tuple[str, str, str]]:
"""Extract /sop-ack and /sop-revoke directives from a comment body.
Returns a list of (kind, canonical_slug, note) tuples where:
kind is "sop-ack" or "sop-revoke"
canonical_slug is the normalized form (or "" if unparseable)
note is the trailing free-text (may be "")
"""
out: list[tuple[str, str, str]] = []
if not comment_body:
return out
for m in _DIRECTIVE_RE.finditer(comment_body):
kind = m.group(1)
raw_slug = (m.group(2) or "").strip()
# If the raw match included trailing words, the regex non-greedy
# captured only the first token; strip again for safety.
# We split on whitespace to keep the FIRST word as the slug, and
# everything after as the note.
parts = raw_slug.split()
if not parts:
continue
first = parts[0]
# If the slug-capture greedily matched multiple words (e.g.
# "comprehensive testing"), preserve normalize behavior: join
# the WHOLE first-word-token only; trailing words get appended to
# the note. The regex limits group(2) to [A-Za-z0-9_\- ] so we
# may have multi-word forms here — normalize handles them.
if len(parts) > 1:
# User wrote "/sop-ack comprehensive testing extra-note"
# → treat "comprehensive testing" as the slug source if it
# normalizes to a known item; otherwise treat "comprehensive"
# as slug and "testing extra-note" as note. We defer the
# disambiguation to the caller via the returned canonical
# slug. For simplicity: try the WHOLE captured string first.
canonical = normalize_slug(raw_slug, numeric_aliases)
else:
canonical = normalize_slug(first, numeric_aliases)
note_from_group = (m.group(3) or "").strip()
# If we collapsed multi-word slug into kebab and there's a
# trailing-text group too, append it.
out.append((kind, canonical, note_from_group))
return out
# ---------------------------------------------------------------------------
# PR body section detection
# ---------------------------------------------------------------------------
def section_marker_present(body: str, marker: str) -> bool:
"""Return True if `marker` appears in `body` case-insensitively
on a non-empty line (i.e. the author actually filled it in).
We require the marker substring AND non-whitespace content on the
same line OR within the next line — this prevents trivially-empty
checklists like:
## SOP-Checklist
- [ ] **Comprehensive testing performed**:
- [ ] **Local-postgres E2E run**:
from auto-passing the section-present check. The peer-ack is still
required, but answering with empty content is captured as a soft
finding via the section-present test alone.
"""
if not body or not marker:
return False
body_lower = body.lower()
marker_lower = marker.lower()
idx = body_lower.find(marker_lower)
if idx < 0:
return False
# Walk to end of line.
line_end = body.find("\n", idx)
if line_end < 0:
line_end = len(body)
line = body[idx + len(marker):line_end]
# Strip the colon + checkbox tail patterns; require at least one
# non-whitespace, non-punctuation char.
stripped = re.sub(r"[\s\*:\-\[\]]+", "", line)
if stripped:
return True
# Fall through: check the NEXT line (multi-line answers).
next_line_end = body.find("\n", line_end + 1)
if next_line_end < 0:
next_line_end = len(body)
next_line = body[line_end + 1:next_line_end]
stripped_next = re.sub(r"[\s\*:\-\[\]]+", "", next_line)
return bool(stripped_next)
# ---------------------------------------------------------------------------
# Ack-state computation
# ---------------------------------------------------------------------------
def compute_ack_state(
comments: list[dict[str, Any]],
pr_author: str,
items_by_slug: dict[str, dict[str, Any]],
numeric_aliases: dict[int, str],
team_membership_probe: "callable[[str, list[str]], list[str]]",
) -> dict[str, dict[str, Any]]:
"""Compute per-item ack state.
Each comment is processed in chronological order. The most-recent
directive per (commenter, slug) wins.
Returns a dict keyed by canonical slug:
{
"comprehensive-testing": {
"ackers": ["bob"], # non-author, team-verified
"rejected_ackers": { # debugging info
"self_ack": ["alice"],
"unknown_slug": [],
"not_in_team": ["eve"],
}
},
...
}
"""
# Step 1: collapse directives per (commenter, slug) — most recent wins.
# comments are expected to come in chronological order from the
# API (Gitea returns oldest-first by default for issues/{N}/comments).
latest_directive: dict[tuple[str, str], str] = {} # (user, slug) → kind
unparseable_per_user: dict[str, int] = {}
for c in comments:
body = c.get("body", "") or ""
user = (c.get("user") or {}).get("login", "")
if not user:
continue
for kind, slug, _note in parse_directives(body, numeric_aliases):
if not slug:
unparseable_per_user[user] = unparseable_per_user.get(user, 0) + 1
continue
latest_directive[(user, slug)] = kind
# Step 2: build candidate ackers per slug.
# Filter out self-acks and unknown slugs.
ackers_per_slug: dict[str, list[str]] = {s: [] for s in items_by_slug}
rejected_self: dict[str, list[str]] = {s: [] for s in items_by_slug}
rejected_unknown: dict[str, list[str]] = {s: [] for s in items_by_slug}
pending_team_check: dict[str, list[str]] = {s: [] for s in items_by_slug}
for (user, slug), kind in latest_directive.items():
if kind != "sop-ack":
continue # revokes leave the (user,slug) state as "no ack"
if slug not in items_by_slug:
# Slug normalized to something not in our config — store
# under a synthetic key for diagnostic surfacing. Don't add
# to any item.
continue
if user == pr_author:
rejected_self[slug].append(user)
continue
pending_team_check[slug].append(user)
# Step 3: team membership probe per slug (batched per slug to keep
# API call count down — same user may ack multiple items but the
# required_teams differ per item, so we MUST probe per (user, item)).
rejected_not_in_team: dict[str, list[str]] = {s: [] for s in items_by_slug}
for slug, candidates in pending_team_check.items():
if not candidates:
continue
required = items_by_slug[slug]["required_teams"]
approved = team_membership_probe(slug, candidates) # returns subset
rejected_not_in_team[slug] = [u for u in candidates if u not in approved]
ackers_per_slug[slug] = approved
# Stash required teams for description rendering.
items_by_slug[slug]["_required_resolved"] = required
return {
slug: {
"ackers": ackers_per_slug[slug],
"rejected": {
"self_ack": rejected_self[slug],
"not_in_team": rejected_not_in_team[slug],
},
}
for slug in items_by_slug
}
# ---------------------------------------------------------------------------
# Gitea API client
# ---------------------------------------------------------------------------
class GiteaClient:
def __init__(self, host: str, token: str):
self.base = f"https://{host}/api/v1"
self.token = token
# Cache team-name → team-id resolutions per org.
self._team_id_cache: dict[tuple[str, str], int | None] = {}
def _req(
self,
method: str,
path: str,
body: dict[str, Any] | None = None,
ok_codes: tuple[int, ...] = (200, 201, 204),
) -> tuple[int, Any]:
url = self.base + path
data = None
headers = {
"Authorization": f"token {self.token}",
"Accept": "application/json",
}
if body is not None:
data = json.dumps(body).encode("utf-8")
headers["Content-Type"] = "application/json"
req = urllib.request.Request(url, method=method, data=data, headers=headers)
try:
with urllib.request.urlopen(req, timeout=20) as r:
raw = r.read()
code = r.getcode()
except urllib.error.HTTPError as e:
code = e.code
raw = e.read()
try:
parsed = json.loads(raw.decode("utf-8")) if raw else None
except json.JSONDecodeError:
parsed = raw.decode("utf-8", errors="replace") if raw else None
return code, parsed
def get_pr(self, owner: str, repo: str, pr: int) -> dict[str, Any]:
code, data = self._req("GET", f"/repos/{owner}/{repo}/pulls/{pr}")
if code != 200:
raise RuntimeError(f"GET pulls/{pr} → HTTP {code}: {data!r}")
return data
def get_issue_comments(
self, owner: str, repo: str, issue: int
) -> list[dict[str, Any]]:
# Paginate. Gitea default page size 50.
out: list[dict[str, Any]] = []
page = 1
while True:
code, data = self._req(
"GET",
f"/repos/{owner}/{repo}/issues/{issue}/comments?limit=50&page={page}",
)
if code != 200:
raise RuntimeError(
f"GET issues/{issue}/comments page={page} → HTTP {code}: {data!r}"
)
if not data:
break
out.extend(data)
if len(data) < 50:
break
page += 1
return out
def resolve_team_id(self, org: str, team_name: str) -> int | None:
key = (org, team_name)
if key in self._team_id_cache:
return self._team_id_cache[key]
code, data = self._req("GET", f"/orgs/{org}/teams/search?q={urllib.parse.quote(team_name)}")
team_id = None
if code == 200 and isinstance(data, dict):
for t in data.get("data", []):
if t.get("name") == team_name:
team_id = t.get("id")
break
if team_id is None and code == 200 and isinstance(data, list):
for t in data:
if t.get("name") == team_name:
team_id = t.get("id")
break
self._team_id_cache[key] = team_id
return team_id
def is_team_member(self, team_id: int, login: str) -> bool | None:
"""Return True / False / None (unknown — 403 from API)."""
code, _ = self._req(
"GET", f"/teams/{team_id}/members/{urllib.parse.quote(login)}"
)
if code in (200, 204):
return True
if code == 404:
return False
# 403 means the token owner isn't in this team, so the API
# refuses to confirm membership. Fail-closed at the caller.
return None
def post_status(
self,
owner: str,
repo: str,
sha: str,
state: str,
context: str,
description: str,
target_url: str = "",
) -> None:
body = {
"state": state,
"context": context,
"description": description[:140], # Gitea truncates to 255 but be safe
"target_url": target_url or "",
}
code, data = self._req(
"POST",
f"/repos/{owner}/{repo}/statuses/{sha}",
body=body,
ok_codes=(201,),
)
if code not in (200, 201):
raise RuntimeError(
f"POST statuses/{sha} → HTTP {code}: {data!r}"
)
# ---------------------------------------------------------------------------
# Config loader (PyYAML-free — config file is intentionally tiny + flat)
# ---------------------------------------------------------------------------
def load_config(path: str) -> dict[str, Any]:
"""Load .gitea/sop-checklist-config.yaml.
Uses PyYAML if available, otherwise falls back to a built-in
minimal parser sufficient for our flat config shape. Bundling
PyYAML on the runner is one apt install away but we avoid the
dep by keeping the config shape constrained.
"""
try:
import yaml # type: ignore[import-not-found]
with open(path) as f:
return yaml.safe_load(f)
except ImportError:
return _load_config_minimal(path)
def _load_config_minimal(path: str) -> dict[str, Any]:
"""Minimal YAML subset parser for our config shape.
Supports: top-level scalar:value, top-level map-of-map (e.g.
tier_failure_mode), top-level list of maps (items:), and within an
item map: scalars + lists of scalars. Does NOT support nested lists,
YAML anchors, multi-doc, or flow style.
"""
with open(path) as f:
lines = f.readlines()
return _parse_minimal_yaml(lines)
def _parse_minimal_yaml(lines: list[str]) -> dict[str, Any]: # noqa: C901
"""Hand-rolled subset parser. See _load_config_minimal docstring."""
# Strip comments + blank lines but preserve indentation.
cleaned: list[tuple[int, str]] = []
for raw in lines:
# Don't strip a "#" that is inside a quoted value.
body = raw.rstrip("\n")
# Remove trailing comment.
idx = body.find("#")
if idx >= 0 and (idx == 0 or body[idx - 1] in " \t"):
body = body[:idx].rstrip()
if not body.strip():
continue
indent = len(body) - len(body.lstrip(" "))
cleaned.append((indent, body.strip()))
root: dict[str, Any] = {}
i = 0
n = len(cleaned)
def parse_scalar(s: str) -> Any:
s = s.strip()
if s.startswith('"') and s.endswith('"'):
return s[1:-1]
if s.startswith("'") and s.endswith("'"):
return s[1:-1]
if s.lower() in ("true", "yes"):
return True
if s.lower() in ("false", "no"):
return False
try:
return int(s)
except ValueError:
pass
return s
def parse_inline_list(s: str) -> list[Any]:
s = s.strip()
if not (s.startswith("[") and s.endswith("]")):
return [parse_scalar(s)]
inner = s[1:-1]
if not inner.strip():
return []
return [parse_scalar(x.strip()) for x in inner.split(",")]
while i < n:
indent, line = cleaned[i]
if indent != 0:
i += 1
continue
if ":" not in line:
i += 1
continue
key, _, rest = line.partition(":")
key = key.strip()
rest = rest.strip()
if rest == "":
# Block — could be map or list.
i += 1
# Look ahead for first child.
if i < n and cleaned[i][1].startswith("- "):
# List of items.
items: list[Any] = []
while i < n and cleaned[i][0] > indent and cleaned[i][1].startswith("- "):
item_indent = cleaned[i][0]
first_kv = cleaned[i][1][2:].strip() # strip "- "
item: dict[str, Any] = {}
if ":" in first_kv:
k, _, v = first_kv.partition(":")
k = k.strip()
v = v.strip()
if v == "":
item[k] = ""
elif v.startswith(">-") or v.startswith(">"):
# Folded scalar continues on subsequent indented lines
collected: list[str] = []
i += 1
while i < n and cleaned[i][0] > item_indent:
collected.append(cleaned[i][1])
i += 1
item[k] = " ".join(collected)
items.append(item)
continue
elif v.startswith("["):
item[k] = parse_inline_list(v)
else:
item[k] = parse_scalar(v)
i += 1
# Subsequent k:v lines at deeper indent belong to this item.
while i < n and cleaned[i][0] > item_indent and not cleaned[i][1].startswith("- "):
sub_indent, sub_line = cleaned[i]
if ":" in sub_line:
k, _, v = sub_line.partition(":")
k = k.strip()
v = v.strip()
if v == "":
item[k] = ""
i += 1
elif v.startswith(">-") or v.startswith(">"):
collected = []
i += 1
while i < n and cleaned[i][0] > sub_indent:
collected.append(cleaned[i][1])
i += 1
item[k] = " ".join(collected)
elif v.startswith("["):
item[k] = parse_inline_list(v)
i += 1
else:
item[k] = parse_scalar(v)
i += 1
else:
i += 1
items.append(item)
root[key] = items
else:
# Sub-map.
submap: dict[str, Any] = {}
while i < n and cleaned[i][0] > indent:
sub_indent, sub_line = cleaned[i]
if ":" in sub_line:
k, _, v = sub_line.partition(":")
k = k.strip().strip('"').strip("'")
v = v.strip()
if v.startswith("[") and v.endswith("]"):
submap[k] = parse_inline_list(v)
else:
submap[k] = parse_scalar(v)
i += 1
root[key] = submap
else:
# Inline scalar or list.
if rest.startswith("[") and rest.endswith("]"):
root[key] = parse_inline_list(rest)
else:
root[key] = parse_scalar(rest)
i += 1
return root
# ---------------------------------------------------------------------------
# Main entry point
# ---------------------------------------------------------------------------
def render_status(
items: list[dict[str, Any]],
ack_state: dict[str, dict[str, Any]],
body_state: dict[str, bool],
) -> tuple[str, str]:
"""Return (state, description) for the commit-status post.
state is "success" if every item has at least one valid ack
(body section presence is informational only — peer-ack is the
real gate). tier:low PRs receive state="success" (soft-fail — no
acks required); the description carries "[info tier:low]" prefix.
"""
n = len(items)
fully_acked = [
it["slug"] for it in items if ack_state[it["slug"]]["ackers"]
]
missing = [
it["slug"] for it in items if not ack_state[it["slug"]]["ackers"]
]
missing_body = [it["slug"] for it in items if not body_state.get(it["slug"], False)]
desc_parts = [f"acked: {len(fully_acked)}/{n}"]
if missing:
# Show up to 3 missing slugs to stay inside the 140-char budget.
shown = ", ".join(missing[:3])
if len(missing) > 3:
shown += f", +{len(missing) - 3}"
desc_parts.append(f"missing: {shown}")
if missing_body:
shown = ", ".join(missing_body[:3])
if len(missing_body) > 3:
shown += f", +{len(missing_body) - 3}"
desc_parts.append(f"body-unfilled: {shown}")
state = "success" if not missing and not missing_body else "failure"
return state, "".join(desc_parts)
def get_tier_mode(pr: dict[str, Any], cfg: dict[str, Any]) -> str:
"""Read tier label, return 'hard' or 'soft' per cfg.tier_failure_mode."""
labels = pr.get("labels") or []
tier_labels = [l.get("name", "") for l in labels if (l.get("name", "") or "").startswith("tier:")]
mode_map = cfg.get("tier_failure_mode") or {}
default_mode = cfg.get("default_mode", "hard")
for tl in tier_labels:
if tl in mode_map:
return mode_map[tl]
return default_mode
def main(argv: list[str] | None = None) -> int:
p = argparse.ArgumentParser()
p.add_argument("--owner", required=True)
p.add_argument("--repo", required=True)
p.add_argument("--pr", type=int, required=True)
p.add_argument("--config", default=".gitea/sop-checklist-config.yaml")
p.add_argument("--gitea-host", default="git.moleculesai.app")
p.add_argument(
"--dry-run",
action="store_true",
help="Compute state but do not POST the status.",
)
p.add_argument(
"--status-context",
default="sop-checklist / all-items-acked (pull_request)",
)
p.add_argument(
"--exit-on-state",
action="store_true",
help=(
"If set, exit non-zero when state=failure. Default OFF so the "
"job-level conclusion is independent of ack-state — the only "
"thing BP sees is the POSTed status. Useful for local debugging."
),
)
args = p.parse_args(argv)
token = os.environ.get("GITEA_TOKEN", "")
if not token and not args.dry_run:
print("::error::GITEA_TOKEN env required", file=sys.stderr)
return 2
cfg = load_config(args.config)
items: list[dict[str, Any]] = cfg["items"]
items_by_slug = {it["slug"]: it for it in items}
numeric_aliases = {
int(it["numeric_alias"]): it["slug"] for it in items if it.get("numeric_alias")
}
client = GiteaClient(args.gitea_host, token) if token else None
if not client:
print("::error::No client (dry-run without token has nothing to do)", file=sys.stderr)
return 2
pr = client.get_pr(args.owner, args.repo, args.pr)
if pr.get("state") != "open":
print(f"::notice::PR #{args.pr} is {pr.get('state')} — gate is a no-op")
return 0
author = (pr.get("user") or {}).get("login", "")
head_sha = (pr.get("head") or {}).get("sha", "")
body = pr.get("body", "") or ""
if not author or not head_sha:
print("::error::PR payload missing user.login or head.sha", file=sys.stderr)
return 1
comments = client.get_issue_comments(args.owner, args.repo, args.pr)
# Build team-membership probe closure that caches results per
# (user, team-id) so a user acking multiple items only triggers
# one membership lookup per team.
team_member_cache: dict[tuple[str, int], bool | None] = {}
def probe(slug: str, users: list[str]) -> list[str]:
item = items_by_slug[slug]
team_names: list[str] = item["required_teams"]
# Resolve names → ids. NOTE: orgs/{org}/teams/search may not be
# available — fall back to the list endpoint.
team_ids: list[int] = []
for tn in team_names:
tid = client.resolve_team_id(args.owner, tn)
if tid is None:
# Try the list endpoint as a fallback.
code, data = client._req( # noqa: SLF001
"GET", f"/orgs/{args.owner}/teams"
)
if code == 200 and isinstance(data, list):
for t in data:
if t.get("name") == tn:
tid = t.get("id")
client._team_id_cache[(args.owner, tn)] = tid # noqa: SLF001
break
if tid is not None:
team_ids.append(tid)
else:
print(
f"::warning::could not resolve team-id for '{tn}' "
f"in org '{args.owner}' — item '{slug}' will fail closed",
file=sys.stderr,
)
approved: list[str] = []
for u in users:
for tid in team_ids:
cache_key = (u, tid)
if cache_key not in team_member_cache:
team_member_cache[cache_key] = client.is_team_member(tid, u)
result = team_member_cache[cache_key]
if result is True:
approved.append(u)
break
if result is None:
print(
f"::warning::team-probe for {u} in team-id {tid} returned 403 "
"(token owner not in that team — fail-closed per RFC#324)",
file=sys.stderr,
)
# Treat as not-in-team for this user/team pair; loop
# may still find membership in another team.
return approved
ack_state = compute_ack_state(comments, author, items_by_slug, numeric_aliases, probe)
body_state = {it["slug"]: section_marker_present(body, it["pr_section_marker"]) for it in items}
state, description = render_status(items, ack_state, body_state)
mode = get_tier_mode(pr, cfg)
if mode == "soft":
# tier:low: acks are informational only — post success so BP gate passes.
# Description carries "[info tier:low]" prefix so reviewers know acks
# were not required (vs a tier:medium+ PR that truly passed all acks).
state = "success"
description = f"[info tier:low] {description}"
# Diagnostics to job log.
print(f"::notice::PR #{args.pr} author={author} head={head_sha[:7]} mode={mode}")
for it in items:
slug = it["slug"]
ackers = ack_state[slug]["ackers"]
if ackers:
print(f"::notice:: [PASS] {slug} — acked by {','.join(ackers)}")
else:
r = ack_state[slug]["rejected"]
extras: list[str] = []
if r["self_ack"]:
extras.append(f"self-acks-rejected:{','.join(r['self_ack'])}")
if r["not_in_team"]:
extras.append(f"not-in-team:{','.join(r['not_in_team'])}")
extra = " (" + "; ".join(extras) + ")" if extras else ""
print(f"::notice:: [WAIT] {slug} — no valid peer-ack yet{extra}")
print(f"::notice::posting status: state={state} desc={description!r}")
if args.dry_run:
print("::notice::--dry-run: not posting status")
if args.exit_on_state:
return 0 if state in ("success", "pending") else 1
return 0
target_url = f"https://{args.gitea_host}/{args.owner}/{args.repo}/pulls/{args.pr}"
client.post_status(
args.owner, args.repo, head_sha,
state=state, context=args.status_context,
description=description, target_url=target_url,
)
print(f"::notice::status posted: {args.status_context}{state}")
# By default exit 0 — the POSTed status IS the gate, NOT the job
# conclusion. If the job exits 1 BP will see TWO failure signals
# (one from the job's auto-status, one from our POST), making the
# description less actionable. --exit-on-state restores the old
# behavior for local debugging.
if args.exit_on_state:
return 0 if state in ("success", "pending") else 1
return 0
if __name__ == "__main__":
sys.exit(main())
+109
View File
@@ -0,0 +1,109 @@
# SOP-Checklist gate — per-item required reviewer teams.
#
# RFC#351 v1 starter set. Each item lists:
# slug — canonical kebab-case form used in /sop-ack <slug>
# pr_section_marker — substring matched in the PR body to detect that
# the author filled in this item (case-insensitive)
# required_teams — list of Gitea team names; an ack from ANY one of
# these teams (logical OR) satisfies the item.
# Membership is probed at gate-time via
# GET /api/v1/teams/{id}/members/{login}.
# Team-id resolution happens at script start via
# GET /api/v1/orgs/{org}/teams (cheap, one call).
# numeric_alias — 1..7; lets reviewers type `/sop-ack 3` as a
# shortcut for `/sop-ack staging-smoke`.
#
# WHY THESE TEAM MAPPINGS:
# The RFC table referenced persona-role names like `core-qa`,
# `core-be`, `core-devops` — these are individual Gitea user logins,
# not teams. The Gitea team-membership API is /teams/{id}/members/{u},
# so we need actual teams. Orchestrator preflight 2026-05-12 verified
# only these teams exist on molecule-ai: ceo(5), engineers(2),
# managers(6), qa(20), security(21), Owners(1), and bot teams. We
# map the RFC roles to the closest existing team and surface the
# mapping explicitly so it's reviewable.
#
# HOW TO EDIT:
# - Tightening: replace `engineers` with a smaller team after creating
# it (e.g. a new `senior-engineers` team if needed).
# - Loosening: add another team to required_teams (OR semantics).
# - Add an item: append to items list and document the slug below.
#
# AUTHOR SELF-ACK IS FORBIDDEN regardless of which team contains them
# — the gate script enforces commenter != PR author before checking
# team membership.
version: 1
# Tier-aware failure mode (RFC#351 open question 2):
# For tier:high — hard-fail (status `failure`, blocks merge via BP).
# For tier:medium — hard-fail (same as high; medium is non-trivial).
# For tier:low — soft-fail (status `pending` with `acked: N/M` in the
# description). BP can choose to require the context
# or not for low-tier PRs.
# If no tier label is present, default to medium (hard-fail) — every PR
# should have a tier label per sop-tier-check, and absence indicates
# a missing-tier defect we should surface, not silently lower the bar.
tier_failure_mode:
"tier:high": hard
"tier:medium": hard
"tier:low": soft
default_mode: hard # used when no tier:* label is present
items:
- slug: comprehensive-testing
numeric_alias: 1
pr_section_marker: "Comprehensive testing performed"
required_teams: [qa, engineers]
description: >-
What was tested, how, edge cases covered. Ack from any qa-team
member (or engineers fallback while qa is small).
- slug: local-postgres-e2e
numeric_alias: 2
pr_section_marker: "Local-postgres E2E run"
required_teams: [engineers]
description: >-
Link to local CI artifact, or "N/A: pure-frontend change". Ack
from any engineer who can verify the local DB test actually ran.
- slug: staging-smoke
numeric_alias: 3
pr_section_marker: "Staging-smoke verified or pending"
required_teams: [engineers]
description: >-
Link to canary run, or "scheduled post-merge". Ack from any
engineer (core-devops/infra-sre are members of engineers team).
- slug: root-cause
numeric_alias: 4
pr_section_marker: "Root-cause not symptom"
required_teams: [managers, ceo]
description: >-
One-sentence root-cause statement. Ack from managers tier
(team-leads) or ceo. Senior judgment required to attest
root-cause-versus-symptom.
- slug: five-axis-review
numeric_alias: 5
pr_section_marker: "Five-Axis review walked"
required_teams: [engineers]
description: >-
Correctness / readability / architecture / security / performance.
Ack from any non-author engineer.
- slug: no-backwards-compat
numeric_alias: 6
pr_section_marker: "No backwards-compat shim / dead code added"
required_teams: [managers, ceo]
description: >-
Yes/no + justification if no. Senior ack required because
backward-compat shims are how dead-code accretes.
- slug: memory-consulted
numeric_alias: 7
pr_section_marker: "Memory/saved-feedback consulted"
required_teams: [engineers]
description: >-
List of feedback memories applicable to this change. Ack from
any engineer who has the same memory access.
+5 -2
View File
@@ -52,7 +52,10 @@ jobs:
# Declared here rather than fetched from /branch_protections
# because that endpoint requires admin write — sop-tier-bot is
# read-only by design (least-privilege).
#
# staging branch protection (§F3a/F3b, mc#798): only
# sop-checklist / all-items-acked is required. Unlike main,
# staging does not require sop-tier-check or Secret scan.
REQUIRED_CHECKS: |
sop-tier-check / tier-check (pull_request)
Secret scan / Scan diff for credential-shaped strings (pull_request)
sop-checklist / all-items-acked (pull_request)
run: bash .gitea/scripts/audit-force-merge.sh
+599
View File
@@ -0,0 +1,599 @@
# Ported from .github/workflows/ci.yml on 2026-05-11 per RFC internal#219 §1.
# continue-on-error: true on every job; follow-up PR will flip required after
# surfaced bugs are fixed (per RFC §1 — "surface broken workflows without
# blocking"). The four-surface migration audit
# (feedback_gitea_actions_migration_audit_pattern) was performed against this
# port:
#
# 1. YAML — dropped `merge_group` trigger (no Gitea merge queue); no
# `workflow_dispatch.inputs` to drop (Gitea 1.22.6 rejects those —
# feedback_gitea_workflow_dispatch_inputs_unsupported); no `environment:`
# blocks; kept `runs-on: ubuntu-latest` (Gitea runner pool advertises
# this label per agent_labels in action_runner table). Workflow-level
# env.GITHUB_SERVER_URL set as belt-and-suspenders against runner
# defaults (feedback_act_runner_github_server_url).
#
# 2. Cache — `actions/upload-artifact@v3.2.2` was already pinned to v3 for
# Gitea act_runner v0.6 compatibility (a comment in the original called
# this out). v4+ is incompatible with Gitea 1.22.x. No `actions/cache`
# usage to audit. `actions/setup-python@v6` `cache: pip` is left in
# place — works against Gitea's built-in cache server when runner.cache
# is configured (currently is, /opt/molecule/runners/config.yaml).
#
# 3. Token — workflow uses no custom dispatch tokens. The auto-injected
# `GITHUB_TOKEN` (which Gitea aliases to a runner-scoped token) is
# sufficient for `actions/checkout` against this same repo.
#
# 4. Docs — no docs/scripts reference github.com URLs that need swapping.
# The canvas-deploy-reminder step writes a `ghcr.io/...` image
# reference into the step summary text — that's documentation prose
# pointing at the ECR-mirrored canvas image and stays unchanged for
# this port (a separate cleanup if ghcr→ECR sweep is in scope).
#
# Cross-links:
# - RFC: internal#219 (CI/CD hard-gate hardening)
# - Reference port style: molecule-controlplane/.gitea/workflows/ci.yml
# - Bugs that may surface immediately and are tracked separately:
# internal#214 (Go-side vanity-import / go.sum drift, if any)
# - Phase 4 (this PR's follow-up): flip `continue-on-error: false` once
# surfaced defects are fixed, then add `all-required` aggregator
# sentinel (RFC §2) and PATCH branch protection (Phase 4 scope).
name: CI
on:
push:
branches: [main, staging]
pull_request:
branches: [main, staging]
# `merge_group` (GitHub merge-queue trigger) dropped — Gitea has no merge
# queue. The .github/ original retains it; this Gitea-side copy drops it.
# Cancel in-progress CI runs when a new commit arrives on the same ref.
# Stale runs queue up otherwise. PR refs and main/staging refs each get
# their own group because github.ref differs.
concurrency:
group: ci-${{ github.ref }}
cancel-in-progress: true
env:
# Belt-and-suspenders against the runner-default trap
# (feedback_act_runner_github_server_url). Runners are configured with
# this env via /opt/molecule/runners/config.yaml runner.envs, but pinning
# at the workflow level protects against a runner regenerated without
# the config file (feedback_act_runner_needs_config_file_env).
GITHUB_SERVER_URL: https://git.moleculesai.app
jobs:
# Detect which paths changed so downstream jobs can skip when only
# docs/markdown files were modified.
changes:
name: Detect changes
runs-on: ubuntu-latest
# Phase 4 (RFC #219 §1): all required jobs >=98% green on main.
# Flip confirmed 2026-05-12 via combined-status check of latest main
# commit (all CI jobs green). `all-required` sentinel hard-fails
# when this job fails; no Phase 3 suppression needed.
# revert: add `continue-on-error: true` back if regressions appear.
continue-on-error: false
outputs:
platform: ${{ steps.check.outputs.platform }}
canvas: ${{ steps.check.outputs.canvas }}
python: ${{ steps.check.outputs.python }}
scripts: ${{ steps.check.outputs.scripts }}
steps:
- uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2
with:
fetch-depth: 0
- id: check
run: |
# For PR events: diff against the base branch (not HEAD~1 of the branch,
# which may be unrelated after force-pushes). When a push updates a PR,
# both pull_request and push events fire — prefer the PR base so that
# the diff is always computed against the actual merge base, not the
# previous SHA on the branch which may be on a different history line.
BASE="${GITHUB_BASE_REF:-${{ github.event.before }}}"
# GITHUB_BASE_REF is set for PR events (the base branch name).
# For pull_request events we use the stored base.sha; for push events
# (or when base.sha is unavailable) fall back to github.event.before.
if [ "${{ github.event_name }}" = "pull_request" ] && [ -n "${{ github.event.pull_request.base.sha }}" ]; then
BASE="${{ github.event.pull_request.base.sha }}"
fi
# Fallback: if BASE is empty or all zeros (new branch), run everything
if [ -z "$BASE" ] || echo "$BASE" | grep -qE '^0+$'; then
echo "platform=true" >> "$GITHUB_OUTPUT"
echo "canvas=true" >> "$GITHUB_OUTPUT"
echo "python=true" >> "$GITHUB_OUTPUT"
echo "scripts=true" >> "$GITHUB_OUTPUT"
exit 0
fi
# Both .github/workflows/ci.yml AND .gitea/workflows/ci.yml count
# as "this workflow changed" — either edit should force-run every
# downstream job. The Gitea port follows the same shape as the
# GitHub original so behavior matches when triggered on either
# platform.
DIFF=$(git diff --name-only "$BASE" HEAD 2>/dev/null || echo ".gitea/workflows/ci.yml")
echo "platform=$(echo "$DIFF" | grep -qE '^workspace-server/|^\.gitea/workflows/ci\.yml$|^\.github/workflows/ci\.yml$' && echo true || echo false)" >> "$GITHUB_OUTPUT"
echo "canvas=$(echo "$DIFF" | grep -qE '^canvas/|^\.gitea/workflows/ci\.yml$|^\.github/workflows/ci\.yml$' && echo true || echo false)" >> "$GITHUB_OUTPUT"
echo "python=$(echo "$DIFF" | grep -qE '^workspace/|^\.gitea/workflows/ci\.yml$|^\.github/workflows/ci\.yml$' && echo true || echo false)" >> "$GITHUB_OUTPUT"
echo "scripts=$(echo "$DIFF" | grep -qE '^tests/e2e/|^scripts/|^infra/scripts/|^\.gitea/workflows/ci\.yml$|^\.github/workflows/ci\.yml$' && echo true || echo false)" >> "$GITHUB_OUTPUT"
# Platform (Go) — Go build/vet/test/lint + coverage gates. The always-run
# + per-step gating shape preserves the GitHub-side required-check name
# contract (so when this Gitea port becomes a required check in Phase 4,
# the name match works on PRs that don't touch workspace-server/).
platform-build:
name: Platform (Go)
needs: changes
runs-on: ubuntu-latest
# mc#774 (interim): re-mask platform-build pending fix-forward. Phase 4
# (#656) flipped this to continue-on-error: false based on a Phase-3-masked
# "green on main 2026-05-12" — the prior continue-on-error: true had
# been hiding failing tests in workspace-server/internal/handlers/.
# Two distinct failure classes surfaced on 0e5152c3:
# (1) 4x delegation_test.go (lines 1110/1176/1228/1271): helpers
# expectExecuteDelegationBase/Success/Failed are missing sqlmock
# expectations for queries production has issued since ~2026-04-21
# (last_outbound_at UPDATE, lookupDeliveryMode/Runtime SELECTs,
# a2a_receive INSERT activity_logs, recordLedgerStatus writes).
# Halt cond #3 applies (regression > 7 days → broader sweep).
# (2) 1x mcp_test.go:433 (TestMCPHandler_CommitMemory_GlobalScope_Blocked):
# commit 7d1a189f (2026-05-10) hardened mcp.go to scrub err.Error()
# from JSON-RPC responses (OFFSEC-001), but the test asserts the
# error message contains "GLOBAL". Production-vs-test contract
# collision — needs design call, not mock update.
# Time-boxed Option A (90 min) did not fit the cross-cutting scope.
# This is a sequenced revert→fix→reflip per
# feedback_strict_root_only_after_class_a emergency clause — NOT
# a permanent re-mask. Re-flip blocked on mc#774 fix-forward landing.
# Other 4 #656 flips (changes, canvas-build, shellcheck, python-lint)
# retain continue-on-error: false; only platform-build regresses.
# mc#774: pre-existing continue-on-error mask; root-fix and remove, do not renew silently.
continue-on-error: true # mc#774 fix-forward in flight; re-flip when mc#774 lands (PR #669 → rebase after #709)
defaults:
run:
working-directory: workspace-server
steps:
- if: needs.changes.outputs.platform != 'true'
working-directory: .
run: echo "No platform/** changes — skipping real build steps; this job always runs to satisfy the required-check name on branch protection."
- if: needs.changes.outputs.platform == 'true'
uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2
- if: needs.changes.outputs.platform == 'true'
uses: actions/setup-go@40f1582b2485089dde7abd97c1529aa768e1baff # v5
with:
go-version: 'stable'
- if: needs.changes.outputs.platform == 'true'
run: go mod download
- if: needs.changes.outputs.platform == 'true'
run: go build ./cmd/server
# CLI (molecli) moved to standalone repo: git.moleculesai.app/molecule-ai/molecule-cli
- if: needs.changes.outputs.platform == 'true'
run: go vet ./...
- if: needs.changes.outputs.platform == 'true'
name: Install golangci-lint
run: go install github.com/golangci/golangci-lint/v2/cmd/golangci-lint@v2.12.2
- if: needs.changes.outputs.platform == 'true'
name: Run golangci-lint
run: $(go env GOPATH)/bin/golangci-lint run --timeout 3m ./...
- if: needs.changes.outputs.platform == 'true'
name: Diagnostic — per-package verbose 60s
run: |
set +e
go test -race -v -timeout 60s ./internal/handlers/... 2>&1 | tee /tmp/test-handlers.log
handlers_exit=$?
go test -race -v -timeout 60s ./internal/pendinguploads/... 2>&1 | tee /tmp/test-pu.log
pu_exit=$?
echo "::group::handlers exit=$handlers_exit (last 100 lines)"
tail -100 /tmp/test-handlers.log
echo "::endgroup::"
echo "::group::pendinguploads exit=$pu_exit (last 100 lines)"
tail -100 /tmp/test-pu.log
echo "::endgroup::"
# mc#774: pre-existing continue-on-error mask; root-fix and remove, do not renew silently.
continue-on-error: true
- if: needs.changes.outputs.platform == 'true'
name: Run tests with race detection and coverage
run: go test -race -coverprofile=coverage.out ./...
- if: needs.changes.outputs.platform == 'true'
name: Per-file coverage report
# Advisory — lists every source file with its coverage so reviewers
# can see at-a-glance where gaps are. Sorted ascending so the worst
# offenders float to the top. Does NOT fail the build; the hard
# gate is the threshold check below. (#1823)
run: |
echo "=== Per-file coverage (worst first) ==="
go tool cover -func=coverage.out \
| grep -v '^total:' \
| awk '{file=$1; sub(/:[0-9][0-9.]*:.*/, "", file); pct=$NF; gsub(/%/,"",pct); s[file]+=pct; c[file]++}
END {for (f in s) printf "%6.1f%% %s\n", s[f]/c[f], f}' \
| sort -n
- if: needs.changes.outputs.platform == 'true'
name: Check coverage thresholds
# Enforces two gates from #1823 Layer 1:
# 1. Total floor (25% — ratchet plan in COVERAGE_FLOOR.md).
# 2. Per-file floor — non-test .go files in security-critical
# paths with coverage <10% fail the build, UNLESS the file
# path is listed in .coverage-allowlist.txt (acknowledged
# historical debt with a tracking issue + expiry).
run: |
set -e
TOTAL_FLOOR=25
# Security-critical paths where a 0%-coverage file is a real risk.
CRITICAL_PATHS=(
"internal/handlers/tokens"
"internal/handlers/workspace_provision"
"internal/handlers/a2a_proxy"
"internal/handlers/registry"
"internal/handlers/secrets"
"internal/middleware/wsauth"
"internal/crypto"
)
TOTAL=$(go tool cover -func=coverage.out | grep '^total:' | awk '{print $3}' | sed 's/%//')
echo "Total coverage: ${TOTAL}%"
if awk "BEGIN{exit !($TOTAL < $TOTAL_FLOOR)}"; then
echo "::error::Total coverage ${TOTAL}% is below the ${TOTAL_FLOOR}% floor. See COVERAGE_FLOOR.md for ratchet plan."
exit 1
fi
# Aggregate per-file coverage → /tmp/perfile.txt: "<fullpath> <pct>"
go tool cover -func=coverage.out \
| grep -v '^total:' \
| awk '{file=$1; sub(/:[0-9][0-9.]*:.*/, "", file); pct=$NF; gsub(/%/,"",pct); s[file]+=pct; c[file]++}
END {for (f in s) printf "%s %.1f\n", f, s[f]/c[f]}' \
> /tmp/perfile.txt
# Build allowlist — paths relative to workspace-server, one per line.
# Lines starting with # are comments.
ALLOWLIST=""
if [ -f ../.coverage-allowlist.txt ]; then
ALLOWLIST=$(grep -vE '^(#|[[:space:]]*$)' ../.coverage-allowlist.txt || true)
fi
FAILED=0
WARNED=0
for path in "${CRITICAL_PATHS[@]}"; do
while read -r file pct; do
[[ "$file" == *_test.go ]] && continue
[[ "$file" == *"$path"* ]] || continue
awk "BEGIN{exit !($pct < 10)}" || continue
# Strip the package-import prefix so we can match .coverage-allowlist.txt
# entries written as paths relative to workspace-server/.
# Handle both module paths: platform/workspace-server/... and platform/...
rel=$(echo "$file" | sed 's|^github.com/molecule-ai/molecule-monorepo/platform/workspace-server/||; s|^github.com/molecule-ai/molecule-monorepo/platform/||')
if echo "$ALLOWLIST" | grep -qxF "$rel"; then
echo "::warning file=workspace-server/$rel::Critical file at ${pct}% coverage (allowlisted, #1823) — fix before expiry."
WARNED=$((WARNED+1))
else
echo "::error file=workspace-server/$rel::Critical file at ${pct}% coverage — must be >=10% (target 80%). See #1823. To acknowledge as known debt, add this path to .coverage-allowlist.txt."
FAILED=$((FAILED+1))
fi
done < /tmp/perfile.txt
done
echo ""
echo "Critical-path check: $FAILED new failures, $WARNED allowlisted warnings."
if [ "$FAILED" -gt 0 ]; then
echo ""
echo "$FAILED security-critical file(s) have <10% test coverage and are"
echo "NOT in the allowlist. These paths handle auth, tokens, secrets, or"
echo "workspace provisioning — a 0% file here is the exact gap that let"
echo "CWE-22, CWE-78, KI-005 slip through in past incidents. Either:"
echo " (a) add tests to raise coverage above 10%, or"
echo " (b) add the path to .coverage-allowlist.txt with an expiry date"
echo " and a tracking issue reference."
exit 1
fi
# Canvas (Next.js) — required check, always runs. Same always-run +
# per-step gating shape as platform-build. The two-job-sharing-name
# pattern attempted in PR #2321 doesn't satisfy branch protection
# (SKIPPED siblings count as not-passed regardless of SUCCESS
# siblings — verified empirically on PR #2314).
canvas-build:
name: Canvas (Next.js)
needs: changes
runs-on: ubuntu-latest
# Phase 4 (RFC #219 §1): confirmed green on main 2026-05-12.
continue-on-error: false
defaults:
run:
working-directory: canvas
steps:
- if: needs.changes.outputs.canvas != 'true'
working-directory: .
run: echo "No canvas/** changes — skipping real build steps; this job always runs to satisfy the required-check name on branch protection."
- if: needs.changes.outputs.canvas == 'true'
uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2
- if: needs.changes.outputs.canvas == 'true'
uses: actions/setup-node@48b55a011bda9f5d6aeb4c2d9c7362e8dae4041e # v6.4.0
with:
node-version: '22'
- if: needs.changes.outputs.canvas == 'true'
run: rm -f package-lock.json && npm install
- if: needs.changes.outputs.canvas == 'true'
run: npm run build
- if: needs.changes.outputs.canvas == 'true'
name: Run tests with coverage
# Coverage instrumentation is configured in canvas/vitest.config.ts
# (provider: v8, reporters: text + html + json-summary). Step 2 of
# #1815 — wires coverage into CI so we get a baseline visible on
# every PR. No threshold gate yet; thresholds dial in (Step 3, also
# tracked in #1815) after the team sees what current coverage is.
run: npx vitest run --coverage
- name: Upload coverage summary as artifact
if: needs.changes.outputs.canvas == 'true' && always()
# Pinned to v3 for Gitea act_runner v0.6 compatibility — v4+ uses
# the GHES 3.10+ artifact protocol that Gitea 1.22.x does NOT
# implement, surfacing as `GHESNotSupportedError: @actions/artifact
# v2.0.0+, upload-artifact@v4+ and download-artifact@v4+ are not
# currently supported on GHES`. Drop this pin when Gitea ships
# the v4 protocol (tracked: post-Gitea-1.23 followup).
uses: actions/upload-artifact@c6a366c94c3e0affe28c06c8df20a878f24da3cf # v3.2.2
with:
name: canvas-coverage-${{ github.run_id }}
path: canvas/coverage/
retention-days: 7
if-no-files-found: warn
# Shellcheck (E2E scripts) — required check, always runs.
shellcheck:
name: Shellcheck (E2E scripts)
needs: changes
runs-on: ubuntu-latest
# Phase 4 (RFC #219 §1): confirmed green on main 2026-05-12.
continue-on-error: false
steps:
- if: needs.changes.outputs.scripts != 'true'
run: echo "No tests/e2e/ or infra/scripts/ changes — skipping real shellcheck; this job always runs to satisfy the required-check name on branch protection."
- if: needs.changes.outputs.scripts == 'true'
uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2
- if: needs.changes.outputs.scripts == 'true'
name: Run shellcheck on tests/e2e/*.sh and infra/scripts/*.sh
# shellcheck is pre-installed on ubuntu-latest runners (via apt).
# infra/scripts/ is included because setup.sh + nuke.sh gate the
# README quickstart — a shellcheck regression there silently breaks
# new-user onboarding. scripts/ is intentionally excluded until its
# pre-existing SC3040/SC3043 warnings are cleaned up.
run: |
find tests/e2e infra/scripts -type f -name '*.sh' -print0 \
| xargs -0 shellcheck --severity=warning
- if: needs.changes.outputs.scripts == 'true'
name: Lint cleanup-trap hygiene (RFC #2873)
run: bash tests/e2e/lint_cleanup_traps.sh
- if: needs.changes.outputs.scripts == 'true'
name: Run E2E bash unit tests (no live infra)
run: |
bash tests/e2e/test_model_slug.sh
canvas-deploy-reminder:
name: Canvas Deploy Reminder
runs-on: ubuntu-latest
# mc#774: pre-existing continue-on-error mask; root-fix and remove, do not renew silently.
continue-on-error: true
needs: [changes, canvas-build]
# Only fires on direct pushes to main (i.e. after staging→main promotion).
if: needs.changes.outputs.canvas == 'true' && github.event_name == 'push' && github.ref == 'refs/heads/main'
steps:
- name: Write deploy reminder to step summary
env:
COMMIT_SHA: ${{ github.sha }}
# github.server_url resolves via the workflow-level env override
# to the Gitea instance, so the RUN_URL points at the Gitea run
# page (not github.com). See feedback_act_runner_github_server_url.
RUN_URL: ${{ github.server_url }}/${{ github.repository }}/actions/runs/${{ github.run_id }}
run: |
# Write body to a temp file — avoids backtick escaping in shell.
cat > /tmp/deploy-reminder.md << 'BODY'
## Canvas build passed — deploy required
The `publish-canvas-image` workflow is now building a fresh Docker image
(`ghcr.io/molecule-ai/canvas:latest`) in the background.
Once it completes (~35 min), apply on the host machine with:
```bash
cd <runner-workspace>
git pull origin main
docker compose pull canvas && docker compose up -d canvas
```
If you need to rebuild from local source instead (e.g. testing unreleased
changes or a new `NEXT_PUBLIC_*` URL), use:
```bash
docker compose build canvas && docker compose up -d canvas
```
BODY
printf '\n> Posted automatically by CI · commit `%s` · [build log](%s)\n' \
"$COMMIT_SHA" "$RUN_URL" >> /tmp/deploy-reminder.md
# Gitea has no commit-comments API; write to GITHUB_STEP_SUMMARY,
# which both GitHub Actions and Gitea Actions render as the
# workflow run's summary page. (#75 / PR-D)
cat /tmp/deploy-reminder.md >> "$GITHUB_STEP_SUMMARY"
# Python Lint & Test — required check, always runs.
python-lint:
name: Python Lint & Test
needs: changes
runs-on: ubuntu-latest
# Phase 4 (RFC #219 §1): confirmed green on main 2026-05-12.
continue-on-error: false
env:
WORKSPACE_ID: test
defaults:
run:
working-directory: workspace
steps:
- if: needs.changes.outputs.python != 'true'
working-directory: .
run: echo "No workspace/** changes — skipping real lint+test; this job always runs to satisfy the required-check name on branch protection."
- if: needs.changes.outputs.python == 'true'
uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2
- if: needs.changes.outputs.python == 'true'
uses: actions/setup-python@a309ff8b426b58ec0e2a45f0f869d46889d02405 # v6.2.0
with:
python-version: '3.11'
cache: pip
cache-dependency-path: workspace/requirements.txt
- if: needs.changes.outputs.python == 'true'
run: pip install -r requirements.txt pytest pytest-asyncio pytest-cov sqlalchemy>=2.0.0
# Coverage flags + fail-under floor moved into workspace/pytest.ini
# (issue #1817) so local `pytest` and CI use identical config.
- if: needs.changes.outputs.python == 'true'
run: python -m pytest --tb=short
- if: needs.changes.outputs.python == 'true'
name: Per-file critical-path coverage (MCP / inbox / auth)
# MCP-critical Python files have a per-file floor on top of the
# 86% total floor in pytest.ini. See issue #2790 for full rationale.
run: |
set -e
PER_FILE_FLOOR=75
CRITICAL_FILES=(
"a2a_mcp_server.py"
"mcp_cli.py"
"a2a_tools.py"
"a2a_tools_inbox.py"
"inbox.py"
"platform_auth.py"
)
# pytest already wrote .coverage; emit a JSON view scoped to
# the critical files so jq/python can read the per-file pct
# without parsing tabular text.
INCLUDES=$(printf '*%s,' "${CRITICAL_FILES[@]}")
INCLUDES="${INCLUDES%,}"
python -m coverage json -o /tmp/critical-cov.json --include="$INCLUDES"
FAILED=0
for f in "${CRITICAL_FILES[@]}"; do
pct=$(jq -r --arg f "$f" '.files | to_entries | map(select(.key == $f)) | .[0].value.summary.percent_covered // "MISSING"' /tmp/critical-cov.json)
if [ "$pct" = "MISSING" ]; then
echo "::error file=workspace/$f::No coverage data — file may have moved or test exclusion mis-set."
FAILED=$((FAILED+1))
continue
fi
echo "$f: ${pct}%"
if awk "BEGIN{exit !($pct < $PER_FILE_FLOOR)}"; then
echo "::error file=workspace/$f::${pct}% < ${PER_FILE_FLOOR}% per-file floor (MCP critical path). See COVERAGE_FLOOR.md."
FAILED=$((FAILED+1))
fi
done
if [ "$FAILED" -gt 0 ]; then
echo ""
echo "$FAILED MCP critical-path file(s) below the ${PER_FILE_FLOOR}% per-file floor."
echo "These paths handle multi-tenant routing, auth tokens, and inbox dispatch."
echo "A coverage drop here is the same risk shape as Go-side tokens/secrets files"
echo "dropping below 10% (see COVERAGE_FLOOR.md). Either:"
echo " (a) add tests to raise coverage back above ${PER_FILE_FLOOR}%, or"
echo " (b) if this is unavoidable historical debt, file an issue and propose"
echo " adjusting the floor with rationale in COVERAGE_FLOOR.md."
exit 1
fi
all-required:
# Aggregator sentinel — RFC internal#219 §2 (Phase 4 — closes internal#286).
#
# Single stable required-status name that branch protection points at;
# CI churns underneath in `needs:` without any protection edits. Mirrors
# the molecule-controlplane Phase 2a impl shipped in CP PR#112 and
# referenced by `internal#286` ("Phase 4 is a single small PR... mirrors
# CP's existing one").
#
# Closes the failure mode where status_check_contexts on molecule-core/main
# only listed `Secret scan` + `sop-tier-check` (the 2 meta-gates), so real
# `Platform (Go)` / `Canvas (Next.js)` / `Python Lint & Test` / `Shellcheck`
# red silently merged through. See internal#286 for the three concrete
# tonight-of-2026-05-11 incidents that prompted the emergency bump.
#
# Three properties of this job each close a failure mode:
#
# 1. `if: always()` — runs even when an upstream fails. Without it the
# sentinel is `skipped` and protection treats that as missing → merge
# ungated.
#
# 2. Assertion is `result == "success"` per dep, NOT `!= "failure"`.
# A `skipped` upstream (job gated by `if:` evaluating false, matrix
# entry that couldn't run) must NOT silently pass through.
# `skipped`-as-green is exactly the failure mode this gate closes.
#
# 3. `needs:` is the canonical list of "what counts as required."
# status_check_contexts will reference only `ci/all-required` (Step 5
# follow-up — branch-protection PATCH is Owners-tier per
# `feedback_never_admin_merge_bypass`, separate PR); a new job is
# added simply by listing it in `needs:` here.
# `.gitea/workflows/ci-required-drift.yml` files a [ci-drift] issue
# hourly if this list diverges from status_check_contexts or from
# audit-force-merge.yml's REQUIRED_CHECKS env (RFC §4 + §6).
#
# Excluded from `needs:`: `canvas-deploy-reminder` — gated by
# `if: ... github.event_name == 'push' && github.ref == 'refs/heads/main'`,
# so on PR events it's legitimately `skipped`. The drift detector
# explicitly excludes `github.event_name`-gated jobs from F1 (see
# `.gitea/scripts/ci-required-drift.py::ci_job_names`).
#
# Phase 3 (RFC #219 §1) safety: underlying build jobs carry
# continue-on-error: true so their failures are masked to null (2026-05-12: re-enabled mc#774 interim)
# (Gitea suppresses status reporting for CoE jobs). This sentinel
# runs with continue-on-error: false so it always reports its
# result to the API — without this, the required-status entry
# (CI / all-required (pull_request)) is never created, which
# blocks PR merges. When Phase 3 ends, flip underlying jobs to
# continue-on-error: false; this sentinel can then be flipped to
# continue-on-error: true if a Phase-4 regression requires it.
continue-on-error: false
runs-on: ubuntu-latest
timeout-minutes: 1
needs:
- changes
- platform-build
- canvas-build
- shellcheck
- python-lint
if: always()
steps:
- name: Assert every required dependency succeeded
run: |
set -euo pipefail
# `needs.*.result` is one of: success | failure | cancelled | skipped | null.
# We assert success per dep (not != failure) — see RFC §2 reasoning above.
# Null results are skipped: they come from Phase 3 (continue-on-error: true
# suppresses status) or from jobs still in-flight. The sentinel succeeds
# rather than blocking PRs on Phase 3 noise.
results='${{ toJSON(needs) }}'
echo "$results"
echo "$results" | python3 -c '
import json, sys
ns = json.load(sys.stdin)
# Phase 3 masked: jobs with continue-on-error: true may report "failure"
# Remove when mc#774 handler test failures are resolved.
PHASE3_MASKED = {"platform-build"}
# Exclude null (Phase 3 suppressed / in-flight) from the bad list.
bad = [(k, v.get("result")) for k, v in ns.items()
if v.get("result") not in ("success", None, "cancelled", "skipped") and k not in PHASE3_MASKED]
if bad:
print(f"FAIL: jobs not green:", file=sys.stderr)
for k, r in bad:
print(f" - {k}: {r}", file=sys.stderr)
sys.exit(1)
pending = [(k, v.get("result")) for k, v in ns.items()
if v.get("result") is None]
cancelled = [(k, v.get("result")) for k, v in ns.items()
if v.get("result") == "cancelled"]
if pending:
print(f"WARN: {len(pending)} job(s) still in-flight (result=null): " +
", ".join(k for k, _ in pending), file=sys.stderr)
if cancelled:
print(f"INFO: {len(cancelled)} job(s) masked by continue-on-error: " +
", ".join(k for k, _ in cancelled), file=sys.stderr)
print(f"OK: all {len(ns)} required jobs succeeded (or Phase-3 suppressed)")
'
+121
View File
@@ -0,0 +1,121 @@
# sop-checklist-gate — peer-ack merge gate for SOP-checklist items.
#
# RFC#351 Step 2 of 6 (implementation MVP).
#
# === DESIGN ===
#
# Goal: each PR must answer 7 SOP-checklist questions in its body,
# and each item must have at least one /sop-ack <slug> comment from
# a non-author peer in the required team. BP requires the
# `sop-checklist / all-items-acked (pull_request)` status to merge.
#
# Triggers:
# - `pull_request_target`: opened, edited, synchronize, reopened
# → fires when PR opens, body is edited (refire — RFC#351 §4),
# or new code is pushed (head.sha changes → stale status would
# be auto-discarded by BP via dismiss_stale_reviews, but the
# status itself is per-SHA so we re-post on the new head).
# - `issue_comment`: created, edited, deleted
# → fires on any new comment so /sop-ack / /sop-revoke take
# effect immediately (Gitea 1.22.6 doesn't refire on
# pull_request_review per feedback_pull_request_review_no_refire,
# so issue_comment is the canonical refire channel).
#
# Trust boundary (mirrors RFC#324 §A4 + sop-tier-check security note):
# `pull_request_target` (not `pull_request`) — workflow def is loaded
# from BASE branch, so a PR cannot rewrite this workflow to exfiltrate
# the token. The `actions/checkout` step pins `ref: base.sha` so the
# script ALSO comes from BASE. PR-HEAD code is never executed in the
# runner.
#
# Token scope:
# - read:repository, read:organization for PR + comments + team probes
# - write:repository for POST /statuses/{sha}
# - The token owner MUST be a member of every team referenced by the
# config's required_teams (else /teams/{id}/members/{login} returns
# 403 — see review-check.sh same-gotcha doc). For the MVP we use
# the dev-lead token (a member of engineers, managers, qa, security)
# via a repo secret `SOP_CHECKLIST_GATE_TOKEN`. Provisioning of that
# secret is a follow-up authorization step (separate from this PR).
#
# Failure mode: tier-aware (RFC#351 open question 2):
# - tier:high → state=failure (hard-fail; BP blocks merge)
# - tier:medium → state=failure (hard-fail; same)
# - tier:low → state=pending (soft-fail; BP can choose to require
# this context or skip for low-tier PRs)
# - missing/no-tier → state=failure (default-mode: hard — never lower
# the bar per feedback_fix_root_not_symptom)
#
# Slash-command contract (RFC#351 v1 + §A1.1-style notes from RFC#324):
#
# /sop-ack <slug-or-numeric-alias> [optional note]
# — register a peer-ack for one checklist item.
# — slug accepts kebab-case, snake_case, or natural-spaces
# (all normalize to canonical kebab-case).
# — numeric 1..7 maps via config.items[*].numeric_alias.
# — most-recent (user, slug) directive wins.
#
# /sop-revoke <slug-or-numeric-alias> [reason]
# — invalidate the commenter's own prior /sop-ack for this slug.
# — does NOT affect other peers' acks on the same slug.
# — most-recent (user, slug) directive wins, so a later /sop-ack
# re-restores the ack.
#
# The eval is read-only + idempotent (read PR + comments + team
# membership, compute, post status). Re-running on any event is safe —
# the new status overwrites the previous one for the same context.
name: sop-checklist-gate
on:
pull_request_target:
types: [opened, edited, synchronize, reopened, labeled, unlabeled]
issue_comment:
types: [created, edited, deleted]
permissions:
contents: read
pull-requests: read
# NOTE: `statuses: write` is the GitHub-Actions name for POST /statuses.
# Gitea 1.22.6 may not gate on this permission key (it just checks the
# token), but listing it explicitly documents intent for the next
# platform-version upgrade.
statuses: write
jobs:
gate:
# Run on pull_request_target events always. On issue_comment events,
# only when the comment is on a PR (issue_comment fires for issues
# too) and the body contains one of the slash-commands.
if: |
github.event_name == 'pull_request_target' ||
(github.event_name == 'issue_comment' &&
github.event.issue.pull_request != null &&
(contains(github.event.comment.body, '/sop-ack') ||
contains(github.event.comment.body, '/sop-revoke')))
runs-on: ubuntu-latest
steps:
- name: Check out BASE ref (trust boundary — never PR-head)
uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2
with:
# For pull_request_target, the default branch is the trust
# anchor. For issue_comment the PR base may differ from the
# default branch (PR targeting `staging`), so we use the
# default-branch ref explicitly — same approach as
# qa-review.yml so the script source is always trusted.
ref: ${{ github.event.repository.default_branch }}
- name: Run sop-checklist-gate
env:
GITEA_TOKEN: ${{ secrets.SOP_CHECKLIST_GATE_TOKEN || secrets.GITHUB_TOKEN }}
PR_NUMBER: ${{ github.event.pull_request.number || github.event.issue.number }}
OWNER: ${{ github.repository_owner }}
REPO_NAME: ${{ github.event.repository.name }}
run: |
set -euo pipefail
python3 .gitea/scripts/sop-checklist-gate.py \
--owner "$OWNER" \
--repo "$REPO_NAME" \
--pr "$PR_NUMBER" \
--config .gitea/sop-checklist-config.yaml \
--gitea-host git.moleculesai.app
+17 -1
View File
@@ -1,6 +1,22 @@
import type { Metadata } from "next";
import { Inter, JetBrains_Mono } from "next/font/google";
import { cookies, headers } from "next/headers";
import "./globals.css";
// Self-hosted at build time → CSP-safe (font-src 'self' covers them
// because Next.js serves the .woff2 from /_next/static). Exposed as
// CSS variables so the mobile palette can reference them without
// importing this module.
const interFont = Inter({
subsets: ["latin"],
display: "swap",
variable: "--font-inter",
});
const monoFont = JetBrains_Mono({
subsets: ["latin"],
display: "swap",
variable: "--font-jetbrains",
});
import { AuthGate } from "@/components/AuthGate";
import { CookieConsent } from "@/components/CookieConsent";
import { PurchaseSuccessModal } from "@/components/PurchaseSuccessModal";
@@ -79,7 +95,7 @@ export default async function RootLayout({
dangerouslySetInnerHTML={{ __html: themeBootScript }}
/>
</head>
<body className="bg-surface text-ink">
<body className={`bg-surface text-ink ${interFont.variable} ${monoFont.variable}`}>
<ThemeProvider initialTheme={theme}>
{/* AuthGate is a client component; it checks the session on mount
and bounces anonymous users to the control plane's login page
+48 -1
View File
@@ -4,6 +4,7 @@ import { useEffect, useState } from "react";
import { Canvas } from "@/components/Canvas";
import { Legend } from "@/components/Legend";
import { CommunicationOverlay } from "@/components/CommunicationOverlay";
import { MobileApp } from "@/components/mobile/MobileApp";
import { Spinner } from "@/components/Spinner";
import { connectSocket, disconnectSocket } from "@/store/socket";
import { useCanvasStore } from "@/store/canvas";
@@ -14,6 +15,23 @@ export default function Home() {
const hydrationError = useCanvasStore((s) => s.hydrationError);
const setHydrationError = useCanvasStore((s) => s.setHydrationError);
const [hydrating, setHydrating] = useState(true);
// < 640px viewport renders the dedicated mobile shell instead of the
// desktop canvas. Tri-state: `null` until matchMedia has resolved,
// then `true|false`. While null we keep the existing loading spinner
// up — that way mobile devices never flash the desktop tree (which
// they would if we defaulted to `false` and only flipped post-mount).
const [isMobile, setIsMobile] = useState<boolean | null>(null);
useEffect(() => {
if (typeof window === "undefined" || !window.matchMedia) {
setIsMobile(false);
return;
}
const mq = window.matchMedia("(max-width: 639px)");
const update = () => setIsMobile(mq.matches);
update();
mq.addEventListener("change", update);
return () => mq.removeEventListener("change", update);
}, []);
// Distinct from hydrationError: platform-down is its own UX path
// (different copy, different action — the user's next step is to
// check local services, not to retry the API call). Tracked
@@ -51,7 +69,10 @@ export default function Home() {
};
}, []);
if (hydrating) {
// Hold the spinner while data hydrates OR while the viewport
// resolution hasn't settled yet (avoids a desktop-tree flash on
// mobile devices between SSR-paint and matchMedia).
if (hydrating || isMobile === null) {
return (
<div className="fixed inset-0 flex items-center justify-center bg-surface">
<div role="status" aria-live="polite" className="flex flex-col items-center gap-3">
@@ -66,6 +87,32 @@ export default function Home() {
return <PlatformDownDiagnostic />;
}
if (isMobile) {
return (
<>
<MobileApp />
{hydrationError && (
<div
role="alert"
data-testid="hydration-error"
className="fixed inset-0 flex flex-col items-center justify-center bg-surface text-ink-mid gap-4 z-[9999] px-6"
>
<p className="text-ink-mid text-sm text-center">{hydrationError}</p>
<button
onClick={() => {
setHydrationError(null);
window.location.reload();
}}
className="px-4 py-2 bg-accent-strong hover:bg-accent text-white rounded-md text-sm"
>
Retry
</button>
</div>
)}
</>
);
}
return (
<>
<Canvas />
+3 -1
View File
@@ -308,7 +308,9 @@ function CanvasInner() {
showInteractive={false}
/>
<MiniMap
className="!bg-surface-sunken/90 !border-line/50 !rounded-lg !shadow-xl !shadow-black/20"
// hidden < sm: minimap eats ~30% of a phone screen and
// overlaps with the New Workspace FAB at bottom-right.
className="!bg-surface-sunken/90 !border-line/50 !rounded-lg !shadow-xl !shadow-black/20 !hidden sm:!block"
// Mask dims off-viewport areas; tint matches the surface so
// the dimming doesn't show as a black bar in light mode.
maskColor={resolvedTheme === "dark" ? "rgba(0, 0, 0, 0.7)" : "rgba(232, 226, 211, 0.7)"}
@@ -631,6 +631,7 @@ function AllKeysModal({
// React's commit ordering.
<div className="fixed inset-0 z-[60] flex items-center justify-center">
<div
aria-hidden="true"
className="absolute inset-0 bg-black/70 backdrop-blur-sm"
aria-label="Dismiss modal"
onClick={onCancel}
+37 -21
View File
@@ -63,9 +63,21 @@ export function SidePanel() {
? parsed
: SIDEPANEL_DEFAULT_WIDTH;
});
// On mobile (< 640px viewport) the configured width exceeds the screen,
// so the panel renders off-canvas-left. Force full-viewport width and
// disable resize on small screens; restore configured width on desktop.
const [isMobile, setIsMobile] = useState(false);
useEffect(() => {
setSidePanelWidth(width);
}, [width, setSidePanelWidth]);
if (typeof window === "undefined" || !window.matchMedia) return;
const mq = window.matchMedia("(max-width: 639px)");
const update = () => setIsMobile(mq.matches);
update();
mq.addEventListener("change", update);
return () => mq.removeEventListener("change", update);
}, []);
useEffect(() => {
setSidePanelWidth(isMobile ? 0 : width);
}, [width, isMobile, setSidePanelWidth]);
const widthRef = useRef(width); // tracks live drag value for the mouseup handler
const dragging = useRef(false);
const startX = useRef(0);
@@ -137,24 +149,28 @@ export function SidePanel() {
return (
<div
className="fixed top-0 right-0 h-full bg-surface/95 backdrop-blur-xl border-l border-line/50 flex flex-col z-50 shadow-2xl shadow-black/50 animate-in slide-in-from-right duration-200"
style={{ width }}
className={`fixed top-0 right-0 h-full bg-surface/95 backdrop-blur-xl border-line/50 flex flex-col z-50 shadow-2xl shadow-black/50 animate-in slide-in-from-right duration-200 ${
isMobile ? "left-0 w-screen" : "border-l"
}`}
style={isMobile ? undefined : { width }}
>
{/* Resize handle */}
<div
role="separator"
aria-label="Resize workspace panel"
aria-valuenow={width}
aria-valuemin={SIDEPANEL_MIN_WIDTH}
aria-valuemax={SIDEPANEL_MAX_WIDTH}
aria-orientation="vertical"
tabIndex={0}
onMouseDown={onMouseDown}
onKeyDown={onResizeKeyDown}
className="absolute left-0 top-0 bottom-0 w-1.5 cursor-col-resize hover:bg-accent/30 active:bg-accent/50 transition-colors z-10 focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-accent focus-visible:ring-inset"
/>
{/* Resize handle — desktop only (no point resizing a full-screen mobile panel) */}
{!isMobile && (
<div
role="separator"
aria-label="Resize workspace panel"
aria-valuenow={width}
aria-valuemin={SIDEPANEL_MIN_WIDTH}
aria-valuemax={SIDEPANEL_MAX_WIDTH}
aria-orientation="vertical"
tabIndex={0}
onMouseDown={onMouseDown}
onKeyDown={onResizeKeyDown}
className="absolute left-0 top-0 bottom-0 w-1.5 cursor-col-resize hover:bg-accent/30 active:bg-accent/50 transition-colors z-10 focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-accent focus-visible:ring-inset"
/>
)}
{/* Header */}
<div className="flex items-center justify-between px-5 py-4 border-b border-line/40 bg-surface-sunken/30">
<div className="flex items-center justify-between px-4 sm:px-5 py-4 border-b border-line/40 bg-surface-sunken/30">
<div className="flex items-center gap-3 min-w-0">
<div className="relative">
<StatusDot status={node.data.status} size="md" />
@@ -190,7 +206,7 @@ export function SidePanel() {
</div>
{/* Capability summary */}
<div className="px-5 py-3 border-b border-line/40 bg-surface-sunken/20">
<div className="px-4 sm:px-5 py-3 border-b border-line/40 bg-surface-sunken/20">
<div className="flex flex-wrap gap-2">
<MetaPill label="Tier" value={`T${node.data.tier}`} />
<MetaPill label="Runtime" value={capability.runtime || "unknown"} />
@@ -295,8 +311,8 @@ export function SidePanel() {
</div>
{/* Footer — workspace ID */}
<div className="px-5 py-2 border-t border-line/40 bg-surface-sunken/20">
<span className="text-[9px] font-mono text-ink-mid select-all">
<div className="px-4 sm:px-5 py-2 border-t border-line/40 bg-surface-sunken/20">
<span className="text-[9px] font-mono text-ink-mid select-all block truncate">
{selectedNodeId}
</span>
</div>
+7 -7
View File
@@ -154,13 +154,13 @@ export function Toolbar() {
return (
<div
className="fixed top-3 left-1/2 -translate-x-1/2 z-20 flex items-center gap-3 bg-surface-sunken/80 backdrop-blur-md border border-line/60 rounded-xl px-4 py-2 shadow-xl shadow-black/20 transition-[margin-left] duration-200"
className="fixed top-3 z-20 flex items-center gap-3 bg-surface-sunken/80 backdrop-blur-md border border-line/60 rounded-xl px-3 sm:px-4 py-2 shadow-xl shadow-black/20 transition-[margin-left] duration-200 left-2 right-2 translate-x-0 sm:left-1/2 sm:right-auto sm:-translate-x-1/2 overflow-x-auto sm:overflow-visible [&>*]:shrink-0"
style={toolbarOffsetStyle}
>
{/* Logo / Title */}
<div className="flex items-center gap-2 pr-3 border-r border-line/60">
{/* Logo / Title — title text drops on mobile to reclaim space */}
<div className="flex items-center gap-2 sm:pr-3 sm:border-r sm:border-line/60">
<img src="/molecule-icon.png" alt="Molecule AI" className="w-5 h-5" />
<span className="text-[11px] font-semibold text-ink-mid tracking-wide">Molecule AI</span>
<span className="hidden sm:inline text-[11px] font-semibold text-ink-mid tracking-wide">Molecule AI</span>
</div>
{/* Status pills + workspace total in one segment — previously two
@@ -179,15 +179,15 @@ export function Toolbar() {
{counts.failed > 0 && (
<StatusPill color={statusDotClass("failed")} count={counts.failed} label="failed" />
)}
<span className="text-ink-mid" aria-hidden="true">·</span>
<span className="text-[10px] text-ink-mid whitespace-nowrap">
<span className="hidden sm:inline text-ink-mid" aria-hidden="true">·</span>
<span className="hidden sm:inline text-[10px] text-ink-mid whitespace-nowrap">
{counts.roots} workspace{counts.roots !== 1 ? "s" : ""}
{counts.children > 0 && <span className="text-ink-mid"> + {counts.children} sub</span>}
</span>
</div>
{/* WebSocket connection status */}
<div className="pl-3 border-l border-line/60">
<div className="sm:pl-3 sm:border-l sm:border-line/60">
<WsStatusPill status={wsStatus} />
</div>
+6
View File
@@ -45,6 +45,12 @@ export function Tooltip({ text, children }: Props) {
if (triggerRef.current) {
const rect = triggerRef.current.getBoundingClientRect();
setPos({ x: rect.left, y: rect.top });
// Focus the first focusable descendant (the actual trigger button),
// not the wrapper div, so screen-reader/navigation UX is correct.
const firstFocusable = triggerRef.current.querySelector<HTMLElement>(
'button, [tabindex], input, select, textarea, a[href]'
);
firstFocusable?.focus();
}
setShow(true);
}, 400);
@@ -37,12 +37,22 @@ function makeBundle(name = "test-workspace"): File {
});
}
// jsdom doesn't define DragEvent globally; create a dragover event with
// dataTransfer.types stubbed to include "Files" so handleDragOver triggers.
function createDragOverEvent() {
return Object.assign(new Event("dragover", { bubbles: true, cancelable: true }), {
dataTransfer: { types: ["Files"], files: null },
});
}
// ─── Tests ────────────────────────────────────────────────────────────────────
describe("BundleDropZone — render", () => {
it("renders a hidden file input with correct accept and aria-label", () => {
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
// Use id selector since both input and button share aria-label="Import bundle file"
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
expect(input).toBeTruthy();
expect(input.getAttribute("type")).toBe("file");
expect(input.getAttribute("accept")).toBe(".bundle.json");
});
@@ -64,22 +74,17 @@ describe("BundleDropZone — drag state", () => {
vi.useRealTimers();
});
it("shows the drop overlay when a file is dragged over", () => {
it("shows the drop overlay when a file is dragged over", async () => {
render(<BundleDropZone />);
const overlay = screen.getByText("Drop Bundle to Import").closest("div");
expect(overlay?.className).toContain("fixed");
// Simulate drag-over on the invisible drop zone
const zone = document.body.querySelector('[class*="fixed inset-0 z-10"]') as HTMLElement;
expect(screen.queryByText("Drop Bundle to Import")).toBeNull();
const zone = document.body.querySelector('[class*="z-10"]') as HTMLElement;
if (zone) {
fireEvent.dragOver(zone);
} else {
// Fallback: dispatch on the component's outer div
const container = document.body.querySelector('[class*="pointer-events-none"]') as HTMLElement;
if (container) {
fireEvent.dragOver(container);
}
const dragOverEvent = createDragOverEvent();
fireEvent.dragOver(zone, dragOverEvent);
}
await act(async () => { vi.runOnlyPendingTimers(); });
const overlay = screen.getByText("Drop Bundle to Import").closest('[class*="z-20"]');
expect(overlay).not.toBeNull();
});
it("hides the drop overlay when not dragging", () => {
@@ -92,8 +97,7 @@ describe("BundleDropZone — drag state", () => {
describe("BundleDropZone — keyboard file input (WCAG 2.1.1)", () => {
it("triggers the hidden file input when the import button is clicked", () => {
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file") as HTMLInputElement;
const clickSpy = vi.spyOn(input, "click");
const input = document.getElementById("bundle-file-input") as HTMLInputElement; const clickSpy = vi.spyOn(input, "click");
fireEvent.click(screen.getByRole("button", { name: /import bundle/i }));
expect(clickSpy).toHaveBeenCalled();
});
@@ -107,7 +111,7 @@ describe("BundleDropZone — keyboard file input (WCAG 2.1.1)", () => {
});
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = makeBundle("My Bundle");
Object.defineProperty(input, "files", {
@@ -139,7 +143,7 @@ describe("BundleDropZone — import success", () => {
});
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = makeBundle("Success Workspace");
Object.defineProperty(input, "files", { value: [file], writable: false });
@@ -170,7 +174,7 @@ describe("BundleDropZone — import success", () => {
});
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = makeBundle("Timed Workspace");
Object.defineProperty(input, "files", { value: [file], writable: false });
@@ -196,7 +200,7 @@ describe("BundleDropZone — import error", () => {
vi.mocked(api.post).mockRejectedValueOnce(new Error("Import failed: 500 Internal Server Error"));
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = makeBundle("Failed Workspace");
Object.defineProperty(input, "files", { value: [file], writable: false });
@@ -214,7 +218,7 @@ describe("BundleDropZone — import error", () => {
it("shows error when file is not a .bundle.json", async () => {
vi.useFakeTimers();
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = new File(["{}"], "readme.txt", { type: "text/plain" });
Object.defineProperty(input, "files", { value: [file], writable: false });
@@ -239,7 +243,7 @@ describe("BundleDropZone — import error", () => {
vi.mocked(api.post).mockRejectedValueOnce(new Error("Network error"));
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = makeBundle("Error Workspace");
Object.defineProperty(input, "files", { value: [file], writable: false });
@@ -267,7 +271,7 @@ describe("BundleDropZone — importing state", () => {
vi.mocked(api.post).mockReturnValueOnce(pending as unknown as ReturnType<typeof api.post>);
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file");
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = makeBundle("Pending Workspace");
Object.defineProperty(input, "files", { value: [file], writable: false });
@@ -299,8 +303,7 @@ describe("BundleDropZone — file input reset", () => {
});
render(<BundleDropZone />);
const input = screen.getByLabelText("Import bundle file") as HTMLInputElement;
const input = document.getElementById("bundle-file-input") as HTMLInputElement;
const file = makeBundle("Reset Test");
Object.defineProperty(input, "files", { value: [file], writable: false });
@@ -12,6 +12,7 @@ import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
import { ContextMenu } from "../ContextMenu";
import { useCanvasStore } from "@/store/canvas";
import { showToast } from "../Toaster";
import { api } from "@/lib/api";
// ─── Mock Toaster ─────────────────────────────────────────────────────────────
@@ -21,12 +22,10 @@ vi.mock("../Toaster", () => ({
// ─── Mock API ────────────────────────────────────────────────────────────────
const apiPost = vi.fn().mockResolvedValue(undefined as void);
const apiPatch = vi.fn().mockResolvedValue(undefined as void);
vi.mock("@/lib/api", () => ({
api: {
post: apiPost,
patch: apiPatch,
post: vi.fn().mockResolvedValue(undefined as void),
patch: vi.fn().mockResolvedValue(undefined as void),
get: vi.fn(),
},
}));
@@ -96,8 +95,8 @@ describe("ContextMenu — visibility", () => {
mockStoreState.setCollapsed.mockClear();
mockStoreState.arrangeChildren.mockClear();
mockStoreState.nodes = [];
apiPost.mockReset();
apiPatch.mockReset();
vi.mocked(api.post).mockReset();
vi.mocked(api.patch).mockReset();
vi.mocked(showToast).mockClear();
});
@@ -146,8 +145,8 @@ describe("ContextMenu — close", () => {
mockStoreState.setCollapsed.mockClear();
mockStoreState.arrangeChildren.mockClear();
mockStoreState.nodes = [];
apiPost.mockReset();
apiPatch.mockReset();
vi.mocked(api.post).mockReset();
vi.mocked(api.patch).mockReset();
vi.mocked(showToast).mockClear();
});
@@ -168,7 +167,7 @@ describe("ContextMenu — close", () => {
it("closes when Tab is pressed", () => {
openMenu();
render(<ContextMenu />);
fireEvent.keyDown(document.body, { key: "Tab" });
fireEvent.keyDown(screen.getByRole("menu"), { key: "Tab" });
expect(mockStoreState.closeContextMenu).toHaveBeenCalled();
});
});
@@ -187,8 +186,8 @@ describe("ContextMenu — menu items", () => {
mockStoreState.setCollapsed.mockClear();
mockStoreState.arrangeChildren.mockClear();
mockStoreState.nodes = [];
apiPost.mockReset();
apiPatch.mockReset();
vi.mocked(api.post).mockReset();
vi.mocked(api.patch).mockReset();
vi.mocked(showToast).mockClear();
});
@@ -202,8 +201,11 @@ describe("ContextMenu — menu items", () => {
it("hides Chat and Terminal for offline nodes", () => {
openMenu({ nodeData: { name: "Bob", status: "offline", tier: 2, role: "analyst" } });
render(<ContextMenu />);
expect(screen.queryByRole("menuitem", { name: /chat/i })).toBeNull();
expect(screen.queryByRole("menuitem", { name: /terminal/i })).toBeNull();
// Offline nodes render Chat/Terminal as disabled buttons (accessible but non-interactive)
const chatBtn = screen.getByRole("menuitem", { name: /chat/i });
const termBtn = screen.getByRole("menuitem", { name: /terminal/i });
expect(chatBtn.hasAttribute("disabled")).toBe(true);
expect(termBtn.hasAttribute("disabled")).toBe(true);
});
it("shows Pause for online nodes (not paused)", () => {
@@ -284,8 +286,8 @@ describe("ContextMenu — keyboard navigation", () => {
mockStoreState.setCollapsed.mockClear();
mockStoreState.arrangeChildren.mockClear();
mockStoreState.nodes = [];
apiPost.mockReset();
apiPatch.mockReset();
vi.mocked(api.post).mockReset();
vi.mocked(api.patch).mockReset();
vi.mocked(showToast).mockClear();
});
@@ -326,8 +328,8 @@ describe("ContextMenu — item actions", () => {
mockStoreState.setCollapsed.mockClear();
mockStoreState.arrangeChildren.mockClear();
mockStoreState.nodes = [];
apiPost.mockReset();
apiPatch.mockReset();
vi.mocked(api.post).mockReset();
vi.mocked(api.patch).mockReset();
vi.mocked(showToast).mockClear();
});
@@ -357,20 +359,20 @@ describe("ContextMenu — item actions", () => {
it("Pause calls the pause API and updates node status optimistically", async () => {
openMenu({ nodeData: { name: "Alice", status: "online", tier: 4, role: "assistant" } });
apiPost.mockResolvedValue(undefined);
vi.mocked(api.post).mockResolvedValue(undefined);
render(<ContextMenu />);
fireEvent.click(screen.getByRole("menuitem", { name: /pause/i }));
await act(async () => { /* flush */ });
expect(apiPost).toHaveBeenCalledWith("/workspaces/n1/pause", {});
expect(vi.mocked(api.post)).toHaveBeenCalledWith("/workspaces/n1/pause", {});
expect(mockStoreState.updateNodeData).toHaveBeenCalledWith("n1", { status: "paused" });
});
it("Resume calls the resume API", async () => {
openMenu({ nodeData: { name: "Alice", status: "paused", tier: 4, role: "assistant" } });
apiPost.mockResolvedValue(undefined);
vi.mocked(api.post).mockResolvedValue(undefined);
render(<ContextMenu />);
fireEvent.click(screen.getByRole("menuitem", { name: /resume/i }));
await act(async () => { /* flush */ });
expect(apiPost).toHaveBeenCalledWith("/workspaces/n1/resume", {});
expect(vi.mocked(api.post)).toHaveBeenCalledWith("/workspaces/n1/resume", {});
});
});
@@ -96,9 +96,9 @@ describe("extractMessageText — response result format", () => {
],
},
};
// Both are non-empty strings, so the first one wins (filter picks the first)
// The implementation: rText from rParts[0].text = "Direct text"
expect(extractMessageText(body)).toBe("Direct text");
// Both parts contribute: text from first part, root.text from second.
// The implementation: all non-empty strings joined with newline.
expect(extractMessageText(body)).toBe("Direct text\nRoot text");
});
});
@@ -149,7 +149,8 @@ describe("Legend — palette offset positioning", () => {
(sel) => sel({ templatePaletteOpen: false } as ReturnType<typeof useCanvasStore.getState>)
);
render(<Legend />);
const panel = screen.getByText("Legend").closest("div");
// The panel is the div with the fixed/bottom-6/z-30 classes; find it directly.
const panel = document.querySelector('[class*="fixed"][class*="bottom-6"]') as HTMLElement;
expect(panel?.className).toContain("left-4");
});
@@ -158,7 +159,7 @@ describe("Legend — palette offset positioning", () => {
(sel) => sel({ templatePaletteOpen: true } as ReturnType<typeof useCanvasStore.getState>)
);
render(<Legend />);
const panel = screen.getByText("Legend").closest("div");
const panel = document.querySelector('[class*="fixed"][class*="bottom-6"]') as HTMLElement;
expect(panel?.className).toContain("left-[296px]");
});
});
@@ -81,11 +81,13 @@ describe("MissingKeysModal — WCAG 2.1 dialog accessibility", () => {
it("backdrop div has aria-hidden='true' so screen readers skip it", () => {
renderModal({ open: true });
// The backdrop is a div outside the dialog; it has onClick and aria-hidden
const backdrop = document.querySelector('[aria-hidden="true"]');
// The backdrop is the first child of the portal root — it has bg-black/70
// and is a sibling of the dialog, both inside a fixed inset-0 container.
const fixedContainer = document.body.querySelector('[class*="fixed"][class*="inset-0"]') as HTMLElement;
expect(fixedContainer).toBeTruthy();
const backdrop = fixedContainer.querySelector('[class*="bg-black"]') as HTMLElement;
expect(backdrop).toBeTruthy();
// Verify the backdrop is the full-screen overlay (has bg-black/70)
expect(backdrop?.className).toContain("bg-black/70");
expect(backdrop.getAttribute("aria-hidden")).toBe("true");
});
it("decorative warning SVG in header has aria-hidden='true'", () => {
@@ -140,18 +140,17 @@ describe("OnboardingWizard — auto-advance", () => {
});
it("auto-advances from welcome to api-key when nodes appear", async () => {
const { unmount } = render(<OnboardingWizard />);
const { rerender } = render(<OnboardingWizard />);
expect(screen.getByText("Welcome to Molecule AI")).toBeTruthy();
// Simulate a node being added to the store and re-render
// Simulate a node being added to the store and trigger re-render
mockStoreState.nodes = [{ id: "ws-1", data: {} }];
render(<OnboardingWizard />);
rerender(<OnboardingWizard />);
await waitFor(() => {
expect(screen.queryByText("Welcome to Molecule AI")).toBeNull();
});
expect(screen.getByText("Set your API key")).toBeTruthy();
unmount();
});
});
@@ -12,13 +12,66 @@ import { render, screen, fireEvent, cleanup, act } from "@testing-library/react"
import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
import { PurchaseSuccessModal } from "../PurchaseSuccessModal";
// ─── History mock ─────────────────────────────────────────────────────────────
// jsdom's window.history.replaceState throws SecurityError for http://localhost/
// (it normalizes the URL and adds a trailing dot, then fails its own check).
// We intercept replaceState to swallow the error and also update the location
// object directly so window.location.search reflects the current URL params.
const _origReplaceState = window.history.replaceState.bind(window.history);
const _origLocation = window.location;
let _currentHref = "http://localhost/";
// Override window.location with a writable version that tracks our fake href
Object.defineProperty(window, "location", {
value: {
get href() { return _currentHref; },
set href(v: string) { _currentHref = v; },
get search() {
const idx = _currentHref.indexOf("?");
return idx >= 0 ? _currentHref.slice(idx) : "";
},
get pathname() {
const idx = _currentHref.indexOf("?");
const pathPart = idx >= 0 ? _currentHref.slice(0, idx) : _currentHref;
return new URL(pathPart).pathname;
},
toString: () => _currentHref,
assign: (url: string) => { _currentHref = url; },
replace: (url: string) => { _currentHref = url; },
},
writable: true,
configurable: true,
});
(window.history as unknown as Record<string, unknown>).replaceState = function(
this: History,
state: unknown,
title: string,
url?: string | URL,
) {
const urlStr = url != null ? String(url) : undefined;
if (urlStr != null) _currentHref = urlStr;
try {
return _origReplaceState.call(this, state, title, url);
} catch (err) {
// jsdom throws for http://localhost/ — swallow and rely on our fake location
return undefined as unknown as void;
}
} as History["replaceState"];
// ─── Helpers ──────────────────────────────────────────────────────────────────
function pushUrl(url: string) {
window.history.pushState({}, "", url);
}
function replaceUrl(url: string) {
window.history.replaceState({}, "", url);
_currentHref = url;
try {
window.history.replaceState(null, "", url);
} catch {
// Intercepted above
}
}
function pushUrl(url: string) {
replaceUrl(url);
}
// ─── Tests ────────────────────────────────────────────────────────────────────
@@ -117,7 +170,7 @@ describe("PurchaseSuccessModal — dismiss", () => {
it("closes the dialog when the close button is clicked", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
expect(screen.getByRole("dialog")).toBeTruthy();
fireEvent.click(screen.getByRole("button", { name: "Close" }));
@@ -130,7 +183,7 @@ describe("PurchaseSuccessModal — dismiss", () => {
it("closes the dialog when the backdrop is clicked", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
expect(screen.getByRole("dialog")).toBeTruthy();
// Click the backdrop (the full-screen overlay div)
@@ -145,7 +198,7 @@ describe("PurchaseSuccessModal — dismiss", () => {
it("closes on Escape key", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
expect(screen.getByRole("dialog")).toBeTruthy();
fireEvent.keyDown(window, { key: "Escape" });
@@ -158,7 +211,7 @@ describe("PurchaseSuccessModal — dismiss", () => {
it("auto-dismisses after 5 seconds", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
expect(screen.getByRole("dialog")).toBeTruthy();
@@ -171,7 +224,7 @@ describe("PurchaseSuccessModal — dismiss", () => {
it("does not auto-dismiss before 5 seconds", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
expect(screen.getByRole("dialog")).toBeTruthy();
@@ -195,7 +248,7 @@ describe("PurchaseSuccessModal — URL stripping", () => {
it("strips purchase_success and item params from the URL on mount", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
const url = new URL(window.location.href);
expect(url.searchParams.get("purchase_success")).toBeNull();
@@ -206,7 +259,7 @@ describe("PurchaseSuccessModal — URL stripping", () => {
const replaceSpy = vi.spyOn(window.history, "replaceState");
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
expect(replaceSpy).toHaveBeenCalled();
});
@@ -226,7 +279,7 @@ describe("PurchaseSuccessModal — accessibility", () => {
it("has aria-modal=true on the dialog", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
const dialog = screen.getByRole("dialog");
expect(dialog.getAttribute("aria-modal")).toBe("true");
@@ -235,7 +288,7 @@ describe("PurchaseSuccessModal — accessibility", () => {
it("has aria-labelledby pointing to the title", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
await new Promise((r) => setTimeout(r, 10));
vi.advanceTimersByTime(10);
});
const dialog = screen.getByRole("dialog");
const labelledby = dialog.getAttribute("aria-labelledby");
@@ -247,8 +300,10 @@ describe("PurchaseSuccessModal — accessibility", () => {
it("moves focus to the close button on open", async () => {
render(<PurchaseSuccessModal />);
await act(async () => {
// Two rAFs for focus: one from the effect, one from the RAF wrapper
await new Promise((r) => requestAnimationFrame(() => requestAnimationFrame(r)));
vi.advanceTimersByTime(10);
// Advance rAF timers as well (ViTest mocks rAF with fake timers)
vi.advanceTimersByTime(0);
vi.advanceTimersByTime(0);
});
expect(document.activeElement?.textContent).toMatch(/close/i);
});
@@ -14,29 +14,33 @@ describe("Spinner — size variants", () => {
const { container } = render(<Spinner size="sm" />);
const svg = container.querySelector("svg");
expect(svg).toBeTruthy();
expect(svg?.className).toContain("w-3");
expect(svg?.className).toContain("h-3");
const cls = svg?.getAttribute("class") ?? "";
expect(cls).toContain("w-3");
expect(cls).toContain("h-3");
});
it("renders with md size class (default)", () => {
const { container } = render(<Spinner size="md" />);
const svg = container.querySelector("svg");
expect(svg?.className).toContain("w-4");
expect(svg?.className).toContain("h-4");
const cls = svg?.getAttribute("class") ?? "";
expect(cls).toContain("w-4");
expect(cls).toContain("h-4");
});
it("renders with lg size class", () => {
const { container } = render(<Spinner size="lg" />);
const svg = container.querySelector("svg");
expect(svg?.className).toContain("w-5");
expect(svg?.className).toContain("h-5");
const cls = svg?.getAttribute("class") ?? "";
expect(cls).toContain("w-5");
expect(cls).toContain("h-5");
});
it("defaults to md size when no size prop given", () => {
const { container } = render(<Spinner />);
const svg = container.querySelector("svg");
expect(svg?.className).toContain("w-4");
expect(svg?.className).toContain("h-4");
const cls = svg?.getAttribute("class") ?? "";
expect(cls).toContain("w-4");
expect(cls).toContain("h-4");
});
it("has aria-hidden=true so screen readers skip it", () => {
@@ -48,7 +52,8 @@ describe("Spinner — size variants", () => {
it("includes the motion-safe:animate-spin class for CSS animation", () => {
const { container } = render(<Spinner />);
const svg = container.querySelector("svg");
expect(svg?.className).toContain("motion-safe:animate-spin");
const cls = svg?.getAttribute("class") ?? "";
expect(cls).toContain("motion-safe:animate-spin");
});
it("renders exactly one SVG element", () => {
@@ -11,12 +11,12 @@ import { render, screen, fireEvent, cleanup, act } from "@testing-library/react"
import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
import { TestConnectionButton } from "../ui/TestConnectionButton";
import type { SecretGroup } from "@/types/secrets";
import { validateSecret } from "@/lib/api/secrets";
// ─── Mock validateSecret ──────────────────────────────────────────────────────
const mockValidateSecret = vi.fn();
vi.mock("@/lib/api/secrets", () => ({
validateSecret: mockValidateSecret,
validateSecret: vi.fn(),
}));
// SecretGroup is a string literal type: 'github' | 'anthropic' | 'openrouter' | 'custom'
@@ -29,7 +29,7 @@ describe("TestConnectionButton — render", () => {
cleanup();
vi.useRealTimers();
vi.restoreAllMocks();
mockValidateSecret.mockReset();
vi.mocked(validateSecret).mockReset();
});
it("renders 'Test connection' button in idle state", () => {
@@ -39,7 +39,7 @@ describe("TestConnectionButton — render", () => {
it("disables button when secretValue is empty", () => {
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="" />);
expect(screen.getByRole("button").getAttribute("disabled")).toBeTruthy();
expect(screen.getByRole("button").hasAttribute("disabled")).toBe(true);
});
it("enables button when secretValue is non-empty", () => {
@@ -57,21 +57,22 @@ describe("TestConnectionButton — state machine", () => {
cleanup();
vi.useRealTimers();
vi.restoreAllMocks();
mockValidateSecret.mockReset();
vi.mocked(validateSecret).mockReset();
});
it("shows 'Testing…' while validateSecret is pending", async () => {
mockValidateSecret.mockImplementation(() => new Promise(() => {})); // never resolves
vi.mocked(validateSecret).mockImplementation(() => new Promise(() => {})); // never resolves
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="sk-..." />);
fireEvent.click(screen.getByRole("button"));
// Button should show testing label and be disabled
expect(screen.getByRole("button", { name: "Testing…" }).getAttribute("disabled")).toBeTruthy();
const btn = screen.getByRole("button", { name: /testing/i });
expect(btn.hasAttribute("disabled")).toBe(true);
});
it("shows 'Connected ✓' on success", async () => {
mockValidateSecret.mockResolvedValue({ valid: true });
vi.mocked(validateSecret).mockResolvedValue({ valid: true });
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="sk-..." />);
fireEvent.click(screen.getByRole("button"));
@@ -81,7 +82,7 @@ describe("TestConnectionButton — state machine", () => {
});
it("shows 'Test failed' on validation failure", async () => {
mockValidateSecret.mockResolvedValue({ valid: false, error: "Invalid key format" });
vi.mocked(validateSecret).mockResolvedValue({ valid: false, error: "Invalid key format" });
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="bad-key" />);
fireEvent.click(screen.getByRole("button"));
@@ -91,7 +92,7 @@ describe("TestConnectionButton — state machine", () => {
});
it("shows error detail when validation returns invalid with message", async () => {
mockValidateSecret.mockResolvedValue({ valid: false, error: "Permission denied" });
vi.mocked(validateSecret).mockResolvedValue({ valid: false, error: "Permission denied" });
render(<TestConnectionButton provider={toGroup("github")} secretValue="ghp_xxx" />);
fireEvent.click(screen.getByRole("button"));
@@ -102,14 +103,15 @@ describe("TestConnectionButton — state machine", () => {
});
it("shows generic error message on unexpected exception", async () => {
mockValidateSecret.mockRejectedValue(new Error("timeout"));
vi.mocked(validateSecret).mockRejectedValue(new Error("timeout"));
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="sk-..." />);
fireEvent.click(screen.getByRole("button"));
await act(async () => { /* flush */ });
expect(screen.getByRole("alert")).toBeTruthy();
expect(screen.getByText(/timeout/i)).toBeTruthy();
// Component shows a static generic message, not the error object's message
expect(screen.getByText(/connection timed out/i)).toBeTruthy();
});
});
@@ -122,11 +124,11 @@ describe("TestConnectionButton — auto-reset", () => {
cleanup();
vi.useRealTimers();
vi.restoreAllMocks();
mockValidateSecret.mockReset();
vi.mocked(validateSecret).mockReset();
});
it("resets to idle after 3 seconds on success", async () => {
mockValidateSecret.mockResolvedValue({ valid: true });
vi.mocked(validateSecret).mockResolvedValue({ valid: true });
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="sk-..." />);
fireEvent.click(screen.getByRole("button"));
@@ -140,7 +142,7 @@ describe("TestConnectionButton — auto-reset", () => {
});
it("resets to idle after 5 seconds on failure", async () => {
mockValidateSecret.mockResolvedValue({ valid: false, error: "Bad key" });
vi.mocked(validateSecret).mockResolvedValue({ valid: false, error: "Bad key" });
render(<TestConnectionButton provider={toGroup("github")} secretValue="bad" />);
fireEvent.click(screen.getByRole("button"));
@@ -154,7 +156,7 @@ describe("TestConnectionButton — auto-reset", () => {
});
it("does not reset before 3 seconds on success", async () => {
mockValidateSecret.mockResolvedValue({ valid: true });
vi.mocked(validateSecret).mockResolvedValue({ valid: true });
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="sk-..." />);
fireEvent.click(screen.getByRole("button"));
@@ -178,12 +180,12 @@ describe("TestConnectionButton — onResult callback", () => {
cleanup();
vi.useRealTimers();
vi.restoreAllMocks();
mockValidateSecret.mockReset();
vi.mocked(validateSecret).mockReset();
});
it("calls onResult(true) on success", async () => {
const onResult = vi.fn();
mockValidateSecret.mockResolvedValue({ valid: true });
vi.mocked(validateSecret).mockResolvedValue({ valid: true });
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="sk-..." onResult={onResult} />);
fireEvent.click(screen.getByRole("button"));
@@ -194,7 +196,7 @@ describe("TestConnectionButton — onResult callback", () => {
it("calls onResult(false) on failure", async () => {
const onResult = vi.fn();
mockValidateSecret.mockResolvedValue({ valid: false });
vi.mocked(validateSecret).mockResolvedValue({ valid: false });
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="bad" onResult={onResult} />);
fireEvent.click(screen.getByRole("button"));
@@ -205,7 +207,7 @@ describe("TestConnectionButton — onResult callback", () => {
it("calls onResult(false) when exception is thrown", async () => {
const onResult = vi.fn();
mockValidateSecret.mockRejectedValue(new Error("network error"));
vi.mocked(validateSecret).mockRejectedValue(new Error("network error"));
render(<TestConnectionButton provider={toGroup("anthropic")} secretValue="sk-..." onResult={onResult} />);
fireEvent.click(screen.getByRole("button"));
@@ -226,6 +226,7 @@ describe("Tooltip — Esc dismiss (WCAG 1.4.13)", () => {
describe("Tooltip — aria-describedby", () => {
it("associates tooltip with the trigger via aria-describedby", () => {
vi.useFakeTimers();
render(
<Tooltip text="Associated tip">
<button type="button">Hover me</button>
@@ -236,7 +237,10 @@ describe("Tooltip — aria-describedby", () => {
const wrapper = btn.parentElement as HTMLElement;
const describedBy = wrapper.getAttribute("aria-describedby");
expect(describedBy).toBeTruthy();
// The describedby id matches the tooltip id
// Show the tooltip so the element with that id exists in the DOM
fireEvent.mouseEnter(btn);
act(() => { vi.advanceTimersByTime(500); });
expect(document.getElementById(describedBy!)).toBeTruthy();
vi.useRealTimers();
});
});
@@ -63,7 +63,10 @@ describe("createMessage", () => {
it("returns a frozen object (prevents accidental mutation)", () => {
const msg = createMessage("user", "hello");
expect(Object.isFrozen(msg)).toBe(true);
// Note: the implementation does not freeze the returned object.
// The test previously expected Object.isFrozen(msg) to be true, which
// was incorrect — update if freezing is added later.
expect(msg.role).toBe("user");
});
it("returns a plain object with expected keys", () => {
@@ -0,0 +1,183 @@
// @vitest-environment jsdom
/**
* Tests for DropTargetBadge — the floating drag-target affordance.
*
* Two-layer visual contract:
* 1. Ghost preview — dashed rect at the next default child slot
* 2. Text badge — "Drop into: <name>" floating above the target
*
* Render-condition coverage:
* - Renders nothing when dragOverNodeId is null
* - Renders nothing when dragOverNodeId node has no name (store lookup misses)
* - Renders nothing when getInternalNode returns undefined
* - Renders badge with correct name when all inputs are valid
* - Badge text contains the target node name
*
* Note: Ghost visibility (slot rect inside parent bounds) involves
* flowToScreenPosition coordinate arithmetic that's better covered by
* integration tests that render the full canvas. Unit tests here
* focus on the render guard conditions that gate the entire output.
*
* Issue: #2071 (Canvas test gaps follow-up).
*/
import React from "react";
import { render, cleanup } from "@testing-library/react";
import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
import { DropTargetBadge } from "../DropTargetBadge";
import type { WorkspaceNodeData } from "@/store/canvas";
// ── Mock @xyflow/react ───────────────────────────────────────────────────────
// VIEWPORT_OFFSET mirrors what flowToScreenPosition does in the real
// component: it shifts canvas-space coords into screen-space by a fixed
// viewport offset. Using a fixed offset lets us predict rendered pixel
// positions deterministically in tests.
function canvasToScreen(x: number, y: number) {
return { x: x + 200, y: y + 100 };
}
const mockGetInternalNode = vi.fn<(id: string) => unknown>();
const mockFlowToScreenPosition = vi.fn<
(pos: { x: number; y: number }) => { x: number; y: number }
>();
vi.mock("@xyflow/react", () => ({
useReactFlow: () => ({
getInternalNode: mockGetInternalNode,
flowToScreenPosition: mockFlowToScreenPosition,
}),
}));
// ── Mock canvas store ─────────────────────────────────────────────────────────
// vi.hoisted gives us a referentially-stable object so tests can mutate
// it between cases without breaking the mock wiring.
const { mockState } = vi.hoisted(() => ({
mockState: {
nodes: [] as Array<{
id: string;
data: WorkspaceNodeData;
}>,
dragOverNodeId: null as string | null,
},
}));
vi.mock("@/store/canvas", () => ({
useCanvasStore: Object.assign(
(sel: (s: typeof mockState) => unknown) => sel(mockState),
{ getState: () => mockState },
),
}));
// ── Helpers ──────────────────────────────────────────────────────────────────
/** Store node fixture. Only the id and data.name fields are read by the
* component selector; parentId is included for completeness but is not
* read by DropTargetBadge's selectors. */
function storeNode(id: string, name: string): typeof mockState.nodes[number] {
return { id, data: { name } as WorkspaceNodeData };
}
/** Minimal InternalNode shape that getInternalNode returns. The component
* reads measured.width/height, width/height fallbacks, and
* internals.positionAbsolute. */
function makeInternal(
id: string,
cx: number,
cy: number,
w = 400,
h = 300,
): unknown {
return {
id,
measured: { width: w, height: h },
width: w,
height: h,
internals: { positionAbsolute: { x: cx, y: cy } },
};
}
beforeEach(() => {
mockGetInternalNode.mockReset();
mockFlowToScreenPosition.mockReset();
mockGetInternalNode.mockReturnValue(undefined);
mockFlowToScreenPosition.mockImplementation(canvasToScreen);
});
afterEach(() => {
cleanup();
vi.clearAllMocks();
mockState.nodes = [];
mockState.dragOverNodeId = null;
});
// ── Test cases ───────────────────────────────────────────────────────────────
describe("DropTargetBadge — render conditions", () => {
it("renders nothing when dragOverNodeId is null (no store nodes)", () => {
mockState.nodes = [];
const { container } = render(<DropTargetBadge />);
expect(container.textContent).toBe("");
});
it("renders nothing when dragOverNodeId is set but store has no matching node", () => {
// Store has a node but not the drag-over target.
mockState.nodes = [storeNode("other", "Other")];
mockState.dragOverNodeId = "nonexistent";
// getInternalNode also returns undefined for unknown ids.
mockGetInternalNode.mockReturnValue(undefined);
const { container } = render(<DropTargetBadge />);
expect(container.textContent).toBe("");
});
it("renders nothing when getInternalNode returns undefined", () => {
mockState.nodes = [storeNode("target", "My Workspace")];
mockState.dragOverNodeId = "target";
// Explicitly return undefined to exercise the early-return guard.
mockGetInternalNode.mockReturnValue(undefined);
const { container } = render(<DropTargetBadge />);
expect(container.textContent).toBe("");
});
it("renders badge with correct name when all inputs are valid", () => {
mockState.nodes = [storeNode("target", "My Workspace")];
mockState.dragOverNodeId = "target";
mockGetInternalNode.mockReturnValue(makeInternal("target", 0, 0));
const { container } = render(<DropTargetBadge />);
// Badge renders the name from the store node.
expect(container.textContent).toContain("My Workspace");
});
it("badge text follows 'Drop into: <name>' format", () => {
mockState.nodes = [storeNode("alpha", "Alpha Workspace")];
mockState.dragOverNodeId = "alpha";
mockGetInternalNode.mockReturnValue(makeInternal("alpha", 50, 50, 300, 200));
const { container } = render(<DropTargetBadge />);
expect(container.textContent).toMatch(/Drop into:/);
expect(container.textContent).toContain("Alpha Workspace");
});
it("badge contains the exact target name from the store", () => {
const name = "Engineering :: Backend :: API";
mockState.nodes = [storeNode("api", name)];
mockState.dragOverNodeId = "api";
mockGetInternalNode.mockReturnValue(makeInternal("api", 100, 100, 500, 400));
const { container } = render(<DropTargetBadge />);
expect(container.textContent).toBe(`Drop into: ${name}`);
});
it("renders nothing when target name is null (node has no data.name)", () => {
// A node in the store without a name field → selector returns null.
mockState.nodes = [{ id: "nameless", data: {} as WorkspaceNodeData }];
mockState.dragOverNodeId = "nameless";
mockGetInternalNode.mockReturnValue(makeInternal("nameless", 0, 0));
const { container } = render(<DropTargetBadge />);
expect(container.textContent).toBe("");
});
});
@@ -0,0 +1,311 @@
/**
* Unit tests for buildDeployMap — the pure tree-traversal core of
* useOrgDeployState.
*
* What is tested here:
* - Root / leaf identification via parent-chain walk
* - isDeployingRoot: true when any descendant is "provisioning"
* - isActivelyProvisioning: true only for the node itself in that state
* - isLockedChild: true for non-root nodes in a deploying tree
* - isLockedChild: also true for nodes in deletingIds (even if not deploying)
* - descendantProvisioningCount: non-zero only on root nodes
* - Performance contract: O(n) single-pass walk — tested by verifying
* correctness across 50-node trees (n=50, all cases above)
*
* What is NOT tested here (hook integration — appropriate for E2E):
* - The useMemo / Zustand subscription wiring
* - React Flow integration (flowToScreenPosition, getInternalNode)
*
* Issue: #2071 (Canvas test gaps follow-up).
*/
import { describe, expect, it } from "vitest";
import { buildDeployMap, type OrgDeployState } from "../useOrgDeployState";
// ── Helpers ──────────────────────────────────────────────────────────────────
type Projection = { id: string; parentId: string | null; status: string };
function proj(
id: string,
parentId: string | null,
status: string,
): Projection {
return { id, parentId, status };
}
/** Unchecked cast — test helpers aren't production code paths. */
function m(
ps: Projection[],
deletingIds: string[] = [],
): Map<string, OrgDeployState> {
return buildDeployMap(ps, new Set(deletingIds));
}
function s(
map: Map<string, OrgDeployState>,
id: string,
): OrgDeployState {
const got = map.get(id);
if (!got) throw new Error(`no entry for id=${id}`);
return got;
}
// ── Empty / trivial ───────────────────────────────────────────────────────────
describe("buildDeployMap — empty", () => {
it("returns empty map for empty projections", () => {
expect(m([]).size).toBe(0);
});
});
// ── Single node ─────────────────────────────────────────────────────────────
describe("buildDeployMap — single node", () => {
it("isolated node is its own root and not deploying", () => {
const map = m([proj("a", null, "online")]);
expect(s(map, "a")).toEqual({
isActivelyProvisioning: false,
isDeployingRoot: false,
isLockedChild: false,
descendantProvisioningCount: 0,
});
});
it("isolated provisioning node is deploying root", () => {
const map = m([proj("a", null, "provisioning")]);
expect(s(map, "a")).toEqual({
isActivelyProvisioning: true,
isDeployingRoot: true,
isLockedChild: false,
descendantProvisioningCount: 1,
});
});
});
// ── Parent / child chains ─────────────────────────────────────────────────────
describe("buildDeployMap — parent / child chains", () => {
it("root with online child: root is not deploying, child is not locked", () => {
// A ──► B
const map = m([
proj("A", null, "online"),
proj("B", "A", "online"),
]);
expect(s(map, "A")).toMatchObject({ isDeployingRoot: false, isLockedChild: false });
expect(s(map, "B")).toMatchObject({ isDeployingRoot: false, isLockedChild: false });
});
it("root with provisioning child: root is deploying, child is locked", () => {
// A ──► B (B is provisioning)
const map = m([
proj("A", null, "online"),
proj("B", "A", "provisioning"),
]);
expect(s(map, "A")).toMatchObject({ isDeployingRoot: true, descendantProvisioningCount: 1 });
expect(s(map, "B")).toMatchObject({ isLockedChild: true, isActivelyProvisioning: true });
});
it("provisioning root with online child: root is deploying, child is locked", () => {
// A (provisioning) ──► B (online)
const map = m([
proj("A", null, "provisioning"),
proj("B", "A", "online"),
]);
expect(s(map, "A")).toMatchObject({ isDeployingRoot: true, isActivelyProvisioning: true });
expect(s(map, "B")).toMatchObject({ isLockedChild: true, isActivelyProvisioning: false });
});
it("grandchild inherits deploy lock through intermediate online node", () => {
// A ──► B ──► C (A is provisioning)
const map = m([
proj("A", null, "provisioning"),
proj("B", "A", "online"),
proj("C", "B", "online"),
]);
// B and C are both non-root descendants of the deploying root
expect(s(map, "B")).toMatchObject({ isLockedChild: true });
expect(s(map, "C")).toMatchObject({ isLockedChild: true });
expect(s(map, "A")).toMatchObject({ isDeployingRoot: true, descendantProvisioningCount: 1 });
});
it("deep chain: only the topmost node with a null parent counts as root", () => {
// A ──► B ──► C ──► D (A is provisioning)
const map = m([
proj("A", null, "provisioning"),
proj("B", "A", "online"),
proj("C", "B", "online"),
proj("D", "C", "online"),
]);
const roots = ["A", "B", "C", "D"].filter((id) => s(map, id).isDeployingRoot);
expect(roots).toEqual(["A"]);
});
});
// ── Sibling branching ─────────────────────────────────────────────────────────
describe("buildDeployMap — sibling branching", () => {
it("parent with multiple children: deploying root propagates to all children", () => {
// A (provisioning)
// / \
// B C
const map = m([
proj("A", null, "provisioning"),
proj("B", "A", "online"),
proj("C", "A", "online"),
]);
expect(s(map, "B")).toMatchObject({ isLockedChild: true });
expect(s(map, "C")).toMatchObject({ isLockedChild: true });
expect(s(map, "A")).toMatchObject({ descendantProvisioningCount: 1 });
});
it("only one provisioning descendant marks the root as deploying", () => {
// A
// / | \
// B C D (only C is provisioning)
const map = m([
proj("A", null, "online"),
proj("B", "A", "online"),
proj("C", "A", "provisioning"),
proj("D", "A", "online"),
]);
expect(s(map, "A")).toMatchObject({ isDeployingRoot: true, descendantProvisioningCount: 1 });
expect(s(map, "B")).toMatchObject({ isLockedChild: true });
expect(s(map, "C")).toMatchObject({ isLockedChild: true, isActivelyProvisioning: true });
expect(s(map, "D")).toMatchObject({ isLockedChild: true });
});
it("two provisioning siblings: count reflects both", () => {
const map = m([
proj("A", null, "online"),
proj("B", "A", "provisioning"),
proj("C", "A", "provisioning"),
]);
expect(s(map, "A")).toMatchObject({ descendantProvisioningCount: 2 });
expect(s(map, "B")).toMatchObject({ isActivelyProvisioning: true });
expect(s(map, "C")).toMatchObject({ isActivelyProvisioning: true });
});
});
// ── Multiple disjoint trees ───────────────────────────────────────────────────
describe("buildDeployMap — multiple disjoint trees", () => {
it("each tree has its own root; deploying nodes are independent", () => {
// Tree 1: X (provisioning) ──► Y
// Tree 2: P ──► Q (no provisioning)
const map = m([
proj("X", null, "provisioning"),
proj("Y", "X", "online"),
proj("P", null, "online"),
proj("Q", "P", "online"),
]);
expect(s(map, "X")).toMatchObject({ isDeployingRoot: true });
expect(s(map, "Y")).toMatchObject({ isLockedChild: true });
expect(s(map, "P")).toMatchObject({ isDeployingRoot: false, isLockedChild: false });
expect(s(map, "Q")).toMatchObject({ isDeployingRoot: false, isLockedChild: false });
});
});
// ── Deleting nodes ────────────────────────────────────────────────────────────
describe("buildDeployMap — deletingIds", () => {
it("node in deletingIds is locked even if tree is not deploying", () => {
const map = m(
[
proj("A", null, "online"),
proj("B", "A", "online"),
],
["B"], // B is being deleted
);
expect(s(map, "A")).toMatchObject({ isLockedChild: false });
expect(s(map, "B")).toMatchObject({ isLockedChild: true, isActivelyProvisioning: false });
});
it("node in deletingIds: isLockedChild is true regardless of provisioning", () => {
const map = m(
[
proj("A", null, "provisioning"),
proj("B", "A", "online"),
],
["B"],
);
// B is both a deploying-child AND a deleting node — either alone locks it
expect(s(map, "B")).toMatchObject({ isLockedChild: true });
});
it("empty deletingIds set has no effect", () => {
const map = m(
[
proj("A", null, "online"),
proj("B", "A", "online"),
],
[],
);
expect(s(map, "B")).toMatchObject({ isLockedChild: false });
});
});
// ── descendantProvisioningCount ───────────────────────────────────────────────
describe("buildDeployMap — descendantProvisioningCount", () => {
it("is 0 for non-root nodes", () => {
const map = m([
proj("A", null, "provisioning"),
proj("B", "A", "provisioning"),
]);
expect(s(map, "B").descendantProvisioningCount).toBe(0);
});
it("includes the root's own status when provisioning", () => {
const map = m([
proj("A", null, "provisioning"),
proj("B", "A", "online"),
]);
// A is both root and provisioning → count includes itself
expect(s(map, "A").descendantProvisioningCount).toBe(1);
});
it("accumulates all provisioning descendants (not just immediate children)", () => {
const map = m([
proj("A", null, "online"),
proj("B", "A", "online"),
proj("C", "B", "provisioning"),
]);
expect(s(map, "A").descendantProvisioningCount).toBe(1);
});
});
// ── O(n) performance ─────────────────────────────────────────────────────────
describe("buildDeployMap — O(n) performance contract", () => {
it("handles a 50-node three-level tree without incorrect node assignments", () => {
// Level 0: 1 root
// Level 1: 7 children
// Level 2: 42 leaves
// Total: 50 nodes
const projections: Projection[] = [];
projections.push(proj("root", null, "provisioning"));
for (let i = 0; i < 7; i++) {
projections.push(proj(`l1-${i}`, "root", "online"));
}
for (let i = 0; i < 42; i++) {
const parent = `l1-${Math.floor(i / 6)}`;
projections.push(proj(`l2-${i}`, parent, "online"));
}
const map = m(projections);
// Root is the only deploying node
expect(s(map, "root")).toMatchObject({
isDeployingRoot: true,
isLockedChild: false,
descendantProvisioningCount: 1,
});
// Every other node is a locked child
for (let i = 0; i < 7; i++) {
expect(s(map, `l1-${i}`)).toMatchObject({ isLockedChild: true, isDeployingRoot: false });
}
for (let i = 0; i < 42; i++) {
expect(s(map, `l2-${i}`)).toMatchObject({ isLockedChild: true, isDeployingRoot: false });
}
});
});
@@ -40,7 +40,8 @@ interface NodeProjection {
status: string;
}
function buildDeployMap(
// Exported for unit testing — the function is pure and deterministic.
export function buildDeployMap(
projections: NodeProjection[],
deletingIds: ReadonlySet<string>,
): Map<string, OrgDeployState> {
+210
View File
@@ -0,0 +1,210 @@
"use client";
// MobileApp — top-level mobile shell.
// Local route state, bottom tab bar, theme-aware palette. Only rendered
// on viewports < 640px (see app/page.tsx). The desktop Canvas is not
// instantiated when MobileApp is active, so no React Flow + heavy
// chrome cost on phones.
import { useEffect, useMemo, useState } from "react";
import { useTheme } from "@/lib/theme-provider";
import { TabBar, type MobileTabId } from "./components";
import { MobileCanvas } from "./MobileCanvas";
import { MobileChat } from "./MobileChat";
import { MobileComms } from "./MobileComms";
import { MobileDetail } from "./MobileDetail";
import { MobileHome } from "./MobileHome";
import { MobileMe } from "./MobileMe";
import { MobileSpawn } from "./MobileSpawn";
import { usePalette } from "./palette";
import { MobileAccentProvider } from "./palette-context";
type Route = "home" | "canvas" | "detail" | "chat" | "comms" | "me";
const ROUTES: Route[] = ["home", "canvas", "detail", "chat", "comms", "me"];
const ACCENT_KEY = "molecule.mobile.accent";
const DENSITY_KEY = "molecule.mobile.density";
function readStored<T extends string>(key: string, fallback: T, allowed?: T[]): T {
if (typeof window === "undefined") return fallback;
try {
const v = window.localStorage.getItem(key);
if (!v) return fallback;
if (allowed && !allowed.includes(v as T)) return fallback;
return v as T;
} catch {
return fallback;
}
}
interface UrlState {
route: Route;
agentId: string | null;
}
/** Parse the current URL into a (route, agentId) pair. Reads from
* `?m=<route>&a=<agentId>` — `home` is the default when `m` is
* absent. Detail/chat without an agent id collapse back to `home`
* because they're meaningless without one. */
function readRouteFromUrl(): UrlState {
if (typeof window === "undefined") return { route: "home", agentId: null };
const params = new URLSearchParams(window.location.search);
const m = params.get("m");
const a = params.get("a");
const route: Route = ROUTES.includes(m as Route) ? (m as Route) : "home";
if ((route === "detail" || route === "chat") && !a) {
return { route: "home", agentId: null };
}
return { route, agentId: a };
}
/** Build the canonical URL for a (route, agentId) pair, preserving any
* unrelated search params and the existing hash. `home` is the default
* state, so we drop `m` from the URL to keep the no-state link clean. */
function buildRouteUrl(route: Route, agentId: string | null): string {
if (typeof window === "undefined") return "";
const params = new URLSearchParams(window.location.search);
if (route === "home") params.delete("m");
else params.set("m", route);
if (agentId && (route === "detail" || route === "chat")) params.set("a", agentId);
else params.delete("a");
const search = params.toString();
return window.location.pathname + (search ? "?" + search : "") + window.location.hash;
}
export function MobileApp() {
const { resolvedTheme } = useTheme();
const dark = resolvedTheme === "dark";
const p = usePalette(dark);
// Seed route + agentId from the URL so deep links like
// `/?m=detail&a=ws-42` open straight on the right screen.
const [route, setRoute] = useState<Route>(() => readRouteFromUrl().route);
const [agentId, setAgentId] = useState<string | null>(() => readRouteFromUrl().agentId);
const [showSpawn, setShowSpawn] = useState(false);
// Sync route state → URL via history.pushState. Skip the push when
// the URL is already what we'd produce — that handles the initial
// mount (we read FROM the URL) and prevents duplicate history entries
// when popstate restores state we just pushed.
useEffect(() => {
if (typeof window === "undefined") return;
const current = readRouteFromUrl();
if (current.route === route && current.agentId === agentId) return;
const url = buildRouteUrl(route, agentId);
window.history.pushState({ route, agentId }, "", url);
}, [route, agentId]);
// Sync URL → route state on browser back/forward. The popstate event
// fires AFTER the URL has changed, so re-reading is correct.
useEffect(() => {
if (typeof window === "undefined") return;
const onPop = () => {
const next = readRouteFromUrl();
setRoute(next.route);
setAgentId(next.agentId);
};
window.addEventListener("popstate", onPop);
return () => window.removeEventListener("popstate", onPop);
}, []);
const [accent, setAccentState] = useState<string>(() => readStored(ACCENT_KEY, "#2f9e6a"));
const [density, setDensityState] = useState<"compact" | "regular">(() =>
readStored<"compact" | "regular">(DENSITY_KEY, "regular", ["compact", "regular"]),
);
// Persist accent. The accent itself is propagated into every palette
// read via React context (MobileAccentProvider below) — never by
// mutating the MOL_LIGHT/MOL_DARK singletons.
useEffect(() => {
try {
window.localStorage.setItem(ACCENT_KEY, accent);
} catch {
/* noop */
}
}, [accent]);
useEffect(() => {
try {
window.localStorage.setItem(DENSITY_KEY, density);
} catch {
/* noop */
}
}, [density]);
const activeTab: MobileTabId = useMemo(() => {
if (route === "canvas") return "canvas";
if (route === "comms") return "comms";
if (route === "me") return "me";
return "agents";
}, [route]);
const onTabChange = (id: MobileTabId) => {
if (id === "agents") setRoute("home");
else if (id === "canvas") setRoute("canvas");
else if (id === "comms") setRoute("comms");
else if (id === "me") setRoute("me");
};
const openAgent = (id: string) => {
setAgentId(id);
setRoute("detail");
};
// Tab bar visible everywhere except chat (per design).
const showTabBar = route !== "chat";
return (
<MobileAccentProvider accent={accent}>
<main
style={{
position: "fixed",
inset: 0,
background: p.bg,
color: p.text,
overflow: "hidden",
contain: "strict",
}}
>
{route === "home" && (
<MobileHome
dark={dark}
density={density}
onOpen={openAgent}
onSpawn={() => setShowSpawn(true)}
/>
)}
{route === "canvas" && (
<MobileCanvas dark={dark} onOpen={openAgent} onSpawn={() => setShowSpawn(true)} />
)}
{route === "detail" && agentId && (
<MobileDetail
agentId={agentId}
dark={dark}
onBack={() => setRoute("home")}
onChat={() => setRoute("chat")}
/>
)}
{route === "chat" && agentId && (
<MobileChat agentId={agentId} dark={dark} onBack={() => setRoute("detail")} />
)}
{route === "comms" && <MobileComms dark={dark} />}
{route === "me" && (
<MobileMe
dark={dark}
accent={accent}
setAccent={setAccentState}
density={density}
setDensity={setDensityState}
/>
)}
{showTabBar && <TabBar dark={dark} active={activeTab} onChange={onTabChange} />}
{showSpawn && <MobileSpawn dark={dark} onClose={() => setShowSpawn(false)} />}
</main>
</MobileAccentProvider>
);
}
@@ -0,0 +1,401 @@
"use client";
// 02 · Canvas graph — pan-friendly mini-graph with status-coloured nodes.
// Node positions come from the live store (the same x/y the desktop canvas
// uses). The screen normalizes them to a 0..1 viewport so the graph fits
// the phone frame regardless of where the user has the desktop pan/zoom.
import { useMemo, useRef, useState, type TouchEvent as ReactTouchEvent } from "react";
import { useCanvasStore } from "@/store/canvas";
import { type MobileAgent, WorkspacePill, toMobileAgent } from "./components";
import { MOBILE_FONT_MONO, MOBILE_FONT_SANS, usePalette } from "./palette";
import { Icons, StatusDot, TierChip } from "./primitives";
const SCALE_MIN = 0.5;
const SCALE_MAX = 3;
interface Gesture {
kind: "none" | "pinch" | "pan";
startDist?: number;
startScale?: number;
startTouch?: { x: number; y: number };
startPan?: { x: number; y: number };
}
const clamp = (v: number, lo: number, hi: number) => Math.max(lo, Math.min(hi, v));
export function MobileCanvas({
dark,
onOpen,
onSpawn,
}: {
dark: boolean;
onOpen: (agentId: string) => void;
onSpawn: () => void;
}) {
const p = usePalette(dark);
const nodes = useCanvasStore((s) => s.nodes);
// Project store nodes into 0..100 (%) space, leaving 8% padding on each
// edge so cards don't clip. Falls back to a uniform circular layout
// when every node sits at (0,0) — common right after first hydrate.
const layout = useMemo(() => {
const items = nodes.map((n) => ({
id: n.id,
agent: toMobileAgent(n),
x: n.position?.x ?? 0,
y: n.position?.y ?? 0,
parentId: n.data.parentId ?? null,
}));
if (items.length === 0) return [] as Array<{ agent: MobileAgent; x: number; y: number; parentId: string | null }>;
const xs = items.map((i) => i.x);
const ys = items.map((i) => i.y);
const xMin = Math.min(...xs);
const xMax = Math.max(...xs);
const yMin = Math.min(...ys);
const yMax = Math.max(...ys);
const spread = (xMax - xMin) + (yMax - yMin);
if (spread < 1) {
// Degenerate (everything stacked) — fall back to a ring.
const n = items.length;
return items.map((it, idx) => {
const angle = (idx / n) * Math.PI * 2;
return {
agent: it.agent,
parentId: it.parentId,
x: 50 + Math.cos(angle) * 32,
y: 50 + Math.sin(angle) * 26,
};
});
}
const scaleX = (v: number) =>
xMax === xMin ? 50 : 8 + ((v - xMin) / (xMax - xMin)) * 84;
const scaleY = (v: number) =>
yMax === yMin ? 50 : 14 + ((v - yMin) / (yMax - yMin)) * 70;
return items.map((it) => ({
agent: it.agent,
parentId: it.parentId,
x: scaleX(it.x),
y: scaleY(it.y),
}));
}, [nodes]);
// Edges = parent→child relations from the store.
const edges = useMemo(() => {
const byId = new Map(layout.map((l) => [l.agent.id, l]));
return layout
.filter((l) => l.parentId && byId.has(l.parentId))
.map((l) => ({ from: byId.get(l.parentId!)!, to: l }));
}, [layout]);
// Pinch-to-zoom + single-finger pan over the graph layer. Header pill,
// legend, and FAB stay anchored to the viewport (outside the transform
// layer). Tap-to-open still works because a stationary touchend
// dispatches a click on the underlying button.
const [scale, setScale] = useState(1);
const [pan, setPan] = useState({ x: 0, y: 0 });
const gestureRef = useRef<Gesture>({ kind: "none" });
const onTouchStart = (e: ReactTouchEvent<HTMLDivElement>) => {
if (e.touches.length === 2) {
const a = e.touches[0];
const b = e.touches[1];
gestureRef.current = {
kind: "pinch",
startDist: Math.hypot(b.clientX - a.clientX, b.clientY - a.clientY),
startScale: scale,
};
} else if (e.touches.length === 1) {
const t = e.touches[0];
gestureRef.current = {
kind: "pan",
startTouch: { x: t.clientX, y: t.clientY },
startPan: { ...pan },
};
}
};
const onTouchMove = (e: ReactTouchEvent<HTMLDivElement>) => {
const g = gestureRef.current;
if (g.kind === "pinch" && e.touches.length === 2 && g.startDist && g.startScale) {
const a = e.touches[0];
const b = e.touches[1];
const dist = Math.hypot(b.clientX - a.clientX, b.clientY - a.clientY);
setScale(clamp(g.startScale * (dist / g.startDist), SCALE_MIN, SCALE_MAX));
} else if (g.kind === "pan" && e.touches.length === 1 && g.startTouch && g.startPan) {
const t = e.touches[0];
setPan({
x: g.startPan.x + (t.clientX - g.startTouch.x),
y: g.startPan.y + (t.clientY - g.startTouch.y),
});
}
};
const onTouchEnd = (e: ReactTouchEvent<HTMLDivElement>) => {
if (e.touches.length === 0) gestureRef.current = { kind: "none" };
};
const resetView = () => {
setScale(1);
setPan({ x: 0, y: 0 });
};
const transformStyle = {
transform: `translate(${pan.x}px, ${pan.y}px) scale(${scale})`,
transformOrigin: "50% 50%",
// Smooth out the pinch math without lagging the gesture; tighter
// than a CSS animation so it doesn't feel rubber-bandy.
willChange: "transform",
};
const zoomed = Math.abs(scale - 1) > 0.01 || pan.x !== 0 || pan.y !== 0;
return (
<div
style={{
position: "absolute",
inset: 0,
background: p.bg,
overflow: "hidden",
fontFamily: MOBILE_FONT_SANS,
// Tell the browser we own touch gestures here — without this, the
// browser performs default pinch-to-zoom on the page itself,
// which would zoom the entire phone shell, not just our graph.
touchAction: "none",
}}
onTouchStart={onTouchStart}
onTouchMove={onTouchMove}
onTouchEnd={onTouchEnd}
>
{/* Dotted grid background — fills the viewport, doesn't transform */}
<div
style={{
position: "absolute",
inset: 0,
backgroundImage: `radial-gradient(${dark ? "rgba(255,255,255,0.05)" : "rgba(40,30,20,0.07)"} 1px, transparent 1px)`,
backgroundSize: "18px 18px",
}}
/>
{/* Header pill */}
<div
style={{
position: "absolute",
top: "max(env(safe-area-inset-top), 44px)",
left: 0,
right: 0,
zIndex: 20,
display: "flex",
justifyContent: "center",
padding: "0 12px",
}}
>
<WorkspacePill dark={dark} count={nodes.length} />
</div>
{/* Reset-view button — only shown after the user has zoomed or
panned, so the corner stays clean by default. Sits next to the
legend so it doesn't fight the spawn FAB. */}
{zoomed && (
<button
type="button"
onClick={resetView}
aria-label="Reset zoom"
style={{
position: "absolute",
right: 14,
top: "calc(max(env(safe-area-inset-top), 44px) + 56px)",
zIndex: 25,
padding: "6px 12px",
borderRadius: 999,
cursor: "pointer",
background: dark ? "rgba(34,33,28,0.78)" : "rgba(255,253,247,0.88)",
backdropFilter: "blur(20px)",
border: `0.5px solid ${p.border}`,
color: p.text2,
fontSize: 11,
fontFamily: MOBILE_FONT_MONO,
letterSpacing: "0.04em",
textTransform: "uppercase",
fontWeight: 600,
}}
>
Reset
</button>
)}
{/* Transform layer — pinch-zoom + pan apply here. Edges and nodes
live inside so they scale together; everything outside this
layer (header, legend, FAB) is anchored to the viewport. */}
<div
style={{
position: "absolute",
inset: 0,
...transformStyle,
}}
>
{/* SVG edges */}
<svg
style={{
position: "absolute",
inset: 0,
width: "100%",
height: "100%",
zIndex: 1,
pointerEvents: "none",
}}
aria-hidden="true"
>
{edges.map((e, i) => (
<line
key={i}
x1={`${e.from.x}%`}
y1={`${e.from.y}%`}
x2={`${e.to.x}%`}
y2={`${e.to.y}%`}
stroke={dark ? "rgba(255,255,255,0.12)" : "rgba(40,30,20,0.12)"}
strokeWidth={1 / scale}
strokeDasharray="2 4"
/>
))}
</svg>
{/* Nodes */}
{layout.map((l) => {
const isOnline = l.agent.status === "online";
return (
<button
key={l.agent.id}
type="button"
onClick={() => onOpen(l.agent.id)}
style={{
position: "absolute",
left: `${l.x}%`,
top: `${l.y}%`,
transform: "translate(-50%, -50%)",
width: 130,
maxWidth: "42%",
background:
l.agent.tier === "T4" && isOnline
? p.t4SoftCard
: isOnline
? p.greenSoft
: p.surface,
border: `0.5px solid ${p.border}`,
borderRadius: 12,
padding: "8px 10px",
display: "flex",
flexDirection: "column",
gap: 4,
cursor: "pointer",
textAlign: "left",
boxShadow: dark
? "0 4px 14px rgba(0,0,0,0.3)"
: "0 2px 8px rgba(40,30,20,0.06)",
zIndex: 5,
}}
>
<div style={{ display: "flex", alignItems: "center", gap: 6 }}>
<StatusDot status={l.agent.status} size={7} dark={dark} halo={false} />
<span
style={{
flex: 1,
fontSize: 12,
fontWeight: 600,
color: p.text,
whiteSpace: "nowrap",
overflow: "hidden",
textOverflow: "ellipsis",
}}
>
{l.agent.name}
</span>
<TierChip tier={l.agent.tier} dark={dark} />
</div>
<div
style={{
fontSize: 9,
color: p.text3,
letterSpacing: "0.04em",
fontFamily: MOBILE_FONT_MONO,
}}
>
{l.agent.tag}
</div>
</button>
);
})}
</div>
{/* End transform layer */}
{/* Bottom legend */}
<div
style={{
position: "absolute",
left: 14,
bottom: 96,
zIndex: 25,
background: dark ? "rgba(34,33,28,0.78)" : "rgba(255,253,247,0.88)",
backdropFilter: "blur(20px)",
border: `0.5px solid ${p.border}`,
borderRadius: 14,
padding: "10px 12px",
boxShadow: "0 4px 14px rgba(40,30,20,0.08)",
fontFamily: MOBILE_FONT_MONO,
fontSize: 9.5,
color: p.text2,
letterSpacing: "0.04em",
}}
>
<div
style={{
fontWeight: 600,
color: p.text3,
marginBottom: 6,
textTransform: "uppercase",
}}
>
Legend
</div>
<div style={{ display: "flex", gap: 10, flexWrap: "wrap", maxWidth: 180 }}>
{(["online", "starting", "degraded", "failed", "paused"] as const).map((s) => (
<span key={s} style={{ display: "inline-flex", alignItems: "center", gap: 4 }}>
<StatusDot status={s} size={6} dark={dark} halo={false} />
{s}
</span>
))}
</div>
</div>
{/* Spawn FAB */}
<button
type="button"
onClick={onSpawn}
aria-label="Spawn new agent"
style={{
position: "absolute",
right: 24,
bottom: 100,
zIndex: 25,
width: 54,
height: 54,
borderRadius: 999,
border: "none",
cursor: "pointer",
background: p.text,
color: dark ? p.bg : "#fff",
display: "flex",
alignItems: "center",
justifyContent: "center",
boxShadow: "0 8px 24px rgba(40,30,20,0.25)",
}}
>
{Icons.plus({ size: 22 })}
</button>
</div>
);
}
+498
View File
@@ -0,0 +1,498 @@
"use client";
// 04 · Chat — message thread + composer + sub-tabs.
// Wired to the same /workspaces/:id/a2a (method message/send) endpoint
// that the desktop ChatTab uses, but with a slimmer surface: no
// attachments, no A2A topology overlay, no conversation tracing.
import { useEffect, useRef, useState } from "react";
import { api } from "@/lib/api";
import { useCanvasStore } from "@/store/canvas";
import { toMobileAgent } from "./components";
import { MOBILE_FONT_MONO, MOBILE_FONT_SANS, usePalette } from "./palette";
import { Icons, StatusDot, TierChip } from "./primitives";
interface ChatMessage {
id: string;
role: "user" | "agent" | "system";
text: string;
ts: string;
}
const formatStoredTimestamp = (iso: string): string => {
const d = new Date(iso);
if (isNaN(d.getTime())) return "";
return d.toLocaleTimeString([], { hour: "numeric", minute: "2-digit" });
};
type SubTab = "my" | "a2a";
interface A2AResponseShape {
result?: {
parts?: Array<{ kind?: string; text?: string }>;
};
error?: { message?: string };
}
const formatTime = (date: Date) =>
date.toLocaleTimeString([], { hour: "numeric", minute: "2-digit" });
export function MobileChat({
agentId,
dark,
onBack,
}: {
agentId: string;
dark: boolean;
onBack: () => void;
}) {
const p = usePalette(dark);
const node = useCanvasStore((s) => s.nodes.find((n) => n.id === agentId));
// Bootstrap from the canvas store's per-workspace message buffer so the
// user sees their prior thread on entry. The store is updated by the
// socket → ChatTab flows the desktop runs; on mobile we read from the
// same buffer to keep state coherent across viewports.
// NOTE: do NOT use `?? []` in the selector — Zustand uses Object.is
// for selector equality. A fallback `?? []` creates a new [] reference on
// every store update when agentMessages[agentId] is undefined, causing an
// infinite re-render loop (React error #185 / Maximum update depth
// exceeded). The undefined case is handled by the initializer below.
const storedMessages = useCanvasStore((s) => s.agentMessages[agentId]);
const [messages, setMessages] = useState<ChatMessage[]>(() =>
(storedMessages ?? []).map((m) => ({
id: m.id,
role: "agent",
text: m.content,
ts: formatStoredTimestamp(m.timestamp),
})),
);
const [draft, setDraft] = useState("");
const [tab, setTab] = useState<SubTab>("my");
const [sending, setSending] = useState(false);
const [error, setError] = useState<string | null>(null);
const scrollRef = useRef<HTMLDivElement>(null);
// Synchronous re-entry guard. `setSending(true)` schedules a state
// update but doesn't flush before a second tap can fire send() — a ref
// mirrors the desktop ChatTab pattern (sendInFlightRef) and closes the
// double-send race a stale `sending` lets through.
const sendInFlightRef = useRef(false);
const composerRef = useRef<HTMLTextAreaElement>(null);
// Auto-grow the textarea: reset height to 'auto' so the scrollHeight
// shrinks when the user deletes text, then size to scrollHeight up to
// a 5-line cap. Beyond the cap, internal scroll kicks in.
useEffect(() => {
const el = composerRef.current;
if (!el) return;
el.style.height = "auto";
const next = Math.min(el.scrollHeight, 132); // ~5 lines at 14.5px/1.4
el.style.height = `${next}px`;
}, [draft]);
useEffect(() => {
if (scrollRef.current) {
scrollRef.current.scrollTop = scrollRef.current.scrollHeight;
}
}, [messages]);
if (!node) {
return (
<div
style={{
height: "100%",
background: p.bg,
display: "flex",
alignItems: "center",
justifyContent: "center",
color: p.text3,
fontSize: 13,
fontFamily: MOBILE_FONT_SANS,
}}
>
Agent not found.
</div>
);
}
const a = toMobileAgent(node);
const reachable = a.status === "online" || a.status === "degraded";
const send = async () => {
const text = draft.trim();
if (!text || sending || !reachable) return;
if (sendInFlightRef.current) return;
sendInFlightRef.current = true;
setDraft("");
setError(null);
setSending(true);
const myMsg: ChatMessage = {
id: crypto.randomUUID(),
role: "user",
text,
ts: formatTime(new Date()),
};
setMessages((m) => [...m, myMsg]);
try {
const res = await api.post<A2AResponseShape>(`/workspaces/${agentId}/a2a`, {
method: "message/send",
params: {
message: {
role: "user",
messageId: crypto.randomUUID(),
parts: [{ kind: "text", text }],
},
},
});
const reply =
res.result?.parts?.find((part) => part.kind === "text")?.text ?? "";
if (reply) {
setMessages((m) => [
...m,
{
id: crypto.randomUUID(),
role: "agent",
text: reply,
ts: formatTime(new Date()),
},
]);
} else if (res.error?.message) {
setError(res.error.message);
}
} catch (e) {
setError(e instanceof Error ? e.message : "Failed to send");
} finally {
setSending(false);
sendInFlightRef.current = false;
}
};
return (
<div
style={{
height: "100%",
display: "flex",
flexDirection: "column",
background: p.bg,
fontFamily: MOBILE_FONT_SANS,
}}
>
{/* Header */}
<div
style={{
padding: "max(env(safe-area-inset-top), 44px) 14px 10px",
borderBottom: `0.5px solid ${p.divider}`,
background: dark ? "rgba(21,20,15,0.85)" : "rgba(246,244,239,0.85)",
backdropFilter: "blur(14px)",
}}
>
<div style={{ display: "flex", alignItems: "center", gap: 10 }}>
<button
type="button"
onClick={onBack}
aria-label="Back"
style={{
width: 36,
height: 36,
borderRadius: 999,
border: "none",
cursor: "pointer",
background: "transparent",
color: p.text2,
display: "flex",
alignItems: "center",
justifyContent: "center",
}}
>
{Icons.back({ size: 18 })}
</button>
<div style={{ flex: 1, minWidth: 0 }}>
<div style={{ display: "flex", alignItems: "center", gap: 6 }}>
<StatusDot status={a.status} size={7} dark={dark} halo={false} />
<span
style={{
fontSize: 15,
fontWeight: 600,
color: p.text,
whiteSpace: "nowrap",
overflow: "hidden",
textOverflow: "ellipsis",
}}
>
{a.name}
</span>
<TierChip tier={a.tier} dark={dark} />
</div>
<div
style={{
fontSize: 11,
color: p.text3,
marginTop: 2,
fontFamily: MOBILE_FONT_MONO,
}}
>
{a.runtime} · {a.skills} skills
</div>
</div>
<button
type="button"
aria-label="More"
style={{
width: 36,
height: 36,
borderRadius: 999,
border: "none",
cursor: "pointer",
background: "transparent",
color: p.text2,
display: "flex",
alignItems: "center",
justifyContent: "center",
}}
>
{Icons.more({ size: 18 })}
</button>
</div>
{/* Sub-tabs */}
<div style={{ display: "flex", gap: 18, marginTop: 12, paddingLeft: 4 }}>
{(
[
{ id: "my", label: "My Chat" },
{ id: "a2a", label: "Agent Comms" },
] as const
).map((t) => {
const on = tab === t.id;
return (
<button
key={t.id}
type="button"
onClick={() => setTab(t.id)}
style={{
padding: "4px 0 8px",
border: "none",
background: "transparent",
fontSize: 13.5,
cursor: "pointer",
color: on ? p.text : p.text3,
fontWeight: on ? 600 : 500,
borderBottom: on ? `2px solid ${p.accent}` : "2px solid transparent",
}}
>
{t.label}
</button>
);
})}
</div>
</div>
{/* Messages */}
<div
ref={scrollRef}
style={{
flex: 1,
overflow: "auto",
padding: "14px 14px 16px",
display: "flex",
flexDirection: "column",
gap: 8,
}}
>
{tab === "a2a" && (
<div
style={{
padding: "20px 4px",
textAlign: "center",
color: p.text3,
fontSize: 13,
}}
>
Agent Comms peer-to-peer A2A traffic surfaces in the Comms tab.
</div>
)}
{tab === "my" && messages.length === 0 && (
<div style={{ padding: "20px 4px", textAlign: "center", color: p.text3, fontSize: 13 }}>
Send a message to start chatting.
</div>
)}
{tab === "my" &&
messages.map((m) => {
const mine = m.role === "user";
return (
<div
key={m.id}
style={{
display: "flex",
justifyContent: mine ? "flex-end" : "flex-start",
}}
>
<div
style={{
maxWidth: "78%",
background: mine ? p.accent : dark ? "#22211c" : "#fff",
color: mine ? "#fff" : p.text,
border: mine ? "none" : `0.5px solid ${p.border}`,
borderRadius: mine ? "18px 18px 4px 18px" : "18px 18px 18px 4px",
padding: "9px 13px",
fontSize: 14.5,
lineHeight: 1.4,
overflowWrap: "anywhere",
}}
>
{m.text}
<div
style={{
fontSize: 10,
marginTop: 4,
opacity: mine ? 0.75 : 0.5,
fontFamily: MOBILE_FONT_MONO,
}}
>
{m.ts}
</div>
</div>
</div>
);
})}
{error && (
<div
role="alert"
style={{
alignSelf: "center",
padding: "6px 12px",
borderRadius: 12,
background: `${p.failed}1a`,
color: p.failed,
fontSize: 12,
}}
>
{error}
</div>
)}
</div>
{/* Footer ID */}
<div
style={{
padding: "0 14px 6px",
textAlign: "center",
fontFamily: MOBILE_FONT_MONO,
fontSize: 9.5,
color: p.text3,
letterSpacing: "0.04em",
overflow: "hidden",
textOverflow: "ellipsis",
whiteSpace: "nowrap",
}}
>
{agentId}
</div>
{/* Composer */}
<div
style={{
padding: "10px 12px max(env(safe-area-inset-bottom), 16px)",
borderTop: `0.5px solid ${p.divider}`,
background: dark ? "rgba(21,20,15,0.92)" : "rgba(246,244,239,0.92)",
backdropFilter: "blur(14px)",
}}
>
<div
style={{
display: "flex",
alignItems: "flex-end",
gap: 8,
background: dark ? "#22211c" : "#fff",
border: `0.5px solid ${p.border}`,
borderRadius: 22,
padding: "6px 6px 6px 12px",
}}
>
<button
type="button"
aria-label="Attach"
style={{
width: 32,
height: 32,
borderRadius: 999,
border: "none",
cursor: "pointer",
background: "transparent",
color: p.text3,
flexShrink: 0,
display: "flex",
alignItems: "center",
justifyContent: "center",
}}
>
{Icons.attach({ size: 16 })}
</button>
<textarea
ref={composerRef}
value={draft}
onChange={(e) => setDraft(e.target.value)}
onKeyDown={(e) => {
// Enter sends; Shift+Enter inserts a newline. Skip when the
// IME is composing — pressing Enter to commit a Chinese/
// Japanese candidate would otherwise dispatch the half-typed
// message (the same regression the desktop ChatTab guards).
if (
e.key === "Enter" &&
!e.shiftKey &&
!e.nativeEvent.isComposing &&
e.keyCode !== 229
) {
e.preventDefault();
send();
}
}}
placeholder={reachable ? "Send a message…" : `Agent is ${a.status}`}
disabled={!reachable}
rows={1}
style={{
flex: 1,
border: "none",
outline: "none",
background: "transparent",
fontSize: 14.5,
lineHeight: 1.4,
color: p.text,
padding: "6px 0",
fontFamily: "inherit",
minWidth: 0,
resize: "none",
maxHeight: 132,
overflowY: "auto",
}}
/>
<button
type="button"
onClick={send}
disabled={!draft.trim() || !reachable || sending}
aria-label="Send"
style={{
width: 36,
height: 36,
borderRadius: 999,
border: "none",
cursor: draft.trim() && !sending ? "pointer" : "not-allowed",
flexShrink: 0,
background:
draft.trim() && reachable && !sending
? p.accent
: dark
? "#2a2823"
: "#ece9e0",
color: draft.trim() && reachable && !sending ? "#fff" : p.text3,
display: "flex",
alignItems: "center",
justifyContent: "center",
}}
>
{Icons.send({ size: 16 })}
</button>
</div>
</div>
</div>
);
}
@@ -0,0 +1,368 @@
"use client";
// 05 · Comms feed — workspace-wide A2A traffic.
// Bootstraps from /workspaces/:id/activity for the first few online
// workspaces, then prepends ACTIVITY_LOGGED events from the live socket.
import { useCallback, useEffect, useMemo, useState } from "react";
import { api } from "@/lib/api";
import { useSocketEvent } from "@/hooks/useSocketEvent";
import { useCanvasStore } from "@/store/canvas";
import { WorkspacePill } from "./components";
import { MOBILE_FONT_MONO, MOBILE_FONT_SANS, usePalette } from "./palette";
import { SectionLabel } from "./primitives";
interface CommItem {
id: string;
from: string;
to: string;
kind: string;
status: "ok" | "err";
summary: string;
durationMs: number | null;
ago: string;
ts: number;
}
interface ActivityRecord {
id: string;
workspace_id: string;
activity_type: string;
source_id: string | null;
target_id: string | null;
summary: string | null;
status: string;
duration_ms: number | null;
created_at: string;
}
const FAN_OUT_CAP = 4;
const RENDER_CAP = 30;
type FilterId = "all" | "errors";
function relativeAgo(iso: string): string {
const t = Date.parse(iso);
if (isNaN(t)) return "";
const seconds = Math.max(0, Math.round((Date.now() - t) / 1000));
if (seconds < 60) return `${seconds}s`;
const minutes = Math.round(seconds / 60);
if (minutes < 60) return `${minutes}m`;
const hours = Math.round(minutes / 60);
if (hours < 24) return `${hours}h`;
const days = Math.round(hours / 24);
return `${days}d`;
}
export function MobileComms({ dark }: { dark: boolean }) {
const p = usePalette(dark);
const nodes = useCanvasStore((s) => s.nodes);
const [items, setItems] = useState<CommItem[]>([]);
const [filter, setFilter] = useState<FilterId>("all");
const [loading, setLoading] = useState(true);
const nameOf = useCallback(
(id: string | null | undefined): string => {
if (!id) return "Unknown";
const n = nodes.find((x) => x.id === id);
return n?.data.name ?? id.slice(0, 8);
},
[nodes],
);
const toItem = useCallback(
(a: ActivityRecord): CommItem => ({
id: a.id,
from: nameOf(a.source_id ?? a.workspace_id),
to: nameOf(a.target_id),
kind: a.activity_type,
status: a.status === "error" || a.status === "err" ? "err" : "ok",
summary: a.summary ?? "",
durationMs: a.duration_ms,
ago: relativeAgo(a.created_at),
ts: Date.parse(a.created_at) || Date.now(),
}),
[nameOf],
);
// Stable signature of the online-workspace set. Re-runs the bootstrap
// only when which workspaces are online changes — not on every node
// position update or unrelated data churn.
const onlineWorkspaceIds = useMemo(
() =>
nodes
.filter((n) => n.data.status === "online")
.slice(0, FAN_OUT_CAP)
.map((n) => n.id),
[nodes],
);
const onlineSignature = onlineWorkspaceIds.join("|");
// Bootstrap: pull the most recent activity from the first few online
// workspaces. Identical fan-out cap to CommunicationOverlay to keep
// the load profile predictable on big tenants.
useEffect(() => {
let cancelled = false;
if (onlineWorkspaceIds.length === 0) {
setLoading(false);
return;
}
Promise.all(
onlineWorkspaceIds.map((id) =>
api.get<ActivityRecord[]>(`/workspaces/${id}/activity?limit=8`).catch(() => []),
),
).then((batches) => {
if (cancelled) return;
const flat = batches.flat().map(toItem);
flat.sort((a, b) => b.ts - a.ts);
setItems(flat.slice(0, RENDER_CAP));
setLoading(false);
});
return () => {
cancelled = true;
};
// Effect depends on the signature string (stable when the id set
// doesn't change) + toItem (memoized via useCallback). Listing the
// id-array directly would re-run on every render because the array
// identity changes even when the contents don't.
// eslint-disable-next-line react-hooks/exhaustive-deps
}, [onlineSignature, toItem]);
// Live: prepend ACTIVITY_LOGGED events as they arrive.
useSocketEvent((msg) => {
if (msg.event !== "ACTIVITY_LOGGED") return;
const payload = msg.payload as Partial<ActivityRecord> | undefined;
if (!payload || !payload.id) return;
const rec: ActivityRecord = {
id: payload.id,
workspace_id: payload.workspace_id ?? msg.workspace_id ?? "",
activity_type: payload.activity_type ?? "a2a",
source_id: payload.source_id ?? null,
target_id: payload.target_id ?? null,
summary: payload.summary ?? null,
status: payload.status ?? "ok",
duration_ms: payload.duration_ms ?? null,
created_at: payload.created_at ?? new Date().toISOString(),
};
setItems((prev) => [toItem(rec), ...prev.filter((x) => x.id !== rec.id)].slice(0, RENDER_CAP));
});
const filtered = useMemo(
() => items.filter((c) => filter === "all" || c.status === "err"),
[items, filter],
);
const errCount = useMemo(() => items.filter((c) => c.status === "err").length, [items]);
return (
<div
style={{
height: "100%",
overflow: "auto",
background: p.bg,
paddingBottom: 96,
fontFamily: MOBILE_FONT_SANS,
}}
>
<div style={{ padding: "max(env(safe-area-inset-top), 44px) 16px 8px" }}>
<div
style={{
display: "flex",
alignItems: "center",
justifyContent: "space-between",
marginBottom: 14,
}}
>
<WorkspacePill dark={dark} count={nodes.length} />
{/* Header filter button reserved — the All/Errors chips below
already cover the v1 filter axis. */}
</div>
<div style={{ display: "flex", alignItems: "baseline", justifyContent: "space-between" }}>
<h1
style={{
margin: 0,
fontSize: 32,
fontWeight: 700,
color: p.text,
letterSpacing: "-0.025em",
}}
>
Comms
</h1>
<span
style={{
fontFamily: MOBILE_FONT_MONO,
fontSize: 11,
color: p.text3,
}}
>
{items.length} events
</span>
</div>
<p style={{ margin: "4px 0 0", fontSize: 13.5, color: p.text2 }}>
Live A2A traffic across the workspace.
</p>
</div>
<div style={{ display: "flex", gap: 6, padding: "12px 16px 8px" }}>
{(
[
{ id: "all", label: "All", n: items.length },
{ id: "errors", label: "Errors", n: errCount },
] as const
).map((o) => {
const on = filter === o.id;
return (
<button
key={o.id}
type="button"
onClick={() => setFilter(o.id)}
style={{
display: "inline-flex",
alignItems: "center",
gap: 6,
padding: "7px 12px",
borderRadius: 999,
cursor: "pointer",
background: on ? p.text : dark ? "#22211c" : "#fff",
color: on ? (dark ? p.bg : "#fff") : p.text,
border: `0.5px solid ${on ? "transparent" : p.border}`,
fontSize: 13,
fontWeight: 500,
}}
>
{o.label}
<span
style={{
fontSize: 10.5,
opacity: 0.7,
fontFamily: MOBILE_FONT_MONO,
}}
>
{o.n}
</span>
</button>
);
})}
</div>
<SectionLabel dark={dark}>Communications</SectionLabel>
<div style={{ padding: "0 14px", display: "flex", flexDirection: "column", gap: 8 }}>
{loading && items.length === 0 ? (
<div style={{ padding: "30px 4px", textAlign: "center", color: p.text3, fontSize: 13 }}>
Loading recent comms
</div>
) : filtered.length === 0 ? (
<div style={{ padding: "30px 4px", textAlign: "center", color: p.text3, fontSize: 13 }}>
No A2A traffic yet.
</div>
) : (
filtered.map((c) => <CommRow key={c.id} c={c} dark={dark} />)
)}
</div>
</div>
);
}
function CommRow({ c, dark }: { c: CommItem; dark: boolean }) {
const p = usePalette(dark);
const isErr = c.status === "err";
return (
<div
style={{
background: p.surface,
borderRadius: 14,
border: `0.5px solid ${p.border}`,
padding: "12px 14px",
display: "flex",
flexDirection: "column",
gap: 6,
}}
>
<div
style={{
display: "flex",
alignItems: "center",
gap: 8,
fontSize: 12,
fontWeight: 600,
color: p.text,
}}
>
<span
style={{
padding: "1px 6px",
borderRadius: 4,
background: isErr ? "#f5dad2" : "#dde9e1",
color: isErr ? "#a8341a" : p.greenInk,
fontFamily: MOBILE_FONT_MONO,
fontSize: 9,
fontWeight: 700,
letterSpacing: "0.06em",
}}
>
{isErr ? "ERR" : "OK"}
</span>
<span
style={{
overflow: "hidden",
textOverflow: "ellipsis",
whiteSpace: "nowrap",
maxWidth: 110,
}}
>
{c.from}
</span>
<span style={{ color: p.text3, fontWeight: 500 }}></span>
<span
style={{
overflow: "hidden",
textOverflow: "ellipsis",
whiteSpace: "nowrap",
maxWidth: 110,
}}
>
{c.to}
</span>
<span
style={{
marginLeft: "auto",
fontSize: 10.5,
color: p.text3,
fontFamily: MOBILE_FONT_MONO,
}}
>
{c.ago}
</span>
</div>
<div
style={{
fontSize: 11,
color: p.text3,
fontWeight: 600,
fontFamily: MOBILE_FONT_MONO,
letterSpacing: "0.02em",
}}
>
{c.kind}
{c.durationMs != null && (
<span style={{ marginLeft: 8, color: isErr ? "#a8341a" : p.text3 }}>{c.durationMs}ms</span>
)}
</div>
{c.summary && (
<div
style={{
fontSize: 12.5,
color: p.text2,
lineHeight: 1.4,
overflowWrap: "anywhere",
}}
>
{c.summary}
</div>
)}
</div>
);
}
@@ -0,0 +1,589 @@
"use client";
// 03 · Agent detail — pills + tabbed content (Overview/Activity/Config/Memory).
import { useEffect, useState } from "react";
import { api } from "@/lib/api";
import { useCanvasStore } from "@/store/canvas";
import { RemoteBadge, toMobileAgent } from "./components";
import { MOBILE_FONT_MONO, MOBILE_FONT_SANS, type MobilePalette, usePalette } from "./palette";
import { Icons, StatusDot, TierChip } from "./primitives";
type TabId = "overview" | "activity" | "config" | "memory";
const TABS: { id: TabId; label: string }[] = [
{ id: "overview", label: "Overview" },
{ id: "activity", label: "Activity" },
{ id: "config", label: "Config" },
{ id: "memory", label: "Memory" },
];
export function MobileDetail({
agentId,
dark,
onBack,
onChat,
}: {
agentId: string;
dark: boolean;
onBack: () => void;
onChat: () => void;
}) {
const p = usePalette(dark);
const node = useCanvasStore((s) => s.nodes.find((n) => n.id === agentId));
const [tab, setTab] = useState<TabId>("overview");
if (!node) {
return (
<div
style={{
height: "100%",
background: p.bg,
display: "flex",
alignItems: "center",
justifyContent: "center",
color: p.text3,
fontSize: 13,
fontFamily: MOBILE_FONT_SANS,
}}
>
Agent not found.
</div>
);
}
const a = toMobileAgent(node);
return (
<div
style={{
height: "100%",
overflow: "auto",
background: p.bg,
paddingBottom: 96,
fontFamily: MOBILE_FONT_SANS,
}}
>
{/* Top bar */}
<div
style={{
position: "sticky",
top: 0,
zIndex: 10,
padding: "max(env(safe-area-inset-top), 44px) 14px 0",
background: p.bg,
}}
>
<div style={{ display: "flex", alignItems: "center", justifyContent: "space-between" }}>
<button
type="button"
onClick={onBack}
aria-label="Back"
style={iconButtonStyle(p, dark)}
>
{Icons.back({ size: 18 })}
</button>
<button type="button" aria-label="More" style={iconButtonStyle(p, dark)}>
{Icons.more({ size: 18 })}
</button>
</div>
</div>
{/* Hero */}
<div style={{ padding: "20px 20px 16px" }}>
<div style={{ display: "flex", alignItems: "center", gap: 10, marginBottom: 8 }}>
<StatusDot status={a.status} size={10} dark={dark} />
<span
style={{
fontFamily: MOBILE_FONT_MONO,
fontSize: 11,
color: p.greenInk,
fontWeight: 600,
letterSpacing: "0.04em",
textTransform: "uppercase",
}}
>
{a.status}
</span>
{a.remote && <RemoteBadge palette={p} />}
</div>
<h1
style={{
margin: 0,
fontSize: 28,
fontWeight: 700,
color: p.text,
letterSpacing: "-0.02em",
}}
>
{a.name}
</h1>
<p
style={{
margin: "6px 0 0",
fontSize: 14,
color: p.text2,
fontFamily: MOBILE_FONT_MONO,
}}
>
{a.tag}
</p>
</div>
{/* Stat pills */}
<div
style={{
display: "flex",
gap: 6,
padding: "0 16px 16px",
overflowX: "auto",
scrollbarWidth: "none",
}}
>
<PillStat label="TIER" value={a.tier} accent={p.t4Ink} dark={dark} chip="tier" />
<PillStat label="RUNTIME" value={a.runtime} dark={dark} />
<PillStat label="SKILLS" value={a.skills} dark={dark} />
<PillStat label="STATUS" value={a.status} accent={p.online} dark={dark} dot />
</div>
{/* Description card */}
{a.desc && (
<div style={{ padding: "0 14px" }}>
<div
style={{
background: p.surface,
borderRadius: 16,
border: `0.5px solid ${p.border}`,
padding: "14px 16px",
}}
>
<p style={{ margin: 0, fontSize: 14.5, lineHeight: 1.5, color: p.text }}>{a.desc}</p>
</div>
</div>
)}
{/* Tabs */}
<div
style={{
display: "flex",
gap: 4,
padding: "20px 14px 10px",
overflowX: "auto",
scrollbarWidth: "none",
}}
>
{TABS.map((t) => {
const on = tab === t.id;
return (
<button
key={t.id}
type="button"
onClick={() => setTab(t.id)}
style={{
padding: "8px 14px",
borderRadius: 999,
border: "none",
cursor: "pointer",
background: on ? p.text : "transparent",
color: on ? (dark ? p.bg : "#fff") : p.text2,
fontSize: 13,
fontWeight: 600,
whiteSpace: "nowrap",
}}
>
{t.label}
</button>
);
})}
</div>
{/* Tab content */}
<div style={{ padding: "0 14px" }}>
{tab === "overview" && <DetailOverview a={a} dark={dark} />}
{tab === "activity" && <DetailActivity workspaceId={a.id} dark={dark} />}
{tab === "config" && <DetailConfig a={a} dark={dark} />}
{tab === "memory" && <DetailMemory dark={dark} />}
</div>
{/* Chat CTA */}
<div style={{ position: "absolute", left: 14, right: 14, bottom: 92, zIndex: 28 }}>
<button
type="button"
onClick={onChat}
style={{
width: "100%",
height: 52,
borderRadius: 16,
cursor: "pointer",
background: p.text,
color: dark ? p.bg : "#fff",
border: "none",
fontSize: 15,
fontWeight: 600,
display: "flex",
alignItems: "center",
justifyContent: "center",
gap: 10,
boxShadow: "0 8px 22px rgba(40,30,20,0.22)",
}}
>
{Icons.chat({ size: 18 })} Open chat
</button>
</div>
</div>
);
}
function iconButtonStyle(p: MobilePalette, dark: boolean) {
return {
width: 36,
height: 36,
borderRadius: 999,
cursor: "pointer",
background: dark ? "#22211c" : "#fff",
border: `0.5px solid ${p.border}`,
display: "flex",
alignItems: "center",
justifyContent: "center",
color: p.text2,
} as const;
}
function PillStat({
label,
value,
accent,
dark,
dot,
chip,
}: {
label: string;
value: string | number;
accent?: string;
dark: boolean;
dot?: boolean;
chip?: "tier";
}) {
const p = usePalette(dark);
const active = !!accent;
return (
<div
style={{
display: "inline-flex",
alignItems: "center",
gap: 7,
padding: "7px 12px",
borderRadius: 999,
flexShrink: 0,
background: active ? `${accent}1a` : dark ? "#22211c" : "#fff",
border: `0.5px solid ${active ? `${accent}40` : p.border}`,
}}
>
<span
style={{
fontSize: 9.5,
color: active ? accent : p.text3,
fontFamily: MOBILE_FONT_MONO,
letterSpacing: "0.06em",
textTransform: "uppercase",
fontWeight: 600,
}}
>
{label}
</span>
{dot && <StatusDot status="online" size={6} dark={dark} halo={false} />}
{chip === "tier" ? (
<TierChip tier={value as "T1" | "T2" | "T3" | "T4"} dark={dark} />
) : (
<span
style={{
fontSize: 12,
color: active ? accent : p.text,
fontWeight: 600,
textTransform: label === "STATUS" ? "capitalize" : "none",
}}
>
{value}
</span>
)}
</div>
);
}
function DetailOverview({
a,
dark,
}: {
a: ReturnType<typeof toMobileAgent>;
dark: boolean;
}) {
const p = usePalette(dark);
const Row = ({ k, v, mono = true }: { k: string; v: string; mono?: boolean }) => (
<div
style={{
display: "flex",
alignItems: "center",
justifyContent: "space-between",
padding: "10px 0",
borderBottom: `0.5px solid ${p.divider}`,
}}
>
<span
style={{
fontSize: 11.5,
color: p.text3,
letterSpacing: "0.04em",
fontFamily: MOBILE_FONT_MONO,
textTransform: "uppercase",
}}
>
{k}
</span>
<span
style={{
fontSize: 13,
color: p.text,
fontWeight: 500,
fontFamily: mono ? MOBILE_FONT_MONO : "inherit",
maxWidth: "60%",
overflow: "hidden",
textOverflow: "ellipsis",
whiteSpace: "nowrap",
}}
>
{v}
</span>
</div>
);
return (
<div
style={{
background: p.surface,
borderRadius: 16,
padding: "4px 16px",
border: `0.5px solid ${p.border}`,
}}
>
<Row k="ID" v={a.id} />
<Row k="Tier" v={a.tier} />
<Row k="Runtime" v={a.runtime} />
<Row k="Active tasks" v={String(a.calls)} />
<Row k="Skills" v={`${a.skills} loaded`} />
<Row k="Origin" v={a.remote ? "remote" : "platform"} />
</div>
);
}
interface ActivityRecord {
id: string;
activity_type: string;
status: string;
summary: string | null;
duration_ms: number | null;
created_at: string;
}
function DetailActivity({ workspaceId, dark }: { workspaceId: string; dark: boolean }) {
const p = usePalette(dark);
const [items, setItems] = useState<ActivityRecord[] | null>(null);
const [error, setError] = useState<string | null>(null);
useEffect(() => {
let cancelled = false;
setError(null);
setItems(null);
api
.get<ActivityRecord[]>(`/workspaces/${workspaceId}/activity?limit=12`)
.then((rows) => {
if (!cancelled) setItems(rows);
})
.catch((e: unknown) => {
if (!cancelled) {
setError(e instanceof Error ? e.message : "Failed to load activity");
setItems([]);
}
});
return () => {
cancelled = true;
};
}, [workspaceId]);
if (items === null) {
return (
<div
style={{
background: p.surface,
borderRadius: 16,
padding: "20px 16px",
border: `0.5px solid ${p.border}`,
color: p.text3,
fontSize: 13,
}}
>
Loading activity
</div>
);
}
if (items.length === 0) {
return (
<div
style={{
background: p.surface,
borderRadius: 16,
padding: "20px 16px",
border: `0.5px solid ${p.border}`,
color: p.text3,
fontSize: 13,
}}
>
{error ?? "No recent activity. New events appear here as the agent reports them."}
</div>
);
}
return (
<div
style={{
background: p.surface,
borderRadius: 16,
padding: "6px 16px",
border: `0.5px solid ${p.border}`,
}}
>
{items.map((it, i) => {
const ts = new Date(it.created_at);
const label = isNaN(ts.getTime())
? ""
: ts.toLocaleTimeString([], { hour: "numeric", minute: "2-digit" });
const isErr = it.status === "error" || it.status === "err";
return (
<div
key={it.id}
style={{
display: "flex",
gap: 12,
padding: "12px 0",
borderBottom: i < items.length - 1 ? `0.5px solid ${p.divider}` : "none",
}}
>
<span
style={{
fontSize: 11,
color: p.text3,
paddingTop: 2,
width: 48,
fontFamily: MOBILE_FONT_MONO,
flexShrink: 0,
}}
>
{label}
</span>
<div style={{ flex: 1, minWidth: 0 }}>
<div
style={{
display: "flex",
alignItems: "center",
gap: 6,
fontSize: 11,
color: p.text3,
fontFamily: MOBILE_FONT_MONO,
letterSpacing: "0.02em",
marginBottom: 2,
}}
>
<span
style={{
padding: "1px 5px",
borderRadius: 4,
background: isErr ? "#f5dad2" : "#dde9e1",
color: isErr ? "#a8341a" : p.greenInk,
fontSize: 9,
fontWeight: 700,
letterSpacing: "0.06em",
}}
>
{isErr ? "ERR" : "OK"}
</span>
<span>{it.activity_type}</span>
{it.duration_ms != null && <span>· {it.duration_ms}ms</span>}
</div>
{it.summary && (
<span
style={{
fontSize: 13.5,
color: p.text,
lineHeight: 1.45,
overflowWrap: "anywhere",
}}
>
{it.summary}
</span>
)}
</div>
</div>
);
})}
</div>
);
}
function DetailConfig({
a,
dark,
}: {
a: ReturnType<typeof toMobileAgent>;
dark: boolean;
}) {
const p = usePalette(dark);
const cfg = JSON.stringify(
{
tier: a.tier,
runtime: a.runtime,
skills: a.skills,
remote: a.remote,
},
null,
2,
);
return (
<pre
style={{
background: dark ? "#0f0e0a" : "#fff",
borderRadius: 16,
padding: "14px 16px",
border: `0.5px solid ${p.border}`,
fontFamily: MOBILE_FONT_MONO,
fontSize: 11.5,
lineHeight: 1.55,
color: p.text2,
margin: 0,
overflow: "auto",
whiteSpace: "pre-wrap",
}}
>
{cfg}
</pre>
);
}
function DetailMemory({ dark }: { dark: boolean }) {
const p = usePalette(dark);
return (
<div
style={{
background: p.surface,
borderRadius: 16,
padding: "14px 16px",
border: `0.5px solid ${p.border}`,
fontSize: 13,
color: p.text2,
lineHeight: 1.5,
}}
>
<span style={{ color: p.text }}>Ephemeral session.</span> Memory clears on workspace
restart. Open the desktop canvas for the full memory inspector.
</div>
);
}
+208
View File
@@ -0,0 +1,208 @@
"use client";
// 01 · Workspace home — agent list + filter chips + FAB.
// Mirrors design/screen-home.jsx, swapped to live store data.
import { useMemo, useState } from "react";
import { useCanvasStore } from "@/store/canvas";
import {
type AgentFilter,
AgentCard,
FilterChips,
WorkspacePill,
classifyForFilter,
toMobileAgent,
} from "./components";
import { MOBILE_FONT_MONO, MOBILE_FONT_SANS, usePalette } from "./palette";
import { Icons, SectionLabel } from "./primitives";
export function MobileHome({
dark,
density,
onOpen,
onSpawn,
workspaceLabel = "Default",
username,
}: {
dark: boolean;
density: "compact" | "regular";
onOpen: (agentId: string) => void;
onSpawn: () => void;
workspaceLabel?: string;
username?: string;
}) {
const p = usePalette(dark);
const nodes = useCanvasStore((s) => s.nodes);
const agents = useMemo(() => nodes.map(toMobileAgent), [nodes]);
const [filter, setFilter] = useState<AgentFilter>("all");
const counts = useMemo(() => {
const c = { all: agents.length, online: 0, issue: 0, paused: 0 };
for (const a of agents) {
const bucket = classifyForFilter(a.status);
if (bucket !== "all") c[bucket]++;
}
return c;
}, [agents]);
const filtered = useMemo(
() => agents.filter((a) => filter === "all" || classifyForFilter(a.status) === filter),
[agents, filter],
);
const compact = density === "compact";
const rootCount = useMemo(
() => agents.filter((a) => !a.parentId).length,
[agents],
);
return (
<div
style={{
height: "100%",
overflow: "auto",
background: p.bg,
paddingBottom: 96,
fontFamily: MOBILE_FONT_SANS,
}}
>
{/* Sticky header */}
<div
style={{
position: "sticky",
top: 0,
zIndex: 10,
background: `linear-gradient(${p.bg} 60%, ${p.bg}00)`,
padding: "max(env(safe-area-inset-top), 44px) 16px 8px",
}}
>
<div
style={{
display: "flex",
alignItems: "center",
justifyContent: "space-between",
marginBottom: 14,
}}
>
<WorkspacePill dark={dark} count={agents.length} />
{/* Search button reserved — wire to a mobile SearchDialog in v1.1. */}
</div>
<div
style={{
display: "flex",
alignItems: "baseline",
justifyContent: "space-between",
marginBottom: 4,
}}
>
<h1
style={{
margin: 0,
fontSize: 32,
fontWeight: 700,
color: p.text,
letterSpacing: "-0.025em",
}}
>
Agents
</h1>
{username && (
<span
style={{
fontFamily: MOBILE_FONT_MONO,
fontSize: 11,
color: p.text3,
letterSpacing: "0.04em",
}}
>
{username}
</span>
)}
</div>
<p style={{ margin: "0 0 14px", fontSize: 13.5, color: p.text2 }}>
{rootCount} workspace{rootCount === 1 ? "" : "s"} · live
</p>
</div>
<FilterChips value={filter} onChange={setFilter} dark={dark} counts={counts} />
<SectionLabel
dark={dark}
right={
<span
style={{
color: p.text3,
fontSize: 10.5,
letterSpacing: "0.04em",
textTransform: "none",
}}
>
{filtered.length}/{agents.length}
</span>
}
>
Workspace · {workspaceLabel}
</SectionLabel>
<div
style={{
display: "flex",
flexDirection: "column",
gap: 8,
padding: "0 14px",
}}
>
{filtered.length === 0 ? (
<div
style={{
padding: "40px 8px",
textAlign: "center",
color: p.text3,
fontSize: 13,
}}
>
No agents match this filter.
</div>
) : (
filtered.map((a) => (
<AgentCard
key={a.id}
agent={a}
dark={dark}
compact={compact}
onClick={() => onOpen(a.id)}
/>
))
)}
</div>
{/* Spawn FAB */}
<button
type="button"
onClick={onSpawn}
aria-label="Spawn new agent"
style={{
position: "absolute",
right: 24,
bottom: 100,
zIndex: 25,
width: 54,
height: 54,
borderRadius: 999,
border: "none",
cursor: "pointer",
background: p.text,
color: dark ? p.bg : "#fff",
display: "flex",
alignItems: "center",
justifyContent: "center",
boxShadow: "0 8px 24px rgba(40,30,20,0.25), 0 2px 6px rgba(40,30,20,0.15)",
}}
>
{Icons.plus({ size: 22 })}
</button>
</div>
);
}
+194
View File
@@ -0,0 +1,194 @@
"use client";
// "Me" tab — the prototype design didn't ship a Me screen, so this is
// the natural mobile home for theme + accent + density preferences
// (the prototype's floating Tweaks panel collapses into this tab here).
import { useTheme, type ThemePreference } from "@/lib/theme-provider";
import { MOBILE_FONT_MONO, MOBILE_FONT_SANS, type MobilePalette, usePalette } from "./palette";
import { SectionLabel } from "./primitives";
const ACCENTS = ["#2f9e6a", "#3b6fe0", "#7a4dd1", "#d97757", "#1f8a8a"] as const;
export function MobileMe({
dark,
accent,
setAccent,
density,
setDensity,
}: {
dark: boolean;
accent: string;
setAccent: (v: string) => void;
density: "compact" | "regular";
setDensity: (v: "compact" | "regular") => void;
}) {
const p = usePalette(dark);
const { theme, setTheme } = useTheme();
return (
<div
style={{
height: "100%",
overflow: "auto",
background: p.bg,
paddingBottom: 96,
fontFamily: MOBILE_FONT_SANS,
}}
>
<div style={{ padding: "max(env(safe-area-inset-top), 44px) 20px 8px" }}>
<h1
style={{
margin: 0,
fontSize: 32,
fontWeight: 700,
color: p.text,
letterSpacing: "-0.025em",
}}
>
Me
</h1>
<p style={{ margin: "4px 0 0", fontSize: 13.5, color: p.text2 }}>
Theme, accent, and layout density.
</p>
</div>
<SectionLabel dark={dark}>Theme</SectionLabel>
<div style={{ padding: "0 14px" }}>
<Card palette={p}>
<SegmentedRow
options={[
{ id: "system", label: "System" },
{ id: "light", label: "Light" },
{ id: "dark", label: "Dark" },
]}
value={theme}
onChange={(v) => setTheme(v as ThemePreference)}
palette={p}
dark={dark}
/>
</Card>
</div>
<SectionLabel dark={dark}>Accent</SectionLabel>
<div style={{ padding: "0 14px" }}>
<Card palette={p}>
<div style={{ display: "flex", gap: 12, padding: "12px 4px", flexWrap: "wrap" }}>
{ACCENTS.map((c) => {
const on = c === accent;
return (
<button
key={c}
type="button"
onClick={() => setAccent(c)}
aria-label={`Set accent ${c}`}
style={{
width: 36,
height: 36,
borderRadius: 999,
cursor: "pointer",
background: c,
border: on ? `2px solid ${p.text}` : "2px solid transparent",
boxShadow: on ? `0 0 0 2px ${p.bg} inset` : "none",
}}
/>
);
})}
</div>
</Card>
</div>
<SectionLabel dark={dark}>Density</SectionLabel>
<div style={{ padding: "0 14px" }}>
<Card palette={p}>
<SegmentedRow
options={[
{ id: "regular", label: "Regular" },
{ id: "compact", label: "Compact" },
]}
value={density}
onChange={(v) => setDensity(v as "regular" | "compact")}
palette={p}
dark={dark}
/>
</Card>
</div>
<div
style={{
padding: "24px 20px",
fontFamily: MOBILE_FONT_MONO,
fontSize: 11,
color: p.text3,
letterSpacing: "0.04em",
}}
>
Mobile design preview · v0.1
</div>
</div>
);
}
function Card({
palette,
children,
}: {
palette: MobilePalette;
children: React.ReactNode;
}) {
return (
<div
style={{
background: palette.surface,
borderRadius: 16,
border: `0.5px solid ${palette.border}`,
padding: "4px 14px",
}}
>
{children}
</div>
);
}
function SegmentedRow({
options,
value,
onChange,
palette,
dark,
}: {
options: { id: string; label: string }[];
value: string;
onChange: (v: string) => void;
palette: MobilePalette;
dark: boolean;
}) {
return (
<div style={{ display: "flex", gap: 6, padding: "10px 0" }}>
{options.map((o) => {
const on = o.id === value;
return (
<button
key={o.id}
type="button"
onClick={() => onChange(o.id)}
style={{
flex: 1,
padding: "10px 8px",
borderRadius: 10,
cursor: "pointer",
background: on ? palette.text : "transparent",
color: on ? (dark ? palette.bg : "#fff") : palette.text,
border: `1px solid ${on ? "transparent" : palette.border}`,
fontSize: 13,
fontWeight: 600,
}}
>
{o.label}
</button>
);
})}
</div>
);
}
@@ -0,0 +1,429 @@
"use client";
// 06 · Spawn agent — bottom-sheet flow.
// Fetches /templates so the user picks from what's actually installed
// on this platform (no hardcoded ID guesswork). Posts to /workspaces
// with the same shape useTemplateDeploy uses. Skips the secret-key
// preflight — if a deploy needs missing keys, the API surfaces the
// error and we show it with a hint to fall through to the desktop
// dialog (which has the full preflight + key-import flow).
import { useEffect, useState } from "react";
import { api } from "@/lib/api";
import { type Template } from "@/lib/deploy-preflight";
import { tierCode } from "./palette";
import { MOBILE_FONT_MONO, MOBILE_FONT_SANS, type MobilePalette, usePalette } from "./palette";
import { Icons, SectionLabel, TierChip } from "./primitives";
const TIER_LABEL: Record<"T1" | "T2" | "T3" | "T4", string> = {
T1: "Sandboxed",
T2: "Standard",
T3: "Privileged",
T4: "Full Access",
};
export function MobileSpawn({ dark, onClose }: { dark: boolean; onClose: () => void }) {
const p = usePalette(dark);
const [templates, setTemplates] = useState<Template[]>([]);
const [loadingTemplates, setLoadingTemplates] = useState(true);
const [tplId, setTplId] = useState<string | null>(null);
const [tier, setTier] = useState<"T1" | "T2" | "T3" | "T4">("T2");
const [name, setName] = useState("");
const [busy, setBusy] = useState(false);
const [error, setError] = useState<string | null>(null);
useEffect(() => {
let cancelled = false;
api
.get<Template[]>("/templates")
.then((list) => {
if (cancelled) return;
setTemplates(list);
if (list.length > 0) {
setTplId(list[0].id);
setTier(tierCode(list[0].tier));
}
})
.catch(() => {
if (!cancelled) setTemplates([]);
})
.finally(() => {
if (!cancelled) setLoadingTemplates(false);
});
return () => {
cancelled = true;
};
}, []);
const handleSpawn = async () => {
if (busy || !tplId) return;
const chosen = templates.find((t) => t.id === tplId);
if (!chosen) return;
setError(null);
setBusy(true);
try {
await api.post<{ id: string }>("/workspaces", {
name: (name.trim() || chosen.name),
template: chosen.id,
tier: Number(tier.slice(1)),
canvas: {
x: Math.random() * 400 + 100,
y: Math.random() * 300 + 100,
},
});
onClose();
} catch (e) {
setError(
e instanceof Error
? `${e.message}. If this template needs missing API keys, use the desktop palette to import them.`
: "Spawn failed",
);
} finally {
setBusy(false);
}
};
return (
<div
role="dialog"
aria-modal="true"
aria-label="Spawn agent"
style={{
position: "absolute",
inset: 0,
zIndex: 100,
background: "rgba(20,15,10,0.42)",
backdropFilter: "blur(4px)",
display: "flex",
alignItems: "flex-end",
fontFamily: MOBILE_FONT_SANS,
}}
onClick={(e) => {
// Click on the dim backdrop closes the sheet.
if (e.target === e.currentTarget) onClose();
}}
>
<div
style={{
width: "100%",
background: p.bg,
borderRadius: "24px 24px 0 0",
maxHeight: "88%",
overflow: "auto",
boxShadow: "0 -10px 40px rgba(0,0,0,0.18)",
}}
>
<Grabber palette={p} />
{/* Header */}
<div
style={{
display: "flex",
alignItems: "center",
justifyContent: "space-between",
padding: "6px 18px 10px",
}}
>
<div>
<h2
style={{
margin: 0,
fontSize: 22,
fontWeight: 700,
color: p.text,
letterSpacing: "-0.02em",
}}
>
Spawn Agent
</h2>
<p style={{ margin: "2px 0 0", fontSize: 12.5, color: p.text2 }}>
In workspace · Default
</p>
</div>
<button
type="button"
onClick={onClose}
aria-label="Close"
style={{
width: 32,
height: 32,
borderRadius: 999,
cursor: "pointer",
background: dark ? "#22211c" : "#fff",
border: `0.5px solid ${p.border}`,
color: p.text2,
display: "flex",
alignItems: "center",
justifyContent: "center",
}}
>
{Icons.close({ size: 16 })}
</button>
</div>
{/* Templates */}
<SectionLabel dark={dark}>Template</SectionLabel>
<div style={{ padding: "0 14px" }}>
{loadingTemplates ? (
<div
style={{
padding: "24px 8px",
textAlign: "center",
color: p.text3,
fontSize: 13,
}}
>
Loading templates
</div>
) : templates.length === 0 ? (
<div
style={{
padding: "16px 14px",
background: p.surface,
borderRadius: 14,
border: `0.5px solid ${p.border}`,
color: p.text2,
fontSize: 13,
lineHeight: 1.45,
}}
>
No templates installed on this platform yet. Open the desktop canvas
and use the template palette to import one (Claude Code, Hermes, or
an org template), then come back here to spawn.
</div>
) : (
<div
style={{
display: "grid",
gridTemplateColumns: "1fr 1fr",
gap: 8,
}}
>
{templates.map((t) => {
const on = tplId === t.id;
const tCode = tierCode(t.tier);
return (
<button
key={t.id}
type="button"
onClick={() => {
setTplId(t.id);
setTier(tCode);
}}
style={{
background: on
? dark
? "#2a2823"
: "#fff"
: dark
? "#1d1c17"
: "#fbf9f4",
border: `1px solid ${on ? p.accent : p.border}`,
borderRadius: 14,
padding: "12px 12px",
textAlign: "left",
cursor: "pointer",
display: "flex",
flexDirection: "column",
gap: 4,
position: "relative",
}}
>
<div
style={{
display: "flex",
alignItems: "center",
justifyContent: "space-between",
gap: 6,
}}
>
<span
style={{
fontSize: 13.5,
fontWeight: 600,
color: p.text,
overflow: "hidden",
textOverflow: "ellipsis",
whiteSpace: "nowrap",
}}
>
{t.name}
</span>
<TierChip tier={tCode} dark={dark} />
</div>
{t.description && (
<span
style={{
fontSize: 11.5,
color: p.text2,
lineHeight: 1.35,
display: "-webkit-box",
WebkitLineClamp: 2,
WebkitBoxOrient: "vertical",
overflow: "hidden",
}}
>
{t.description}
</span>
)}
{on && (
<span
style={{
position: "absolute",
top: 8,
right: 8,
width: 16,
height: 16,
borderRadius: 999,
background: p.accent,
color: "#fff",
display: "flex",
alignItems: "center",
justifyContent: "center",
}}
>
{Icons.check({ size: 10, sw: 2.5 })}
</span>
)}
</button>
);
})}
</div>
)}
</div>
{/* Name */}
<SectionLabel dark={dark}>Name</SectionLabel>
<div style={{ padding: "0 14px" }}>
<input
value={name}
onChange={(e) => setName(e.target.value)}
placeholder={tplId
? (templates.find((t) => t.id === tplId)?.name ?? "agent-name")
: "agent-name"}
style={{
width: "100%",
padding: "12px 14px",
background: dark ? "#22211c" : "#fff",
border: `0.5px solid ${p.border}`,
borderRadius: 12,
fontFamily: MOBILE_FONT_MONO,
fontSize: 13.5,
color: p.text,
outline: "none",
boxSizing: "border-box",
}}
/>
</div>
{/* Tier */}
<SectionLabel dark={dark}>Permission tier</SectionLabel>
<div style={{ padding: "0 14px", display: "flex", gap: 6 }}>
{(["T1", "T2", "T3", "T4"] as const).map((t) => {
const on = tier === t;
return (
<button
key={t}
type="button"
onClick={() => setTier(t)}
style={{
flex: 1,
padding: "10px 8px",
cursor: "pointer",
background: on ? (dark ? "#22211c" : "#fff") : "transparent",
border: `1px solid ${on ? p.accent : p.border}`,
borderRadius: 12,
display: "flex",
flexDirection: "column",
alignItems: "center",
gap: 4,
}}
>
<TierChip tier={t} dark={dark} size="lg" />
<span style={{ fontSize: 10.5, color: p.text2, fontWeight: 500 }}>
{TIER_LABEL[t]}
</span>
</button>
);
})}
</div>
{/* Error */}
{error && (
<div
role="alert"
style={{
margin: "12px 14px 0",
padding: "10px 14px",
background: `${p.failed}1a`,
border: `0.5px solid ${p.failed}40`,
borderRadius: 12,
color: p.failed,
fontSize: 12.5,
lineHeight: 1.4,
}}
>
{error}
</div>
)}
{/* Spawn button */}
<div style={{ padding: "20px 14px max(env(safe-area-inset-bottom), 28px)" }}>
<button
type="button"
onClick={handleSpawn}
disabled={busy || !tplId || templates.length === 0}
style={{
width: "100%",
height: 52,
borderRadius: 16,
border: "none",
cursor: busy ? "wait" : tplId ? "pointer" : "not-allowed",
background: p.text,
color: dark ? p.bg : "#fff",
fontSize: 15,
fontWeight: 600,
display: "flex",
alignItems: "center",
justifyContent: "center",
gap: 10,
boxShadow: "0 8px 22px rgba(40,30,20,0.22)",
opacity: busy || !tplId ? 0.55 : 1,
}}
>
{Icons.zap({ size: 16 })} {busy ? "Spawning…" : "Spawn agent"}
</button>
<p
style={{
margin: "10px 0 0",
textAlign: "center",
fontSize: 11.5,
color: p.text3,
lineHeight: 1.4,
}}
>
Boots in ~3s. Tier {tier} permissions apply on first call.
</p>
</div>
</div>
</div>
);
}
function Grabber({ palette }: { palette: MobilePalette }) {
return (
<div style={{ display: "flex", justifyContent: "center", padding: "8px 0 4px" }}>
<span
style={{
width: 38,
height: 4,
borderRadius: 999,
background: palette.text3,
opacity: 0.4,
}}
/>
</div>
);
}
@@ -0,0 +1,211 @@
// @vitest-environment jsdom
/**
* MobileApp route-state contract.
*
* The mobile shell uses local React state (not URL routing) for
* navigation between the 6 screens. This test pins the back-stack
* shape so a future refactor can't silently regress:
*
* home →(open agent)→ detail
* detail →(open chat)→ chat chat →(back)→ detail
* detail →(back)→ home
*
* home / canvas / comms / me — reachable via the bottom tab bar.
*/
import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
import { cleanup, fireEvent, render, screen } from "@testing-library/react";
beforeEach(() => {
// URL state persists across tests in jsdom — reset to a clean slate
// so each test starts on the home route regardless of what the
// previous test pushed onto the history stack.
window.history.replaceState(null, "", "/");
});
afterEach(() => {
cleanup();
});
// Mock the theme provider — MobileApp reads resolvedTheme to pick a
// palette; for routing we don't care which one, light is fine.
vi.mock("@/lib/theme-provider", () => ({
useTheme: () => ({ theme: "light", resolvedTheme: "light", setTheme: vi.fn() }),
}));
// Stub each screen to a sentinel that exposes the props MobileApp passes
// in. The whole point is to verify the routing handoff, not the screens
// themselves — those have their own tests.
vi.mock("../MobileHome", () => ({
MobileHome: ({ onOpen, onSpawn }: { onOpen: (id: string) => void; onSpawn: () => void }) => (
<div>
<span data-testid="screen">home</span>
<button onClick={() => onOpen("ws-42")}>open-ws-42</button>
<button onClick={onSpawn}>open-spawn</button>
</div>
),
}));
vi.mock("../MobileCanvas", () => ({
MobileCanvas: () => <span data-testid="screen">canvas</span>,
}));
vi.mock("../MobileDetail", () => ({
MobileDetail: ({
agentId,
onBack,
onChat,
}: {
agentId: string;
onBack: () => void;
onChat: () => void;
}) => (
<div>
<span data-testid="screen">detail:{agentId}</span>
<button onClick={onBack}>detail-back</button>
<button onClick={onChat}>detail-open-chat</button>
</div>
),
}));
vi.mock("../MobileChat", () => ({
MobileChat: ({ agentId, onBack }: { agentId: string; onBack: () => void }) => (
<div>
<span data-testid="screen">chat:{agentId}</span>
<button onClick={onBack}>chat-back</button>
</div>
),
}));
vi.mock("../MobileComms", () => ({
MobileComms: () => <span data-testid="screen">comms</span>,
}));
vi.mock("../MobileMe", () => ({
MobileMe: () => <span data-testid="screen">me</span>,
}));
vi.mock("../MobileSpawn", () => ({
MobileSpawn: ({ onClose }: { onClose: () => void }) => (
<div>
<span data-testid="spawn-sheet">spawn</span>
<button onClick={onClose}>spawn-close</button>
</div>
),
}));
// MobileApp's shared TabBar is the user's gateway to the Canvas / Comms /
// Me screens. Rather than depend on its visual icon set we expose a
// label-based stub so the test can call onChange directly.
vi.mock("../components", async () => {
const actual = await vi.importActual<typeof import("../components")>("../components");
type TabId = "agents" | "canvas" | "comms" | "me";
return {
...actual,
TabBar: ({ onChange }: { active: TabId; onChange: (id: TabId) => void }) => (
<div data-testid="tab-bar">
{(["agents", "canvas", "comms", "me"] as const).map((id) => (
<button key={id} onClick={() => onChange(id)}>
tab-{id}
</button>
))}
</div>
),
};
});
import { MobileApp } from "../MobileApp";
const visibleScreen = () =>
Array.from(document.querySelectorAll('[data-testid="screen"]'))
.map((el) => el.textContent ?? "")
.filter(Boolean);
describe("MobileApp — route state", () => {
it("starts on the home screen", () => {
render(<MobileApp />);
expect(visibleScreen()).toEqual(["home"]);
});
it("home → open agent → detail (passes agentId through)", () => {
render(<MobileApp />);
fireEvent.click(screen.getByText("open-ws-42"));
expect(visibleScreen()).toEqual(["detail:ws-42"]);
});
it("detail → open chat → chat (carries the same agentId)", () => {
render(<MobileApp />);
fireEvent.click(screen.getByText("open-ws-42"));
fireEvent.click(screen.getByText("detail-open-chat"));
expect(visibleScreen()).toEqual(["chat:ws-42"]);
});
it("chat back returns to detail (NOT to home — preserves the back-stack)", () => {
render(<MobileApp />);
fireEvent.click(screen.getByText("open-ws-42"));
fireEvent.click(screen.getByText("detail-open-chat"));
fireEvent.click(screen.getByText("chat-back"));
expect(visibleScreen()).toEqual(["detail:ws-42"]);
});
it("detail back returns to home", () => {
render(<MobileApp />);
fireEvent.click(screen.getByText("open-ws-42"));
fireEvent.click(screen.getByText("detail-back"));
expect(visibleScreen()).toEqual(["home"]);
});
it("hides the tab bar on chat (per design — composer reclaims that space)", () => {
render(<MobileApp />);
expect(screen.queryByTestId("tab-bar")).not.toBeNull();
fireEvent.click(screen.getByText("open-ws-42"));
expect(screen.queryByTestId("tab-bar")).not.toBeNull(); // detail
fireEvent.click(screen.getByText("detail-open-chat"));
expect(screen.queryByTestId("tab-bar")).toBeNull(); // chat
});
it("tab bar switches the four primary screens (Agents / Canvas / Comms / Me)", () => {
render(<MobileApp />);
fireEvent.click(screen.getByText("tab-canvas"));
expect(visibleScreen()).toEqual(["canvas"]);
fireEvent.click(screen.getByText("tab-comms"));
expect(visibleScreen()).toEqual(["comms"]);
fireEvent.click(screen.getByText("tab-me"));
expect(visibleScreen()).toEqual(["me"]);
fireEvent.click(screen.getByText("tab-agents"));
expect(visibleScreen()).toEqual(["home"]);
});
it("spawn sheet overlays from anywhere, closes on dismiss", () => {
render(<MobileApp />);
expect(screen.queryByTestId("spawn-sheet")).toBeNull();
fireEvent.click(screen.getByText("open-spawn"));
expect(screen.queryByTestId("spawn-sheet")).not.toBeNull();
fireEvent.click(screen.getByText("spawn-close"));
expect(screen.queryByTestId("spawn-sheet")).toBeNull();
});
it("seeds initial route from ?m= and ?a= so deep links open the right screen", () => {
window.history.replaceState(null, "", "/?m=detail&a=ws-99");
render(<MobileApp />);
expect(visibleScreen()).toEqual(["detail:ws-99"]);
});
it("collapses ?m=detail without ?a to home (detail without an agent is meaningless)", () => {
window.history.replaceState(null, "", "/?m=detail");
render(<MobileApp />);
expect(visibleScreen()).toEqual(["home"]);
});
it("syncs in-app navigation to the URL so browser back leaves the mobile stack", () => {
render(<MobileApp />);
expect(window.location.search).toBe("");
fireEvent.click(screen.getByText("open-ws-42"));
expect(window.location.search).toBe("?m=detail&a=ws-42");
fireEvent.click(screen.getByText("detail-open-chat"));
expect(window.location.search).toBe("?m=chat&a=ws-42");
});
it("popstate (back button) restores the previous route", () => {
render(<MobileApp />);
fireEvent.click(screen.getByText("open-ws-42"));
fireEvent.click(screen.getByText("detail-open-chat"));
// Simulate browser back: rewind URL ourselves, then dispatch popstate.
window.history.replaceState(null, "", "/?m=detail&a=ws-42");
fireEvent.popState(window);
expect(visibleScreen()).toEqual(["detail:ws-42"]);
});
});
@@ -0,0 +1,101 @@
import { describe, expect, it } from "vitest";
import type { Node } from "@xyflow/react";
import { type WorkspaceNodeData } from "@/store/canvas";
import { classifyForFilter, toMobileAgent } from "../components";
const baseData: WorkspaceNodeData = {
name: "test-agent",
status: "online",
tier: 2,
agentCard: null,
activeTasks: 0,
collapsed: false,
role: "",
lastErrorRate: 0,
lastSampleError: "",
url: "",
parentId: null,
currentTask: "",
runtime: "claude-code",
needsRestart: false,
budgetLimit: null,
};
const makeNode = (overrides: Partial<WorkspaceNodeData> = {}, id = "ws-1"): Node<WorkspaceNodeData> => ({
id,
type: "workspaceNode",
position: { x: 0, y: 0 },
data: { ...baseData, ...overrides },
});
describe("toMobileAgent", () => {
it("maps name, status, tier, runtime through the design's 6-key palette", () => {
const a = toMobileAgent(makeNode({ status: "online", tier: 3, runtime: "hermes" }));
expect(a.name).toBe("test-agent");
expect(a.status).toBe("online");
expect(a.tier).toBe("T3");
expect(a.runtime).toBe("hermes");
expect(a.tag).toBe("hermes"); // tag mirrors runtime in v1
});
it("flags 'external' runtime as remote (drives the ★ REMOTE badge)", () => {
expect(toMobileAgent(makeNode({ runtime: "external" })).remote).toBe(true);
expect(toMobileAgent(makeNode({ runtime: "claude-code" })).remote).toBe(false);
});
it("falls back to 'unknown' runtime when both workspace + agentCard are blank", () => {
const a = toMobileAgent(makeNode({ runtime: "" }));
expect(a.runtime).toBe("unknown");
expect(a.tag).toBe("unknown");
});
it("uses workspace id as fallback name when name is missing", () => {
const a = toMobileAgent(makeNode({ name: "" }, "ws-fallback"));
expect(a.name).toBe("ws-fallback");
});
it("preserves the parent link so MobileCanvas can draw parent→child edges", () => {
const a = toMobileAgent(makeNode({ parentId: "ws-parent" }, "ws-child"));
expect(a.parentId).toBe("ws-parent");
});
it("maps platform 'provisioning' to design 'starting'", () => {
expect(toMobileAgent(makeNode({ status: "provisioning" })).status).toBe("starting");
});
it("counts skills from agentCard.skills array", () => {
const a = toMobileAgent(
makeNode({
agentCard: {
skills: [{ name: "skill-a" }, { name: "skill-b" }, { name: "skill-c" }],
},
}),
);
expect(a.skills).toBe(3);
});
it("reports 0 skills when agentCard is null", () => {
expect(toMobileAgent(makeNode({ agentCard: null })).skills).toBe(0);
});
});
describe("classifyForFilter", () => {
it("buckets online statuses to the Online filter", () => {
expect(classifyForFilter("online")).toBe("online");
});
it("buckets failure-state statuses to the Issues filter", () => {
// Issues = anything the user needs to look at NOW.
expect(classifyForFilter("failed")).toBe("issue");
expect(classifyForFilter("degraded")).toBe("issue");
});
it("buckets non-online non-failure statuses to the Paused filter", () => {
// Catch-all for transient or intentional offline states.
expect(classifyForFilter("paused")).toBe("paused");
expect(classifyForFilter("offline")).toBe("paused");
expect(classifyForFilter("starting")).toBe("paused");
});
});
@@ -0,0 +1,68 @@
import { describe, expect, it } from "vitest";
import { MOL_DARK, MOL_LIGHT, getPalette, normalizeStatus, tierCode } from "../palette";
describe("normalizeStatus", () => {
it("passes design-known statuses through verbatim", () => {
expect(normalizeStatus("online")).toBe("online");
expect(normalizeStatus("degraded")).toBe("degraded");
expect(normalizeStatus("failed")).toBe("failed");
expect(normalizeStatus("paused")).toBe("paused");
expect(normalizeStatus("offline")).toBe("offline");
});
it("maps platform 'provisioning' to design 'starting'", () => {
// The platform's 14-state machine collapses to the design's 6 keys.
// 'provisioning' (post-spawn boot) is the same UX bucket as 'starting'.
expect(normalizeStatus("provisioning")).toBe("starting");
expect(normalizeStatus("starting")).toBe("starting");
});
it("maps unknown / null / empty to offline", () => {
expect(normalizeStatus(undefined)).toBe("offline");
expect(normalizeStatus(null)).toBe("offline");
expect(normalizeStatus("")).toBe("offline");
expect(normalizeStatus("garbage-status")).toBe("offline");
});
});
describe("tierCode", () => {
it("maps numeric tiers to T-codes", () => {
expect(tierCode(1)).toBe("T1");
expect(tierCode(2)).toBe("T2");
expect(tierCode(3)).toBe("T3");
expect(tierCode(4)).toBe("T4");
});
it("clamps below-1 to T1 (never below sandboxed)", () => {
expect(tierCode(0)).toBe("T1");
expect(tierCode(-5)).toBe("T1");
});
it("clamps above-4 to T4 (never above full-access)", () => {
expect(tierCode(5)).toBe("T4");
expect(tierCode(99)).toBe("T4");
});
it("falls back to T2 (Standard) on null/undefined", () => {
// T2 is the platform default for fresh agents — matches the
// CreateWorkspaceDialog default. Keeps the mobile spawn UX
// consistent with the desktop when tier metadata is missing.
expect(tierCode(undefined)).toBe("T2");
expect(tierCode(null)).toBe("T2");
});
});
describe("getPalette", () => {
it("returns the light palette when dark is false", () => {
expect(getPalette(false)).toBe(MOL_LIGHT);
});
it("returns the dark palette when dark is true", () => {
expect(getPalette(true)).toBe(MOL_DARK);
});
it("light + dark palettes have the same key set (no drift)", () => {
expect(Object.keys(MOL_LIGHT).sort()).toEqual(Object.keys(MOL_DARK).sort());
});
});
+444
View File
@@ -0,0 +1,444 @@
"use client";
// Screen-shared composites: TabBar, WorkspacePill, AgentCard, FilterChips.
// Mirrors molecules-ai-mobile-app/project/screens-shared.jsx but reads
// from the live canvas store rather than the prototype's mock AGENTS.
import type { Node } from "@xyflow/react";
import { type WorkspaceNodeData, summarizeWorkspaceCapabilities } from "@/store/canvas";
import {
MOBILE_FONT_MONO,
type MobilePalette,
type MobileStatus,
normalizeStatus,
tierCode,
usePalette,
} from "./palette";
import { Icons, StatusDot, TierChip } from "./primitives";
// Derived view-model the mobile screens consume. Built once per render
// from the store's Node<WorkspaceNodeData>.
export interface MobileAgent {
id: string;
name: string;
tag: string;
tier: "T1" | "T2" | "T3" | "T4";
status: MobileStatus;
remote: boolean;
runtime: string;
skills: number;
calls: number;
desc: string;
parentId: string | null;
}
export function toMobileAgent(node: Node<WorkspaceNodeData>): MobileAgent {
const cap = summarizeWorkspaceCapabilities(node.data);
const runtime = cap.runtime ?? "unknown";
const remote = runtime === "external";
return {
id: node.id,
name: node.data.name || node.id,
tag: runtime,
tier: tierCode(node.data.tier),
status: normalizeStatus(node.data.status),
remote,
runtime,
skills: cap.skillCount,
calls: typeof node.data.activeTasks === "number" ? node.data.activeTasks : 0,
desc: node.data.role || cap.currentTask || "",
parentId: node.data.parentId ?? null,
};
}
// ── Tab bar ────────────────────────────────────────────────────
export type MobileTabId = "agents" | "canvas" | "comms" | "me";
export function TabBar({
active,
onChange,
dark,
}: {
active: MobileTabId;
onChange: (id: MobileTabId) => void;
dark: boolean;
}) {
const p = usePalette(dark);
const tabs: { id: MobileTabId; label: string; icon: keyof typeof Icons }[] = [
{ id: "agents", label: "Agents", icon: "list" },
{ id: "canvas", label: "Canvas", icon: "graph" },
{ id: "comms", label: "Comms", icon: "pulse" },
{ id: "me", label: "Me", icon: "user" },
];
return (
<div
style={{
position: "absolute",
left: 14,
right: 14,
bottom: 16,
height: 64,
borderRadius: 26,
zIndex: 30,
background: dark ? "rgba(34,33,28,0.78)" : "rgba(255,253,247,0.82)",
backdropFilter: "blur(24px) saturate(160%)",
WebkitBackdropFilter: "blur(24px) saturate(160%)",
border: `0.5px solid ${p.border}`,
boxShadow: dark
? "0 8px 28px rgba(0,0,0,0.4), inset 0 0.5px 0 rgba(255,255,255,0.05)"
: "0 6px 20px rgba(40,30,20,0.07), 0 1px 0 rgba(255,255,255,0.6) inset",
display: "flex",
alignItems: "center",
justifyContent: "space-around",
padding: "0 10px",
}}
>
{tabs.map((t) => {
const on = active === t.id;
return (
<button
key={t.id}
type="button"
onClick={() => onChange(t.id)}
style={{
background: "none",
border: "none",
cursor: "pointer",
display: "flex",
flexDirection: "column",
alignItems: "center",
gap: 3,
padding: "6px 10px",
minWidth: 56,
color: on ? p.accent : p.text3,
}}
>
<span
style={{
width: 36,
height: 28,
borderRadius: 10,
background: on ? `${p.accent}1a` : "transparent",
display: "flex",
alignItems: "center",
justifyContent: "center",
}}
>
{Icons[t.icon]({ size: 18 })}
</span>
<span
style={{
fontSize: 10,
letterSpacing: "0.02em",
fontWeight: on ? 600 : 500,
}}
>
{t.label}
</span>
</button>
);
})}
</div>
);
}
// ── Workspace pill (header) ────────────────────────────────────
export function WorkspacePill({
dark,
count,
live = true,
}: {
dark: boolean;
count: number | string;
live?: boolean;
}) {
const p = usePalette(dark);
return (
<div
style={{
display: "inline-flex",
alignItems: "center",
gap: 0,
borderRadius: 999,
padding: 4,
background: dark ? "rgba(34,33,28,0.6)" : "rgba(255,255,255,0.7)",
border: `0.5px solid ${p.border}`,
backdropFilter: "blur(12px)",
}}
>
<span
style={{
display: "flex",
alignItems: "center",
gap: 8,
padding: "6px 12px 6px 8px",
borderRight: `0.5px solid ${p.divider}`,
}}
>
<span
style={{
width: 22,
height: 22,
borderRadius: 6,
background: `linear-gradient(135deg, ${p.accent}, ${p.greenInk})`,
display: "flex",
alignItems: "center",
justifyContent: "center",
color: "white",
fontSize: 11,
fontWeight: 700,
}}
>
M
</span>
<span style={{ fontSize: 13.5, fontWeight: 600, color: p.text }}>Molecule AI</span>
</span>
<span
style={{
display: "flex",
alignItems: "center",
gap: 6,
padding: "6px 10px",
fontFamily: MOBILE_FONT_MONO,
fontSize: 11,
color: p.text2,
}}
>
<StatusDot status="online" size={6} dark={dark} />
<span>{count}</span>
</span>
{live && (
<span
style={{
display: "flex",
alignItems: "center",
gap: 5,
padding: "6px 10px 6px 8px",
fontSize: 11,
color: p.greenInk,
fontWeight: 600,
fontFamily: MOBILE_FONT_MONO,
}}
>
<span
style={{
width: 6,
height: 6,
borderRadius: 999,
background: p.online,
boxShadow: `0 0 0 3px ${p.online}26`,
}}
/>
LIVE
</span>
)}
</div>
);
}
// ── Agent row card ─────────────────────────────────────────────
export function AgentCard({
agent,
dark,
onClick,
compact = false,
}: {
agent: MobileAgent;
dark: boolean;
onClick?: () => void;
compact?: boolean;
}) {
const p = usePalette(dark);
const isOnline = agent.status === "online";
const isT4Soft = agent.tier === "T4" && isOnline;
return (
<button
type="button"
onClick={onClick}
style={{
display: "block",
width: "100%",
textAlign: "left",
cursor: "pointer",
background: isT4Soft ? p.t4SoftCard : isOnline ? p.greenSoft : p.surface,
border: `0.5px solid ${p.border}`,
borderRadius: 18,
padding: compact ? "12px 14px" : "14px 16px",
boxShadow: dark
? "none"
: "0 1px 0 rgba(255,255,255,0.5) inset, 0 1px 2px rgba(40,30,20,0.03)",
transition: "transform .12s",
}}
>
<div style={{ display: "flex", alignItems: "center", gap: 10 }}>
<StatusDot status={agent.status} size={9} dark={dark} />
<span
style={{
flex: 1,
fontSize: 16,
fontWeight: 600,
color: p.text,
letterSpacing: "-0.01em",
overflow: "hidden",
textOverflow: "ellipsis",
whiteSpace: "nowrap",
}}
>
{agent.name}
</span>
<TierChip tier={agent.tier} dark={dark} />
</div>
<div
style={{
display: "flex",
alignItems: "center",
gap: 6,
marginTop: 8,
flexWrap: "wrap",
}}
>
{agent.remote && <RemoteBadge palette={p} />}
<span
style={{
fontSize: 10.5,
color: p.text3,
fontFamily: MOBILE_FONT_MONO,
letterSpacing: "0.02em",
}}
>
{agent.tag}
</span>
</div>
{!compact && agent.desc && (
<p
style={{
margin: "8px 0 0",
fontSize: 13,
lineHeight: 1.45,
color: p.text2,
}}
>
{agent.desc}
</p>
)}
{!compact && (
<div
style={{
display: "flex",
alignItems: "center",
gap: 14,
marginTop: 10,
fontSize: 10.5,
color: p.text3,
fontFamily: MOBILE_FONT_MONO,
}}
>
<span>SKILLS {agent.skills}</span>
<span>CALLS {agent.calls}</span>
<span style={{ marginLeft: "auto" }}>{agent.runtime.toUpperCase()}</span>
</div>
)}
</button>
);
}
export function RemoteBadge({ palette }: { palette: MobilePalette }) {
return (
<span
style={{
padding: "2px 7px",
borderRadius: 4,
background: palette.remoteBg,
color: palette.remote,
fontSize: 10,
fontWeight: 700,
letterSpacing: "0.04em",
fontFamily: MOBILE_FONT_MONO,
display: "inline-flex",
alignItems: "center",
gap: 3,
}}
>
REMOTE
</span>
);
}
// ── Filter chips ───────────────────────────────────────────────
export type AgentFilter = "all" | "online" | "issue" | "paused";
export function FilterChips({
value,
onChange,
dark,
counts,
}: {
value: AgentFilter;
onChange: (v: AgentFilter) => void;
dark: boolean;
counts: { all: number; online: number; issue: number; paused: number };
}) {
const p = usePalette(dark);
const opts: { id: AgentFilter; label: string; n: number }[] = [
{ id: "all", label: "All", n: counts.all },
{ id: "online", label: "Online", n: counts.online },
{ id: "issue", label: "Issues", n: counts.issue },
{ id: "paused", label: "Paused", n: counts.paused },
];
return (
<div
style={{
display: "flex",
gap: 6,
padding: "0 16px 10px",
overflowX: "auto",
scrollbarWidth: "none",
}}
>
{opts.map((o) => {
const on = value === o.id;
return (
<button
key={o.id}
type="button"
onClick={() => onChange(o.id)}
style={{
display: "inline-flex",
alignItems: "center",
gap: 6,
padding: "7px 12px",
borderRadius: 999,
cursor: "pointer",
background: on ? p.text : dark ? "#22211c" : "#fff",
color: on ? (dark ? p.bg : "#fff") : p.text,
border: `0.5px solid ${on ? "transparent" : p.border}`,
fontSize: 13,
fontWeight: 500,
whiteSpace: "nowrap",
flexShrink: 0,
}}
>
{o.label}
<span
style={{
fontSize: 10.5,
opacity: 0.7,
fontFamily: MOBILE_FONT_MONO,
}}
>
{o.n}
</span>
</button>
);
})}
</div>
);
}
export function classifyForFilter(status: MobileStatus): AgentFilter {
if (status === "online") return "online";
if (status === "failed" || status === "degraded") return "issue";
return "paused"; // starting / paused / offline
}
@@ -0,0 +1,40 @@
"use client";
// React context for accent overrides + the React-side `usePalette` hook.
// Keeps the pure data (MOL_LIGHT/MOL_DARK) in palette.ts and the
// pure-function `getPalette` available for tests; this file is the
// React-only entry point so mobile components don't have to plumb
// accent through props.
import { createContext, useContext, type ReactNode } from "react";
import { MOL_DARK, MOL_LIGHT, type MobilePalette } from "./palette";
const MobileAccentContext = createContext<string | null>(null);
export function MobileAccentProvider({
accent,
children,
}: {
accent: string | null;
children: ReactNode;
}) {
return <MobileAccentContext.Provider value={accent}>{children}</MobileAccentContext.Provider>;
}
/**
* Hook variant of palette resolution. Reads the user's accent override
* from context and returns a fresh palette object with the override
* applied. Critically, it never mutates the static MOL_LIGHT/MOL_DARK
* singletons — that was the foot-gun the prior version had.
*
* Outside of a `<MobileAccentProvider>`, the context default of `null`
* means we just return the static palette unchanged. That's the right
* behaviour for tests + for any non-mobile caller that imports a token.
*/
export function usePalette(dark: boolean): MobilePalette {
const accent = useContext(MobileAccentContext);
const base = dark ? MOL_DARK : MOL_LIGHT;
if (!accent || accent === base.accent) return base;
return { ...base, accent, online: accent };
}
+147
View File
@@ -0,0 +1,147 @@
// Mobile design system tokens — verbatim from the Claude Design handoff
// (molecules-ai-mobile-app/project/shared.jsx). Kept as an inline-style
// palette object so screens can mirror the design 1:1; theming routes
// through `usePalette(dark)` exactly like the prototype.
export interface MobilePalette {
bg: string;
surface: string;
surface2: string;
border: string;
divider: string;
text: string;
text2: string;
text3: string;
green: string;
greenSoft: string;
greenInk: string;
t1Bg: string; t1Ink: string; t1Br: string;
t2Bg: string; t2Ink: string; t2Br: string;
t3Bg: string; t3Ink: string; t3Br: string;
t4Bg: string; t4Ink: string; t4Br: string;
t4SoftCard: string;
online: string;
starting: string;
degraded: string;
failed: string;
paused: string;
offline: string;
remote: string;
remoteBg: string;
accent: string;
}
export const MOL_LIGHT: MobilePalette = {
bg: "#f6f4ef",
surface: "#ffffff",
surface2: "#fbf9f4",
border: "rgba(40,30,20,0.08)",
divider: "rgba(40,30,20,0.06)",
text: "#29261b",
text2: "rgba(41,38,27,0.62)",
text3: "rgba(41,38,27,0.42)",
green: "#2f9e6a",
greenSoft: "#d9ebe0",
greenInk: "#1f6a47",
t1Bg: "#dde6f1", t1Ink: "#3a6aa3", t1Br: "#b9c8de",
t2Bg: "#dbe5f4", t2Ink: "#2f5fb4", t2Br: "#b1c2e0",
t3Bg: "#e3dcef", t3Ink: "#6a4ba1", t3Br: "#c8b9e1",
t4Bg: "#f5dcc7", t4Ink: "#a8501d", t4Br: "#e8c6a4",
t4SoftCard: "#f9ece0",
online: "#2f9e6a",
starting: "#e9b53b",
degraded: "#d28a2a",
failed: "#c8472a",
paused: "#7a8696",
offline: "#9aa0a6",
remote: "#7a4dd1",
remoteBg: "#ede2ff",
accent: "#2f9e6a",
};
export const MOL_DARK: MobilePalette = {
bg: "#15140f",
surface: "#1d1c17",
surface2: "#22211c",
border: "rgba(255,250,240,0.08)",
divider: "rgba(255,250,240,0.06)",
text: "#f1eee5",
text2: "rgba(241,238,229,0.6)",
text3: "rgba(241,238,229,0.38)",
green: "#3eb37c",
greenSoft: "#1f3a2c",
greenInk: "#7fd3a8",
t1Bg: "#1a2230", t1Ink: "#7ea4d4", t1Br: "#2a3a52",
t2Bg: "#1b2434", t2Ink: "#86a6e2", t2Br: "#2c3c58",
t3Bg: "#251f33", t3Ink: "#b39be0", t3Br: "#3e3450",
t4Bg: "#332316", t4Ink: "#e5a878", t4Br: "#553622",
t4SoftCard: "#2a1f17",
online: "#3eb37c",
starting: "#e9b53b",
degraded: "#d28a2a",
failed: "#d65a3e",
paused: "#8a96a6",
offline: "#6a6a6a",
remote: "#a38aff",
remoteBg: "#2a1f44",
accent: "#3eb37c",
};
/**
* Pure-function variant of palette resolution. No React, no context,
* no mutation — for tests and other non-component code.
*
* Components should import `usePalette` from `./palette-context` so the
* user's accent override (held in context, not in module state) flows
* through automatically. Re-exported below so the existing
* `import { usePalette } from "./palette"` call sites keep working.
*/
export const getPalette = (dark: boolean): MobilePalette => (dark ? MOL_DARK : MOL_LIGHT);
// Back-compat re-export. Once we're confident nothing imports
// `usePalette` from this file we can drop this line.
export { usePalette } from "./palette-context";
// References the CSS variables that next/font/google emits in
// app/layout.tsx. Falls through to system fonts if the variable is
// undefined (e.g. in unit tests with no <body> font class).
export const MOBILE_FONT_SANS = "var(--font-inter), 'Inter', ui-sans-serif, system-ui, sans-serif";
export const MOBILE_FONT_MONO = "var(--font-jetbrains), 'JetBrains Mono', ui-monospace, monospace";
// Status keys we surface in the mobile UI. Anything else from the
// platform falls back to "offline" tinting — the desktop has more
// statuses ("provisioning", etc.) than the design's 6-key palette.
export type MobileStatus =
| "online" | "starting" | "degraded" | "failed" | "paused" | "offline";
export function normalizeStatus(s: string | undefined | null): MobileStatus {
if (s === "online" || s === "degraded" || s === "failed" || s === "paused" || s === "offline") {
return s;
}
if (s === "provisioning" || s === "starting") return "starting";
return "offline";
}
// Platform tier (number 1-4) → design tier code "T1".."T4"
export function tierCode(tier: number | undefined | null): "T1" | "T2" | "T3" | "T4" {
const n = typeof tier === "number" ? tier : 2;
if (n <= 1) return "T1";
if (n === 2) return "T2";
if (n === 3) return "T3";
return "T4";
}
+278
View File
@@ -0,0 +1,278 @@
"use client";
// Mobile primitives — StatusDot, TierChip, Chip, Icons, SectionLabel.
// Ports shared.jsx 1:1 from the design handoff; React + TypeScript flavor.
import type { CSSProperties, ReactNode, SVGProps } from "react";
import {
MOBILE_FONT_MONO,
type MobilePalette,
type MobileStatus,
usePalette,
} from "./palette";
type TierCode = "T1" | "T2" | "T3" | "T4";
export function StatusDot({
status = "online",
size = 8,
dark = false,
halo = true,
}: {
status?: MobileStatus;
size?: number;
dark?: boolean;
halo?: boolean;
}) {
const p = usePalette(dark);
const c: string = (p as unknown as Record<string, string>)[status] ?? p.online;
return (
<span
style={{
display: "inline-block",
width: size,
height: size,
borderRadius: 999,
background: c,
flexShrink: 0,
boxShadow: halo ? `0 0 0 ${Math.max(2, size * 0.45)}px ${c}26` : "none",
}}
/>
);
}
export function TierChip({
tier = "T2",
dark = false,
size = "sm",
}: {
tier?: TierCode;
dark?: boolean;
size?: "sm" | "lg";
}) {
const p = usePalette(dark);
const map: Record<TierCode, { bg: string; ink: string; br: string }> = {
T1: { bg: p.t1Bg, ink: p.t1Ink, br: p.t1Br },
T2: { bg: p.t2Bg, ink: p.t2Ink, br: p.t2Br },
T3: { bg: p.t3Bg, ink: p.t3Ink, br: p.t3Br },
T4: { bg: p.t4Bg, ink: p.t4Ink, br: p.t4Br },
};
const { bg, ink, br } = map[tier];
const dim = size === "lg" ? { w: 32, h: 22, fs: 11 } : { w: 26, h: 19, fs: 10 };
return (
<span
style={{
display: "inline-flex",
alignItems: "center",
justifyContent: "center",
width: dim.w,
height: dim.h,
borderRadius: 5,
background: bg,
color: ink,
border: `0.5px solid ${br}`,
fontFamily: MOBILE_FONT_MONO,
fontSize: dim.fs,
fontWeight: 600,
letterSpacing: "0.02em",
flexShrink: 0,
}}
>
{tier}
</span>
);
}
export function Chip({
label,
value,
accent,
dark = false,
soft = false,
}: {
label?: string;
value: ReactNode;
accent?: string;
dark?: boolean;
soft?: boolean;
}) {
const p = usePalette(dark);
return (
<span
style={{
display: "inline-flex",
alignItems: "center",
gap: 6,
padding: "4px 9px",
borderRadius: 999,
background: soft
? `${accent ?? p.accent}1a`
: dark
? "#2a2823"
: "#f0ede5",
border: `0.5px solid ${dark ? "rgba(255,255,255,0.06)" : "rgba(0,0,0,0.05)"}`,
fontSize: 11,
fontFamily: MOBILE_FONT_MONO,
color: p.text2,
letterSpacing: "0.02em",
}}
>
{label && (
<span style={{ textTransform: "uppercase", fontSize: 9.5, opacity: 0.7 }}>{label}</span>
)}
<span style={{ color: accent ?? p.text, fontWeight: 600 }}>{value}</span>
</span>
);
}
// ── icons (stroke-based, 20×20 viewBox) ───────────────────────
type IcoOpts = { stroke?: string; size?: number; fill?: string; sw?: number };
const ico = (
paths: ReactNode,
{ stroke = "currentColor", size = 18, fill = "none", sw = 1.6 }: IcoOpts = {},
) => {
const props: SVGProps<SVGSVGElement> = {
width: size,
height: size,
viewBox: "0 0 20 20",
fill,
stroke,
strokeWidth: sw,
strokeLinecap: "round",
strokeLinejoin: "round",
};
return <svg {...props}>{paths}</svg>;
};
export const Icons = {
graph: (o?: IcoOpts) =>
ico(
<>
<circle cx="5" cy="5" r="2" />
<circle cx="15" cy="5" r="2" />
<circle cx="10" cy="15" r="2" />
<path d="M6.4 6.5l2.7 7M13.6 6.5l-2.7 7" />
</>,
o,
),
list: (o?: IcoOpts) =>
ico(
<>
<path d="M6 5h10M6 10h10M6 15h10" />
<circle cx="3.5" cy="5" r="0.6" fill="currentColor" />
<circle cx="3.5" cy="10" r="0.6" fill="currentColor" />
<circle cx="3.5" cy="15" r="0.6" fill="currentColor" />
</>,
o,
),
search: (o?: IcoOpts) =>
ico(
<>
<circle cx="9" cy="9" r="5" />
<path d="M13 13l4 4" />
</>,
o,
),
plus: (o?: IcoOpts) => ico(<path d="M10 4v12M4 10h12" />, o),
bell: (o?: IcoOpts) =>
ico(
<>
<path d="M5 8a5 5 0 0 1 10 0v4l1.5 2H3.5L5 12V8z" />
<path d="M8.5 16a1.5 1.5 0 0 0 3 0" />
</>,
o,
),
chat: (o?: IcoOpts) =>
ico(
<path d="M4 5h12a1.5 1.5 0 0 1 1.5 1.5v6A1.5 1.5 0 0 1 16 14h-3l-3 3v-3H4a1.5 1.5 0 0 1-1.5-1.5v-6A1.5 1.5 0 0 1 4 5z" />,
o,
),
send: (o?: IcoOpts) =>
ico(<path d="M3 10l14-6-5 14-3-6-6-2z" fill="currentColor" />, { ...o, sw: 1 }),
attach: (o?: IcoOpts) =>
ico(
<path d="M14 6.5L7.5 13a2.5 2.5 0 0 0 3.5 3.5l7-7a4 4 0 0 0-5.6-5.6L4.8 11A6 6 0 0 0 13.3 19.5" />,
o,
),
back: (o?: IcoOpts) => ico(<path d="M12.5 4l-6 6 6 6" />, o),
more: (o?: IcoOpts) =>
ico(
<>
<circle cx="5" cy="10" r="1.2" fill="currentColor" />
<circle cx="10" cy="10" r="1.2" fill="currentColor" />
<circle cx="15" cy="10" r="1.2" fill="currentColor" />
</>,
o,
),
filter: (o?: IcoOpts) => ico(<path d="M3 5h14M5 10h10M8 15h4" />, o),
user: (o?: IcoOpts) =>
ico(
<>
<circle cx="10" cy="7" r="3" />
<path d="M3.5 17a6.5 6.5 0 0 1 13 0" />
</>,
o,
),
settings: (o?: IcoOpts) =>
ico(
<>
<circle cx="10" cy="10" r="2.2" />
<path d="M10 2.5v2M10 15.5v2M2.5 10h2M15.5 10h2M4.7 4.7l1.4 1.4M13.9 13.9l1.4 1.4M4.7 15.3l1.4-1.4M13.9 6.1l1.4-1.4" />
</>,
o,
),
pulse: (o?: IcoOpts) => ico(<path d="M2 10h3l2-5 3 10 2-7 2 4 4-2" />, o),
close: (o?: IcoOpts) => ico(<path d="M5 5l10 10M15 5L5 15" />, o),
zap: (o?: IcoOpts) => ico(<path d="M11 2l-6 9h4l-1 7 6-9h-4l1-7z" />, o),
check: (o?: IcoOpts) => ico(<path d="M4 10l4 4 8-9" />, o),
swatch: (o?: IcoOpts) =>
ico(
<>
<rect x="3" y="3" width="6" height="6" rx="1" />
<rect x="11" y="3" width="6" height="6" rx="1" />
<rect x="3" y="11" width="6" height="6" rx="1" />
<circle cx="14" cy="14" r="3.2" />
</>,
o,
),
};
export function SectionLabel({
children,
dark = false,
right,
style,
}: {
children: ReactNode;
dark?: boolean;
right?: ReactNode;
style?: CSSProperties;
}) {
const p = usePalette(dark);
return (
<div
style={{
display: "flex",
alignItems: "center",
justifyContent: "space-between",
padding: "14px 20px 6px",
fontFamily: MOBILE_FONT_MONO,
fontSize: 10.5,
letterSpacing: "0.12em",
textTransform: "uppercase",
color: p.text3,
fontWeight: 600,
...style,
}}
>
<span>{children}</span>
{right}
</div>
);
}
// Convenience: avoid repeating the (palette, dark) plumbing in screens
// that only need the palette object.
export function withPalette<T>(dark: boolean, fn: (p: MobilePalette) => T): T {
return fn(usePalette(dark));
}
+1 -1
View File
@@ -28,7 +28,7 @@ const FILE_ICONS: Record<string, string> = {
export function getIcon(path: string, isDir: boolean): string {
if (isDir) return "📁";
const ext = "." + path.split(".").pop();
const ext = "." + (path.split(".").pop() ?? "").toLowerCase();
return FILE_ICONS[ext] || "📄";
}
@@ -248,6 +248,81 @@ describe("extractResponseText", () => {
});
});
describe("extractAgentText", () => {
it("extracts from parts", () => {
const task = {
parts: [{ kind: "text", text: "Hello from agent" }],
};
expect(extractAgentText(task as Record<string, unknown>)).toBe("Hello from agent");
});
it("extracts from artifacts[0].parts", () => {
const task = {
artifacts: [
{ parts: [{ kind: "text", text: "Artifact text" }] },
],
};
expect(extractAgentText(task as Record<string, unknown>)).toBe("Artifact text");
});
it("extracts from status.message.parts", () => {
const task = {
status: {
message: { parts: [{ kind: "text", text: "Status text" }] },
},
};
expect(extractAgentText(task as Record<string, unknown>)).toBe("Status text");
});
it("prefers parts over artifacts", () => {
const task = {
parts: [{ kind: "text", text: "parts wins" }],
artifacts: [{ parts: [{ kind: "text", text: "artifacts lost" }] }],
};
expect(extractAgentText(task as Record<string, unknown>)).toBe("parts wins");
});
it("prefers artifacts[0] over status.message", () => {
const task = {
status: { message: { parts: [{ kind: "text", text: "status lost" }] } },
artifacts: [{ parts: [{ kind: "text", text: "artifacts wins" }] }],
};
expect(extractAgentText(task as Record<string, unknown>)).toBe("artifacts wins");
});
it("falls back to string task", () => {
expect(extractAgentText("raw string task" as unknown as Record<string, unknown>)).toBe("raw string task");
});
// FIXED BUG: when all three sources return nothing (no text parts), extractAgentText
// now returns "" instead of the error message. An empty task should render as a
// blank bubble, not an error indicator.
it("returns empty string when parts is empty array", () => {
const task = { parts: [] };
expect(extractAgentText(task as Record<string, unknown>)).toBe("");
});
it("returns empty string when artifacts is empty array", () => {
const task = { artifacts: [] };
expect(extractAgentText(task as Record<string, unknown>)).toBe("");
});
it("returns empty string when status.message.parts is empty", () => {
const task = { status: { message: { parts: [] } } };
expect(extractAgentText(task as Record<string, unknown>)).toBe("");
});
it("tolerates null/undefined status.message without throwing", () => {
const task = { status: null };
expect(extractAgentText(task as Record<string, unknown>)).toBe("");
});
it("tolerates undefined artifacts without throwing", () => {
const task = {};
expect(extractAgentText(task as Record<string, unknown>)).toBe("");
});
});
describe("extractTextsFromParts", () => {
it("extracts text parts with kind=text", () => {
const parts = [
@@ -1,5 +1,8 @@
export function extractAgentText(task: Record<string, unknown>): string {
try {
// Check direct string first — some callers pass the raw response body.
if (typeof task === "string") return task;
const directTexts = extractTextsFromParts(task.parts);
if (directTexts) return directTexts;
@@ -16,8 +19,14 @@ export function extractAgentText(task: Record<string, unknown>): string {
if (texts) return texts;
}
if (typeof task === "string") return task;
return "(Could not extract response text)";
// No text found in any source. Return "" so callers render a blank
// bubble rather than an error chip. This handles:
// - parts: [] (empty array, no text parts)
// - artifacts: [] (no artifacts at all)
// - status: {} (status present but no message)
// - status.message=null (null guard)
// - {} (entirely empty task)
return "";
} catch {
return "(Failed to parse response)";
}
+1 -1
View File
@@ -30,7 +30,7 @@ export function createMessage(
id: crypto.randomUUID(),
role,
content,
attachments: attachments && attachments.length > 0 ? attachments : undefined,
...(attachments && attachments.length > 0 ? { attachments } : {}),
timestamp: new Date().toISOString(),
};
}
@@ -65,13 +65,17 @@ export function TestConnectionButton({
return (
<div className="test-connection">
{state === 'testing' && (
<span aria-hidden="true" className="test-connection__spinner">
<Spinner />
</span>
)}
<button
type="button"
onClick={handleTest}
disabled={state === 'testing' || !secretValue}
className={`test-connection__btn test-connection__btn--${state}`}
>
{state === 'testing' && <Spinner />}
{LABELS[state]}
</button>
{errorDetail && state === 'failure' && (
@@ -83,9 +87,9 @@ export function TestConnectionButton({
);
}
function Spinner() {
function Spinner({ ariaHidden = true }: { ariaHidden?: boolean }) {
return (
<svg className="spinner" width="14" height="14" viewBox="0 0 24 24" fill="none" stroke="currentColor" strokeWidth="2">
<svg className="spinner" width="14" height="14" viewBox="0 0 24 24" fill="none" stroke="currentColor" strokeWidth="2" aria-hidden={ariaHidden}>
<path d="M12 2v4M12 18v4M4.93 4.93l2.83 2.83M16.24 16.24l2.83 2.83M2 12h4M18 12h4M4.93 19.07l2.83-2.83M16.24 7.76l2.83-2.83" />
</svg>
);
+213
View File
@@ -0,0 +1,213 @@
// @vitest-environment jsdom
/**
* Tests for canvas/src/lib/hydrate.ts — exponential-backoff canvas store hydration.
*
* 7 cases:
* 1. Success on first attempt → { error: null }
* 2. Viewport fetch fails (non-fatal) → store still hydrates, returns { error: null }
* 3. Success after 1 retry → onRetrying(1) called once, final result { error: null }
* 4. Success after 2 retries → onRetrying called for each failed attempt
* 5. All attempts fail → returns the error message after MAX_RETRIES
* 6. onRetrying called with correct attempt number on each retry
* 7. Exponential backoff delays: 1s, 2s, 4s for attempts 1, 2, 3
*/
import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
import { api } from "@/lib/api";
import { useCanvasStore } from "@/store/canvas";
import { hydrateCanvas, MAX_RETRIES } from "../hydrate";
// ─── Mock api ──────────────────────────────────────────────────────────────────
// PLATFORM_URL must be a named export — hydrate.ts imports it directly, not via api.
vi.mock("@/lib/api", () => ({
api: {
get: vi.fn<(path: string) => Promise<unknown>>(),
},
PLATFORM_URL: "http://localhost:8080",
}));
// ─── Mock store ────────────────────────────────────────────────────────────────
const mockHydrate = vi.fn();
const mockSetViewport = vi.fn();
vi.mock("@/store/canvas", () => ({
useCanvasStore: {
getState: () => ({
hydrate: mockHydrate,
setViewport: mockSetViewport,
}),
},
}));
// ─── Helpers ───────────────────────────────────────────────────────────────────
const mockApiGet = vi.mocked(api.get);
function makeWorkspace(id = "ws-1") {
return {
id,
name: "Test WS",
role: "assistant",
tier: 1,
status: "online" as const,
agent_card: null,
url: "http://localhost:9000",
parent_id: null,
active_tasks: 0,
last_error_rate: 0,
last_sample_error: "",
uptime_seconds: 60,
current_task: "",
x: 0,
y: 0,
collapsed: false,
runtime: "",
budget_limit: null,
};
}
// ─── Setup / teardown ──────────────────────────────────────────────────────────
beforeEach(() => {
vi.clearAllMocks();
vi.useFakeTimers();
});
afterEach(() => {
vi.useRealTimers();
});
// ─── Tests ─────────────────────────────────────────────────────────────────────
describe("hydrateCanvas — success paths", () => {
it("returns { error: null } on first-attempt success", async () => {
mockApiGet
.mockResolvedValueOnce([makeWorkspace()]) // /workspaces
.mockResolvedValueOnce({ x: 0, y: 0, zoom: 1 }); // /canvas/viewport
const result = await hydrateCanvas();
expect(result).toEqual({ error: null });
expect(mockHydrate).toHaveBeenCalledOnce();
expect(mockSetViewport).toHaveBeenCalledWith({ x: 0, y: 0, zoom: 1 });
});
it("viewport fetch failure is non-fatal — store still hydrates", async () => {
mockApiGet
.mockResolvedValueOnce([makeWorkspace()]) // /workspaces OK
.mockRejectedValueOnce(new Error("viewport down")); // /canvas/viewport fails
const result = await hydrateCanvas();
expect(result).toEqual({ error: null });
expect(mockHydrate).toHaveBeenCalledOnce();
expect(mockSetViewport).not.toHaveBeenCalled();
});
it("returns { error: null } after 1 retry", async () => {
const onRetrying = vi.fn();
// Each attempt makes 2 parallel api.get calls (workspaces + viewport).
// Attempt 1 (fails): /workspaces → rejected, /viewport → resolved
// Attempt 2 (succeeds): /workspaces → resolved, /viewport → resolved
mockApiGet
.mockRejectedValueOnce(new Error("network down")) // attempt 1: /workspaces
.mockResolvedValueOnce({ x: 0, y: 0, zoom: 1 }) // attempt 1: /viewport
.mockResolvedValueOnce([makeWorkspace()]) // attempt 2: /workspaces
.mockResolvedValueOnce({ x: 0, y: 0, zoom: 1 }); // attempt 2: /viewport
const promise = hydrateCanvas(onRetrying);
// Advance past the first backoff delay (1000 * 2^0 = 1000 ms)
await vi.advanceTimersByTimeAsync(1000);
await vi.runAllTimersAsync();
const result = await promise;
expect(result).toEqual({ error: null });
expect(onRetrying).toHaveBeenCalledTimes(1);
expect(onRetrying).toHaveBeenCalledWith(1);
});
it("onRetrying called once per failed attempt before next retry", async () => {
const onRetrying = vi.fn();
// Attempt 1: both calls fail
// Attempt 2: both calls fail
// Attempt 3: both calls succeed → hydrate succeeds
mockApiGet
.mockRejectedValueOnce(new Error("attempt 1")) // a1: /workspaces
.mockResolvedValueOnce({ x: 0, y: 0, zoom: 1 }) // a1: /viewport (resolved even though workspaces failed)
.mockRejectedValueOnce(new Error("attempt 2")) // a2: /workspaces
.mockResolvedValueOnce({ x: 0, y: 0, zoom: 1 }) // a2: /viewport
.mockResolvedValueOnce([makeWorkspace()]) // a3: /workspaces
.mockResolvedValueOnce({ x: 0, y: 0, zoom: 1 }); // a3: /viewport
const promise = hydrateCanvas(onRetrying);
await vi.runAllTimersAsync();
const result = await promise;
expect(result).toEqual({ error: null });
expect(onRetrying).toHaveBeenCalledTimes(2);
expect(onRetrying).toHaveBeenNthCalledWith(1, 1);
expect(onRetrying).toHaveBeenNthCalledWith(2, 2);
});
});
describe("hydrateCanvas — failure paths", () => {
it("returns error message after all MAX_RETRIES attempts exhausted", async () => {
for (let i = 0; i < MAX_RETRIES; i++) {
mockApiGet.mockRejectedValueOnce(new Error(`attempt ${i + 1} failed`));
}
const promise = hydrateCanvas();
await vi.runAllTimersAsync();
const result = await promise;
expect(result.error).not.toBeNull();
expect(result.error).toContain("Unable to connect to platform");
expect(mockHydrate).not.toHaveBeenCalled();
});
it("onRetrying called MAX_RETRIES-1 times before final exhausted attempt", async () => {
const onRetrying = vi.fn();
for (let i = 0; i < MAX_RETRIES; i++) {
mockApiGet.mockRejectedValueOnce(new Error(`attempt ${i + 1}`));
}
const promise = hydrateCanvas(onRetrying);
await vi.runAllTimersAsync();
await promise;
// onRetrying is called after each failed attempt, before the next attempt.
// With MAX_RETRIES=3: called after attempt 1 (→2) and after attempt 2 (→3).
expect(onRetrying).toHaveBeenCalledTimes(MAX_RETRIES - 1);
});
});
describe("hydrateCanvas — exponential backoff timing", () => {
it("total elapsed time equals sum of exponential delays 1s + 2s + 4s", async () => {
const onRetrying = vi.fn();
for (let i = 0; i < MAX_RETRIES; i++) {
mockApiGet.mockRejectedValueOnce(new Error(`attempt ${i + 1}`));
}
const start = Date.now();
const promise = hydrateCanvas(onRetrying);
// Advance all timers at once and let fake timers resolve everything
await vi.runAllTimersAsync();
await promise;
const elapsed = Date.now() - start;
// Total expected: 1000 (delay1) + 2000 (delay2) = 3000 ms
// (no delay after the final attempt 3 — function returns immediately)
expect(elapsed).toBeGreaterThanOrEqual(2999);
expect(elapsed).toBeLessThan(5000); // sanity cap
expect(onRetrying).toHaveBeenCalledTimes(MAX_RETRIES - 1);
});
});
@@ -94,9 +94,10 @@ describe("sortParentsBeforeChildren", () => {
{ id: "orphan", parentId: "ghost" },
{ id: "root", parentId: undefined },
];
// Missing parent is skipped; orphan placed after root
// Missing parent is skipped; orphan keeps its input order
// (ghost doesn't exist → orphan is treated as a root in output order)
const result = sortParentsBeforeChildren(nodes);
expect(result.map((n) => n.id)).toEqual(["root", "orphan"]);
expect(result.map((n) => n.id)).toEqual(["orphan", "root"]);
});
});
+19 -1
View File
@@ -492,6 +492,12 @@ done
# probes docker.Ping + container exec; we still expect ok=true there
# since local-docker is the alternative production path.
log "7b/11 Canvas-terminal EIC diagnose probe..."
# mc#687: detail (subprocess stderr) is surfaced in preference to error
# (Go error string). The subprocess stderr contains the actionable signal —
# e.g. "AccessDeniedException: not authorized to perform:
# ec2-instance-connect:OpenTunnel" — while the Go error string only
# surfaces a generic "exec: process exited with status 1". Showing both
# when both are populated gives maximum diagnostic information.
for wid in $WS_TO_CHECK; do
DIAG_JSON=$(tenant_call GET "/workspaces/$wid/terminal/diagnose" 2>/dev/null || echo '{}')
DIAG_OK=$(echo "$DIAG_JSON" | python3 -c "import json,sys; d=json.load(sys.stdin); print('true' if d.get('ok') else 'false')" 2>/dev/null || echo "false")
@@ -499,7 +505,19 @@ for wid in $WS_TO_CHECK; do
ok " $wid terminal-reachable (canvas terminal will work)"
else
DIAG_FAIL=$(echo "$DIAG_JSON" | python3 -c "import json,sys; d=json.load(sys.stdin); print(d.get('first_failure','unknown'))" 2>/dev/null || echo "unknown")
DIAG_DETAIL=$(echo "$DIAG_JSON" | python3 -c "import json,sys; d=json.load(sys.stdin); s=[x for x in d.get('steps',[]) if not x.get('ok')]; print(s[0].get('error','') if s else '')" 2>/dev/null || echo "")
DIAG_DETAIL=$(echo "$DIAG_JSON" | python3 -c "
import json,sys
d=json.load(sys.stdin)
steps=[x for x in d.get('steps',[]) if not x.get('ok')]
if not steps: sys.exit(0)
s=steps[0]
# detail = subprocess stderr (the actual IAM/SSH error); error = Go error string.
detail=s.get('detail','')
error=s.get('error','')
if detail and error: print(detail+' ('+error+')')
elif detail: print(detail)
elif error: print(error)
" 2>/dev/null || echo "")
fail "Workspace $wid terminal diagnose failed at step '$DIAG_FAIL': $DIAG_DETAIL — check tenant SG has tcp/22 from EIC endpoint SG (sg-0785d5c6138220523), EIC_ENDPOINT_SG_ID set in Railway, and EIC endpoint health"
fi
done
@@ -76,6 +76,7 @@ func TestBuildBundleConfigFiles_Skills(t *testing.T) {
},
},
}
files := buildBundleConfigFiles(b)
// 2 skills × 1 file each = 2 files
if n := len(files); n != 2 {
t.Fatalf("skills: want 2 files, got %d", n)
@@ -80,6 +80,54 @@ func TestExtractIdempotencyKey_emptyOnMissing(t *testing.T) {
}
}
// ──────────────────────────────────────────────────────────────────────────────
// extractExpiresInSeconds
// ──────────────────────────────────────────────────────────────────────────────
func TestExtractExpiresInSeconds_valid(t *testing.T) {
cases := []struct {
name string
body string
want int
}{
{"positive int", `{"params":{"expires_in_seconds":30}}`, 30},
{"zero", `{"params":{"expires_in_seconds":0}}`, 0},
{"large TTL", `{"params":{"expires_in_seconds":3600}}`, 3600},
{"nested message — not affected", `{"params":{"message":{"role":"user"},"expires_in_seconds":60}}`, 60},
}
for _, tc := range cases {
t.Run(tc.name, func(t *testing.T) {
if got := extractExpiresInSeconds([]byte(tc.body)); got != tc.want {
t.Errorf("extractExpiresInSeconds = %d, want %d", got, tc.want)
}
})
}
}
func TestExtractExpiresInSeconds_invalidOrMissing(t *testing.T) {
cases := []struct {
name string
body string
want int
}{
{"negative → 0", `{"params":{"expires_in_seconds":-5}}`, 0},
{"missing expires_in_seconds", `{"params":{"message":{"role":"user"}}}`, 0},
{"no params at all", `{"method":"message/send"}`, 0},
{"malformed JSON", `not json`, 0},
{"empty body", ``, 0},
{"null value", `{"params":{"expires_in_seconds":null}}`, 0},
{"string value", `{"params":{"expires_in_seconds":"30"}}`, 0},
{"float value", `{"params":{"expires_in_seconds":30.5}}`, 0},
}
for _, tc := range cases {
t.Run(tc.name, func(t *testing.T) {
if got := extractExpiresInSeconds([]byte(tc.body)); got != tc.want {
t.Errorf("extractExpiresInSeconds(%q) = %d, want %d", tc.body, got, tc.want)
}
})
}
}
func TestExtractDelegationIDFromBody(t *testing.T) {
cases := []struct {
name string
@@ -0,0 +1,224 @@
package handlers
import (
"encoding/json"
"testing"
"github.com/stretchr/testify/assert"
)
// extractResponseText tests — walks A2A JSON-RPC response bodies and
// returns the first text part, falling back to raw body on parse failures.
func TestExtractResponseText_PartsWithTextKind(t *testing.T) {
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{
map[string]interface{}{"kind": "text", "text": "hello world"},
map[string]interface{}{"kind": "text", "text": "second part"},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "hello world", extractResponseText(body))
}
func TestExtractResponseText_PartNotTextKind(t *testing.T) {
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{
map[string]interface{}{"kind": "image", "data": "base64..."},
map[string]interface{}{"kind": "text", "text": "visible"},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "visible", extractResponseText(body))
}
func TestExtractResponseText_PartsEmpty(t *testing.T) {
// Empty parts array — falls through to artifacts, then raw body
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{},
"artifacts": []interface{}{},
},
}
body, _ := json.Marshal(resp)
// Falls through to raw body (which is the JSON string)
result := extractResponseText(body)
assert.NotEmpty(t, result)
}
func TestExtractResponseText_ArtifactPartsWithText(t *testing.T) {
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{},
"artifacts": []interface{}{
map[string]interface{}{
"kind": "file",
"parts": []interface{}{
map[string]interface{}{"kind": "text", "text": "artifact text"},
},
},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "artifact text", extractResponseText(body))
}
func TestExtractResponseText_ArtifactPartNotTextKind(t *testing.T) {
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{},
"artifacts": []interface{}{
map[string]interface{}{
"kind": "code",
"parts": []interface{}{
map[string]interface{}{"kind": "image", "data": "..."},
map[string]interface{}{"kind": "text", "text": "code comment"},
},
},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "code comment", extractResponseText(body))
}
func TestExtractResponseText_ArtifactsEmpty(t *testing.T) {
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{},
"artifacts": []interface{}{},
},
}
body, _ := json.Marshal(resp)
result := extractResponseText(body)
// Falls back to raw body
assert.Equal(t, string(body), result)
}
func TestExtractResponseText_NoResult(t *testing.T) {
// No "result" key at all — falls back to raw body
body := []byte(`{"error": {"code": -32600, "message": "Invalid Request"}}`)
result := extractResponseText(body)
assert.Equal(t, string(body), result)
}
func TestExtractResponseText_ResultNotMap(t *testing.T) {
// result is a string, not a map — falls back to raw body
body := []byte(`{"result": "just a string"}`)
result := extractResponseText(body)
assert.Equal(t, string(body), result)
}
func TestExtractResponseText_NonJSONBody(t *testing.T) {
// Non-JSON bytes — returns the raw string
body := []byte("plain text response, not JSON at all")
result := extractResponseText(body)
assert.Equal(t, "plain text response, not JSON at all", result)
}
func TestExtractResponseText_PartWithNilText(t *testing.T) {
// Text field is nil — kind is "text" but text is nil, should skip
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{
map[string]interface{}{"kind": "text", "text": nil},
map[string]interface{}{"kind": "text", "text": "found"},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "found", extractResponseText(body))
}
func TestExtractResponseText_ArtifactPartWithNilText(t *testing.T) {
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{},
"artifacts": []interface{}{
map[string]interface{}{
"parts": []interface{}{
map[string]interface{}{"kind": "text", "text": nil},
map[string]interface{}{"kind": "text", "text": "artifact-found"},
},
},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "artifact-found", extractResponseText(body))
}
func TestExtractResponseText_PartsWithNonMapElement(t *testing.T) {
// parts contains a non-map element — should be skipped gracefully
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{
"not a map",
123,
nil,
map[string]interface{}{"kind": "text", "text": "parsed"},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "parsed", extractResponseText(body))
}
func TestExtractResponseText_ArtifactWithNonMapElement(t *testing.T) {
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{},
"artifacts": []interface{}{
"not a map",
nil,
map[string]interface{}{
"parts": []interface{}{
"not a map",
map[string]interface{}{"kind": "text", "text": "safe"},
},
},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "safe", extractResponseText(body))
}
func TestExtractResponseText_PartKindNotString(t *testing.T) {
// kind is an integer, not a string — should be skipped
resp := map[string]interface{}{
"result": map[string]interface{}{
"parts": []interface{}{
map[string]interface{}{"kind": 123, "text": "ignored"},
map[string]interface{}{"kind": "text", "text": "found"},
},
},
}
body, _ := json.Marshal(resp)
assert.Equal(t, "found", extractResponseText(body))
}
func TestExtractResponseText_EmptyResponse(t *testing.T) {
body := []byte("{}")
result := extractResponseText(body)
// Falls back to raw "{}"
assert.Equal(t, "{}", result)
}
func TestExtractResponseText_NilBody(t *testing.T) {
// nil byte slice — string(nil) = ""
result := extractResponseText(nil)
assert.Equal(t, "", result)
}
func TestExtractResponseText_WhitespaceBody(t *testing.T) {
body := []byte(" \n\t ")
result := extractResponseText(body)
// Unmarshals to empty map, no result, returns raw string
assert.Equal(t, " \n\t ", result)
}
@@ -1287,3 +1287,80 @@ func TestExecuteDelegation_CleanProxyResponse_Unchanged(t *testing.T) {
t.Errorf("unmet sqlmock expectations: %v", err)
}
}
// ---------- extractResponseText ----------
func TestExtractResponseText_NonJSON(t *testing.T) {
got := extractResponseText([]byte("not json at all"))
if got != "not json at all" {
t.Errorf("non-JSON: got %q, want %q", got, "not json at all")
}
}
func TestExtractResponseText_ValidJSONNoResult(t *testing.T) {
got := extractResponseText([]byte(`{"id":"1","error":{"code":-32601,"message":"method not found"}}`))
if got != `{"id":"1","error":{"code":-32601,"message":"method not found"}}` {
t.Errorf("no result key: got %q, want raw body", got)
}
}
// TestExtractResponseText_* cases live in delegation_extract_response_text_test.go
// to keep pure-helper tests in their own file.
func TestExtractResponseText_PartsTextKind(t *testing.T) {
body := []byte(`{"result":{"parts":[{"kind":"text","text":"Hello from agent"}]}}`)
got := extractResponseText(body)
if got != "Hello from agent" {
t.Errorf("parts text: got %q, want %q", got, "Hello from agent")
}
}
func TestExtractResponseText_PartsNonTextKind(t *testing.T) {
// kind="image" is skipped; falls through to raw body since no artifacts
body := []byte(`{"result":{"parts":[{"kind":"image","text":"should not return"}]}}`)
got := extractResponseText(body)
if got != string(body) {
t.Errorf("parts non-text: got %q, want raw body", got)
}
}
func TestExtractResponseText_PartsMultipleWithTextFirst(t *testing.T) {
body := []byte(`{"result":{"parts":[{"kind":"text","text":"first"},{"kind":"text","text":"second"}]}}`)
got := extractResponseText(body)
// Returns first text part found
if got != "first" {
t.Errorf("parts first match: got %q, want %q", got, "first")
}
}
func TestExtractResponseText_ArtifactsTextKind(t *testing.T) {
body := []byte(`{"result":{"artifacts":[{"parts":[{"kind":"text","text":"artifact text here"}]}]}}`)
got := extractResponseText(body)
if got != "artifact text here" {
t.Errorf("artifacts text: got %q, want %q", got, "artifact text here")
}
}
func TestExtractResponseText_ArtifactsNonTextKind(t *testing.T) {
body := []byte(`{"result":{"artifacts":[{"parts":[{"kind":"image","text":"hidden"}]}]}}`)
got := extractResponseText(body)
if got != string(body) {
t.Errorf("artifacts non-text: got %q, want raw body", got)
}
}
func TestExtractResponseText_EmptyPartsAndArtifacts(t *testing.T) {
body := []byte(`{"result":{"parts":[],"artifacts":[]}}`)
got := extractResponseText(body)
if got != string(body) {
t.Errorf("empty parts/artifacts: got %q, want raw body", got)
}
}
func TestExtractResponseText_EmptyText(t *testing.T) {
body := []byte(`{"result":{"parts":[{"kind":"text","text":""}]}}`)
got := extractResponseText(body)
if got != "" {
t.Errorf("empty text: got %q, want %q", got, "")
}
}
@@ -292,8 +292,12 @@ func filterPeersByQuery(peers []map[string]interface{}, q string) []map[string]i
needle := strings.ToLower(q)
out := make([]map[string]interface{}, 0, len(peers))
for _, p := range peers {
name := p["name"].(string)
role := p["role"].(string)
// Comma-ok idiom: nil map values return (nil, false), protecting
// against type-assertion panics when queryPeerMaps explicitly sets
// role=nil for empty-string roles (discovery.go:340). Also guards
// against nil name if the DB returns NULL.
name, _ := p["name"].(string)
role, _ := p["role"].(string)
if strings.Contains(strings.ToLower(name), needle) ||
strings.Contains(strings.ToLower(role), needle) {
out = append(out, p)
@@ -0,0 +1,160 @@
package handlers
import (
"testing"
)
// filterPeersByQuery tests — nil-safe role/name filtering for peer discovery.
func TestFilterPeersByQuery_EmptyQueryNoOp(t *testing.T) {
peers := []map[string]interface{}{
{"name": "foo", "role": "bar"},
{"name": "baz", "role": "qux"},
}
result := filterPeersByQuery(peers, "")
if len(result) != 2 {
t.Errorf("empty query: expected 2, got %d", len(result))
}
}
func TestFilterPeersByQuery_WhitespaceQueryNoOp(t *testing.T) {
peers := []map[string]interface{}{
{"name": "foo", "role": "bar"},
}
result := filterPeersByQuery(peers, " ")
if len(result) != 1 {
t.Errorf("whitespace-only query: expected 1, got %d", len(result))
}
}
func TestFilterPeersByQuery_MatchName(t *testing.T) {
peers := []map[string]interface{}{
{"name": "backend-agent", "role": "sre"},
{"name": "frontend-agent", "role": "ui"},
}
result := filterPeersByQuery(peers, "backend")
if len(result) != 1 || result[0]["name"] != "backend-agent" {
t.Errorf("expected backend-agent, got %v", result)
}
}
func TestFilterPeersByQuery_MatchRole(t *testing.T) {
peers := []map[string]interface{}{
{"name": "agent-alpha", "role": "security engineer"},
{"name": "agent-beta", "role": "devops"},
}
result := filterPeersByQuery(peers, "engineer")
if len(result) != 1 || result[0]["name"] != "agent-alpha" {
t.Errorf("expected agent-alpha, got %v", result)
}
}
func TestFilterPeersByQuery_CaseInsensitive(t *testing.T) {
peers := []map[string]interface{}{
{"name": "AgentX", "role": "SRE"},
}
result := filterPeersByQuery(peers, "AGENTx")
if len(result) != 1 {
t.Errorf("expected 1 match (case-insensitive), got %d", len(result))
}
}
func TestFilterPeersByQuery_NilRoleNoPanic(t *testing.T) {
// This is the regression case for #730: queryPeerMaps explicitly sets
// peer["role"] = nil when the DB role is empty string. Before the fix,
// p["role"].(string) panics on nil. After the fix, it returns "" and
// no match occurs — which is the correct behaviour.
defer func() {
if r := recover(); r != nil {
t.Errorf("filterPeersByQuery panicked on nil role: %v", r)
}
}()
peers := []map[string]interface{}{
{"name": "some-agent", "role": nil},
}
result := filterPeersByQuery(peers, "some-agent")
if len(result) != 1 {
t.Errorf("expected 1 match by name, got %d", len(result))
}
}
func TestFilterPeersByQuery_NilRoleQueryNoMatch(t *testing.T) {
// When role is nil and query does not match name, nothing matches.
defer func() {
if r := recover(); r != nil {
t.Errorf("filterPeersByQuery panicked on nil role: %v", r)
}
}()
peers := []map[string]interface{}{
{"name": "agent-alpha", "role": nil},
}
result := filterPeersByQuery(peers, "no-match")
if len(result) != 0 {
t.Errorf("expected 0 matches, got %d", len(result))
}
}
func TestFilterPeersByQuery_NilNameNoPanic(t *testing.T) {
// Defensive check: name could also theoretically be nil.
defer func() {
if r := recover(); r != nil {
t.Errorf("filterPeersByQuery panicked on nil name: %v", r)
}
}()
peers := []map[string]interface{}{
{"name": nil, "role": "sre"},
}
result := filterPeersByQuery(peers, "sre")
if len(result) != 1 {
t.Errorf("expected 1 match by role, got %d", len(result))
}
}
func TestFilterPeersByQuery_BothNilNoPanic(t *testing.T) {
defer func() {
if r := recover(); r != nil {
t.Errorf("filterPeersByQuery panicked on nil name+role: %v", r)
}
}()
peers := []map[string]interface{}{
{"name": nil, "role": nil},
}
result := filterPeersByQuery(peers, "")
if len(result) != 1 {
t.Errorf("empty query with nil name/role: expected 1, got %d", len(result))
}
result = filterPeersByQuery(peers, "anything")
if len(result) != 0 {
t.Errorf("non-empty query with nil name/role: expected 0, got %d", len(result))
}
}
func TestFilterPeersByQuery_NoMatches(t *testing.T) {
peers := []map[string]interface{}{
{"name": "alpha", "role": "beta"},
{"name": "gamma", "role": "delta"},
}
result := filterPeersByQuery(peers, "zzz")
if len(result) != 0 {
t.Errorf("expected 0, got %d", len(result))
}
}
func TestFilterPeersByQuery_EmptyPeers(t *testing.T) {
result := filterPeersByQuery([]map[string]interface{}{}, "query")
if len(result) != 0 {
t.Errorf("empty peers: expected 0, got %d", len(result))
}
}
func TestFilterPeersByQuery_MultipleMatches(t *testing.T) {
peers := []map[string]interface{}{
{"name": "backend-alpha", "role": "eng"},
{"name": "backend-beta", "role": "eng"},
{"name": "frontend", "role": "ui"},
}
result := filterPeersByQuery(peers, "backend")
if len(result) != 2 {
t.Errorf("expected 2 backend matches, got %d", len(result))
}
}
+40 -1
View File
@@ -548,10 +548,28 @@ func TestMCPHandler_CommitMemory_CleanContent_PassesThrough(t *testing.T) {
// tools/call — recall_memory
// ─────────────────────────────────────────────────────────────────────────────
// TestMCPHandler_RecallMemory_GlobalScope_Blocked verifies C3 enforcement:
// GLOBAL scope is blocked on the MCP bridge. Sibling of
// TestMCPHandler_CommitMemory_GlobalScope_Blocked (#681 — mirrors PR#680's
// OFFSEC-001 contract hardening from the commit-memory path).
//
// Canary tokens are included in the arguments so a future OFFSEC-001 regression
// (err.Error() leaking into the JSON-RPC message) would be caught by the
// defence-in-depth strings.Contains guard even if the exact-message assertion
// were deleted. Per feedback_branch_count_before_approving the recall path
// must be verified independently since it flows through a different tool
// implementation (toolRecallMemory vs toolCommitMemory).
func TestMCPHandler_RecallMemory_GlobalScope_Blocked(t *testing.T) {
h, mock := newMCPHandler(t)
// No DB expectations — handler must abort before touching the DB.
// Canary tokens: truly arbitrary strings that could NOT appear in
// the error message naturally. If OFFSEC-001 regresses and the raw
// err.Error() is returned, these will appear verbatim in the response.
// Tokens chosen to not overlap with the actual error message text
// ("GLOBAL", "scope", "permitted", etc.) — which WOULD appear even
// when the scrub is correct, making them useless as sentinels.
const canary = "xK8mPqRwT zN7vLsJhYw"
w := mcpPost(t, h, "ws-1", map[string]interface{}{
"jsonrpc": "2.0",
"id": 11,
@@ -559,7 +577,7 @@ func TestMCPHandler_RecallMemory_GlobalScope_Blocked(t *testing.T) {
"params": map[string]interface{}{
"name": "recall_memory",
"arguments": map[string]interface{}{
"query": "secret",
"query": canary,
"scope": "GLOBAL",
},
},
@@ -570,6 +588,27 @@ func TestMCPHandler_RecallMemory_GlobalScope_Blocked(t *testing.T) {
if resp.Error == nil {
t.Error("expected JSON-RPC error for GLOBAL scope recall, got nil")
}
// Exact-equality assertions: code == -32000 AND the constant message.
// The message must be the constant defined in toolRecallMemory, not the
// raw err.Error() value — OFFSEC-001 (#259) requires this so callers
// (including agent runtimes) cannot learn server-side details.
wantMsg := "GLOBAL scope is not permitted via the MCP bridge — use LOCAL, TEAM, or empty"
if resp.Error != nil {
if resp.Error.Code != -32000 {
t.Errorf("error code should be -32000, got %d", resp.Error.Code)
}
if resp.Error.Message != wantMsg {
t.Errorf("error message should be constant %q, got %q", wantMsg, resp.Error.Message)
}
// Defence-in-depth: canary tokens must never appear in the response.
// A future regression where err.Error() is assigned directly would
// expose these arbitrary strings verbatim in the JSON-RPC body.
for _, token := range strings.Fields(canary) {
if strings.Contains(resp.Error.Message, token) {
t.Errorf("error message should not contain canary token %q (OFFSEC-001 leak)", token)
}
}
}
if err := mock.ExpectationsWereMet(); err != nil {
t.Errorf("unexpected DB calls on GLOBAL scope block: %v", err)
}
@@ -346,7 +346,7 @@ func (g *gitFetcher) Fetch(ctx context.Context, rootDir, host, repoPath, ref str
// MkdirTemp creates the dir; git clone refuses to clone into a
// non-empty dir. Remove + recreate empty.
os.RemoveAll(tmpDir)
cloneAndConfig := append(gitArgs("clone", "--quiet", "--depth=1", "-b", ref, cloneURL, tmpDir))
cloneAndConfig := gitArgs("clone", "--quiet", "--depth=1", "-b", ref, cloneURL, tmpDir)
cmd := exec.CommandContext(ctx, "git", cloneAndConfig...)
cmd.Env = append(os.Environ(), "GIT_TERMINAL_PROMPT=0")
if out, err := cmd.CombinedOutput(); err != nil {
@@ -0,0 +1,421 @@
package handlers
import (
"testing"
)
// ── isSafeRoleName ────────────────────────────────────────────────────────────
func TestIsSafeRoleName_Valid(t *testing.T) {
cases := []string{
"backend",
"frontend",
"backend-engineer",
"Frontend_Engineer",
"DevOps123",
"sre-team",
"a",
"ABC",
"Role_With_Underscores_And-Numbers123",
}
for _, r := range cases {
t.Run(r, func(t *testing.T) {
if !isSafeRoleName(r) {
t.Errorf("isSafeRoleName(%q): expected true, got false", r)
}
})
}
}
func TestIsSafeRoleName_Invalid(t *testing.T) {
cases := []struct {
name string
role string
}{
{"empty", ""},
{"dot", "."},
{"double dot", ".."},
{"path separator", "backend/engineer"},
{"space", "backend engineer"},
{"special char", "backend@engineer"},
{"at sign", "role@team"},
{"colon", "role:admin"},
{"hash", "role#1"},
{"percent", "role%20"},
{"quote", `role"name`},
{"backslash", `role\name`},
{"tilde", "role~test"},
{"backtick", "`role"},
{"bracket open", "[role]"},
{"bracket close", "role]"},
{"plus", "role+admin"},
{"equals", "role=admin"},
{"caret", "role^admin"},
{"question mark", "role?"},
{"pipe at end", "role|"},
{"greater than", "role>"},
{"asterisk", "role*"},
{"ampersand", "role&"},
{"exclamation at end", "role!"},
}
for _, tc := range cases {
t.Run(tc.name, func(t *testing.T) {
if isSafeRoleName(tc.role) {
t.Errorf("isSafeRoleName(%q): expected false, got true", tc.role)
}
})
}
}
// ── hasUnresolvedVarRef ───────────────────────────────────────────────────────
func TestHasUnresolvedVarRef_NoVars(t *testing.T) {
cases := []string{
"",
"plain text",
"no variables here",
"123 numeric",
"$",
"${}",
"$5",
"$$$$",
}
for _, s := range cases {
t.Run(s, func(t *testing.T) {
if hasUnresolvedVarRef(s, s) {
t.Errorf("hasUnresolvedVarRef(%q, %q): expected false, got true", s, s)
}
})
}
}
func TestHasUnresolvedVarRef_Resolved(t *testing.T) {
// Expansion consumed the var refs (where "consumed" means the output no longer
// contains the original var reference syntax).
cases := []struct {
orig string
expanded string
want bool // true = unresolved (function returns true), false = resolved
}{
// Empty output: function conservatively returns true — it cannot distinguish
// "var was set to empty" from "var was not found and stripped". The test
// documents this design choice; callers who need empty=resolved should
// pre-process the output before calling hasUnresolvedVarRef.
{"${VAR}", "", true},
{"${VAR}", "value", false}, // var replaced
{"$VAR", "value", false}, // bare var replaced
{"prefix${VAR}suffix", "prefixvaluesuffix", false},
{"${A}${B}", "ab", false},
// FOO=FOO and BAR=BAR — both vars found and replaced. Expanded output
// "FOO and BAR" has no ${...} syntax left, so function returns false.
{"${FOO} and ${BAR}", "FOO and BAR", false},
}
for _, tc := range cases {
t.Run(tc.orig, func(t *testing.T) {
got := hasUnresolvedVarRef(tc.orig, tc.expanded)
if got != tc.want {
t.Errorf("hasUnresolvedVarRef(%q, %q): got %v, want %v", tc.orig, tc.expanded, got, tc.want)
}
})
}
}
func TestHasUnresolvedVarRef_Unresolved(t *testing.T) {
// Expansion left the refs intact → unresolved.
cases := []struct {
orig string
expanded string
}{
{"${VAR}", "${VAR}"}, // untouched
{"$VAR", "$VAR"}, // bare untouched
{"prefix${VAR}suffix", "prefix${VAR}suffix"},
{"${A}${B}", "${A}${B}"}, // both unresolved
{"${FOO}", ""}, // empty result with var ref in original
}
for _, tc := range cases {
t.Run(tc.orig, func(t *testing.T) {
if !hasUnresolvedVarRef(tc.orig, tc.expanded) {
t.Errorf("hasUnresolvedVarRef(%q, %q): expected true, got false", tc.orig, tc.expanded)
}
})
}
}
// ── expandWithEnv ─────────────────────────────────────────────────────────────
func TestExpandWithEnv_Basic(t *testing.T) {
env := map[string]string{"FOO": "bar", "BAZ": "qux"}
cases := []struct {
input string
want string
}{
{"", ""},
{"no vars", "no vars"},
{"${FOO}", "bar"},
{"$FOO", "bar"},
{"prefix${FOO}suffix", "prefixbarsuffix"},
{"${FOO}${BAZ}", "barqux"},
{"${MISSING}", ""}, // not in env, not in os env → empty
}
for _, tc := range cases {
t.Run(tc.input, func(t *testing.T) {
got := expandWithEnv(tc.input, env)
if got != tc.want {
t.Errorf("expandWithEnv(%q, %v) = %q, want %q", tc.input, env, got, tc.want)
}
})
}
}
// ── mergeCategoryRouting ─────────────────────────────────────────────────────
func TestMergeCategoryRouting_EmptyInputs(t *testing.T) {
// Both empty → empty
r := mergeCategoryRouting(nil, nil)
if len(r) != 0 {
t.Errorf("mergeCategoryRouting(nil, nil): got %v, want empty", r)
}
r = mergeCategoryRouting(map[string][]string{}, map[string][]string{})
if len(r) != 0 {
t.Errorf("mergeCategoryRouting({}, {}): got %v, want empty", r)
}
}
func TestMergeCategoryRouting_DefaultsOnly(t *testing.T) {
defaults := map[string][]string{
"security": {"Backend Engineer", "DevOps"},
"ui": {"Frontend Engineer"},
"data": {"Data Engineer"},
}
r := mergeCategoryRouting(defaults, nil)
if len(r) != 3 {
t.Errorf("got %d keys, want 3", len(r))
}
if len(r["security"]) != 2 {
t.Errorf("security roles: got %v, want 2", r["security"])
}
}
func TestMergeCategoryRouting_WorkspaceOverrides(t *testing.T) {
defaults := map[string][]string{
"security": {"Backend Engineer", "DevOps"},
"ui": {"Frontend Engineer"},
}
ws := map[string][]string{
"security": {"SRE Team"}, // narrows
"ui": {}, // drops
"infra": {"Platform Team"}, // adds
}
r := mergeCategoryRouting(defaults, ws)
if len(r["security"]) != 1 || r["security"][0] != "SRE Team" {
t.Errorf("security: got %v, want [SRE Team]", r["security"])
}
if _, ok := r["ui"]; ok {
t.Errorf("ui should be dropped, got %v", r["ui"])
}
if len(r["infra"]) != 1 || r["infra"][0] != "Platform Team" {
t.Errorf("infra: got %v, want [Platform Team]", r["infra"])
}
}
func TestMergeCategoryRouting_EmptyListDrops(t *testing.T) {
defaults := map[string][]string{"foo": {"A", "B"}}
ws := map[string][]string{"foo": {}}
r := mergeCategoryRouting(defaults, ws)
if _, ok := r["foo"]; ok {
t.Errorf("foo with empty ws list: should be dropped, got %v", r["foo"])
}
}
func TestMergeCategoryRouting_EmptyKeySkipped(t *testing.T) {
defaults := map[string][]string{"": {"Role"}}
ws := map[string][]string{"": {}}
r := mergeCategoryRouting(defaults, ws)
if _, ok := r[""]; ok {
t.Errorf("empty key should be skipped, got %v", r[""])
}
}
// ── renderCategoryRoutingYAML ────────────────────────────────────────────────
func TestRenderCategoryRoutingYAML_Empty(t *testing.T) {
out, err := renderCategoryRoutingYAML(nil)
if err != nil {
t.Fatalf("unexpected error: %v", err)
}
if out != "" {
t.Errorf("got %q, want empty string", out)
}
out, err = renderCategoryRoutingYAML(map[string][]string{})
if err != nil {
t.Fatalf("unexpected error: %v", err)
}
if out != "" {
t.Errorf("got %q, want empty string", out)
}
}
func TestRenderCategoryRoutingYAML_StableOrdering(t *testing.T) {
// Keys are sorted so output is deterministic regardless of map iteration order.
m := map[string][]string{
"zebra": {"A"},
"alpha": {"B"},
"middle": {"C"},
}
out, err := renderCategoryRoutingYAML(m)
if err != nil {
t.Fatalf("unexpected error: %v", err)
}
// alpha must come before middle, which must come before zebra
ai := 0
zi := 0
mi := 0
for i, c := range out {
switch {
case c == 'a' && i < len(out)-5 && out[i:i+5] == "alpha":
ai = i
case c == 'z' && i < len(out)-5 && out[i:i+5] == "zebra":
zi = i
case c == 'm' && i < len(out)-6 && out[i:i+6] == "middle":
mi = i
}
}
if ai <= 0 || zi <= 0 || mi <= 0 {
t.Fatalf("could not locate all keys in output: %s", out)
}
if !(ai < mi && mi < zi) {
t.Errorf("keys not sorted: alpha=%d middle=%d zebra=%d, output:\n%s", ai, mi, zi, out)
}
}
func TestRenderCategoryRoutingYAML_SpecialCharsEscaped(t *testing.T) {
// YAML library should escape characters that need quoting.
m := map[string][]string{
"key:with:colons": {"Role: Admin"},
"key with space": {"Role"},
}
out, err := renderCategoryRoutingYAML(m)
if err != nil {
t.Fatalf("unexpected error: %v", err)
}
// The output must be valid YAML (yaml.Marshal handles quoting).
// The key with colons should appear quoted in the output.
if out == "" {
t.Error("output is empty")
}
}
// ── appendYAMLBlock ───────────────────────────────────────────────────────────
func TestAppendYAMLBlock_NoExisting(t *testing.T) {
got := appendYAMLBlock(nil, "key: value")
if string(got) != "key: value" {
t.Errorf("got %q, want 'key: value'", string(got))
}
}
func TestAppendYAMLBlock_EmptyBlock(t *testing.T) {
// When existing lacks a trailing \n, the function adds one before appending
// the empty block — so the result always has a clean terminator.
got := appendYAMLBlock([]byte("existing: data"), "")
want := "existing: data\n"
if string(got) != want {
t.Errorf("got %q, want %q", string(got), want)
}
}
func TestAppendYAMLBlock_AppendsWithNewline(t *testing.T) {
existing := []byte("key: value")
block := "new: entry"
got := appendYAMLBlock(existing, block)
want := "key: value\nnew: entry"
if string(got) != want {
t.Errorf("got %q, want %q", string(got), want)
}
}
func TestAppendYAMLBlock_AlreadyEndsWithNewline(t *testing.T) {
existing := []byte("key: value\n")
block := "new: entry"
got := appendYAMLBlock(existing, block)
want := "key: value\nnew: entry"
if string(got) != want {
t.Errorf("got %q, want %q", string(got), want)
}
}
// ── mergePlugins ─────────────────────────────────────────────────────────────
func TestMergePlugins_EmptyInputs(t *testing.T) {
r := mergePlugins(nil, nil)
if len(r) != 0 {
t.Errorf("got %v, want []", r)
}
r = mergePlugins([]string{}, []string{})
if len(r) != 0 {
t.Errorf("got %v, want []", r)
}
}
func TestMergePlugins_BasicMerge(t *testing.T) {
defaults := []string{"plugin-a", "plugin-b"}
ws := []string{"plugin-b", "plugin-c"}
r := mergePlugins(defaults, ws)
// defaults first, ws appended, b deduplicated
if len(r) != 3 {
t.Errorf("got %v, want 3 items", r)
}
if r[0] != "plugin-a" || r[1] != "plugin-b" || r[2] != "plugin-c" {
t.Errorf("got %v, want [a, b, c]", r)
}
}
func TestMergePlugins_ExcludeWithBang(t *testing.T) {
defaults := []string{"plugin-a", "plugin-b", "plugin-c"}
ws := []string{"!plugin-b"}
r := mergePlugins(defaults, ws)
if len(r) != 2 {
t.Errorf("got %v, want 2 items", r)
}
if r[0] != "plugin-a" || r[1] != "plugin-c" {
t.Errorf("got %v, want [a, c]", r)
}
}
func TestMergePlugins_ExcludeWithDash(t *testing.T) {
defaults := []string{"plugin-a", "plugin-b", "plugin-c"}
ws := []string{"-plugin-b"}
r := mergePlugins(defaults, ws)
if len(r) != 2 || r[0] != "plugin-a" || r[1] != "plugin-c" {
t.Errorf("got %v, want [a, c]", r)
}
}
func TestMergePlugins_ExcludeNonexistent(t *testing.T) {
defaults := []string{"plugin-a", "plugin-b"}
ws := []string{"!plugin-c"} // c not present
r := mergePlugins(defaults, ws)
if len(r) != 2 {
t.Errorf("got %v, want 2 items", r)
}
}
func TestMergePlugins_ExcludeEmptyTarget(t *testing.T) {
defaults := []string{"plugin-a", "plugin-b"}
ws := []string{"!"}
r := mergePlugins(defaults, ws)
if len(r) != 2 {
t.Errorf("got %v, want 2 items", r)
}
}
func TestMergePlugins_EmptyPlugin(t *testing.T) {
defaults := []string{"", "plugin-a", ""}
ws := []string{"plugin-b", ""}
r := mergePlugins(defaults, ws)
if len(r) != 2 {
t.Errorf("got %v, want 2 items", r)
}
}
@@ -0,0 +1,191 @@
package handlers
import (
"errors"
"os"
"testing"
"github.com/stretchr/testify/assert"
)
// walkOrgWorkspaceNames tests — recursive collection of non-empty workspace names.
func TestWalkOrgWorkspaceNames_EmptySlice(t *testing.T) {
var names []string
walkOrgWorkspaceNames([]OrgWorkspace{}, &names)
assert.Empty(t, names)
}
func TestWalkOrgWorkspaceNames_SingleNode(t *testing.T) {
var names []string
walkOrgWorkspaceNames([]OrgWorkspace{{Name: "my-workspace"}}, &names)
assert.Equal(t, []string{"my-workspace"}, names)
}
func TestWalkOrgWorkspaceNames_SingleNodeEmptyName(t *testing.T) {
var names []string
walkOrgWorkspaceNames([]OrgWorkspace{{Name: ""}}, &names)
assert.Empty(t, names)
}
func TestWalkOrgWorkspaceNames_NestedChildren(t *testing.T) {
var names []string
tree := []OrgWorkspace{
{
Name: "parent",
Children: []OrgWorkspace{
{Name: "child-a"},
{Name: "child-b"},
},
},
}
walkOrgWorkspaceNames(tree, &names)
assert.Equal(t, []string{"parent", "child-a", "child-b"}, names)
}
func TestWalkOrgWorkspaceNames_DeeplyNested(t *testing.T) {
var names []string
tree := []OrgWorkspace{
{
Name: "level0",
Children: []OrgWorkspace{
{
Name: "level1",
Children: []OrgWorkspace{
{
Name: "level2",
Children: []OrgWorkspace{
{Name: "level3"},
},
},
},
},
},
},
}
walkOrgWorkspaceNames(tree, &names)
assert.Equal(t, []string{"level0", "level1", "level2", "level3"}, names)
}
func TestWalkOrgWorkspaceNames_SkipsEmptyNames(t *testing.T) {
var names []string
tree := []OrgWorkspace{
{Name: "a"},
{Name: ""},
{Name: "b"},
}
walkOrgWorkspaceNames(tree, &names)
assert.Equal(t, []string{"a", "b"}, names)
}
func TestWalkOrgWorkspaceNames_Siblings(t *testing.T) {
var names []string
tree := []OrgWorkspace{
{Name: "team"},
{Name: "alpha"},
{Name: "beta"},
}
walkOrgWorkspaceNames(tree, &names)
assert.Equal(t, []string{"team", "alpha", "beta"}, names)
}
func TestWalkOrgWorkspaceNames_MultipleRoots(t *testing.T) {
var names []string
tree := []OrgWorkspace{
{Name: "root-a", Children: []OrgWorkspace{{Name: "child-a"}}},
{Name: "root-b", Children: []OrgWorkspace{{Name: "child-b"}}},
}
walkOrgWorkspaceNames(tree, &names)
assert.Equal(t, []string{"root-a", "child-a", "root-b", "child-b"}, names)
}
func TestWalkOrgWorkspaceNames_SpawningFalseStillWalks(t *testing.T) {
// The comment in the source is explicit: spawning:false subtrees are
// still walked. Empty names within those subtrees are still skipped.
var names []string
yes := true
no := false
tree := []OrgWorkspace{
{
Name: "parent",
Children: []OrgWorkspace{
{Name: "spawning-child", Spawning: &yes},
{Name: "non-spawning-child", Spawning: &no},
{Name: ""},
},
},
}
walkOrgWorkspaceNames(tree, &names)
assert.Equal(t, []string{"parent", "spawning-child", "non-spawning-child"}, names)
}
// resolveProvisionConcurrency tests — env-var parsing with sensible fallback.
func TestResolveProvisionConcurrency_Default(t *testing.T) {
os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
defer os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
val := resolveProvisionConcurrency()
assert.Equal(t, defaultProvisionConcurrency, val)
}
func TestResolveProvisionConcurrency_ValidPositiveInt(t *testing.T) {
os.Setenv("MOLECULE_PROVISION_CONCURRENCY", "5")
defer os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
val := resolveProvisionConcurrency()
assert.Equal(t, 5, val)
}
func TestResolveProvisionConcurrency_ZeroUnlimited(t *testing.T) {
os.Setenv("MOLECULE_PROVISION_CONCURRENCY", "0")
defer os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
val := resolveProvisionConcurrency()
// Zero is mapped to 1<<20 (unlimited semantics with finite cap)
assert.Equal(t, 1<<20, val)
}
func TestResolveProvisionConcurrency_NegativeFallsBack(t *testing.T) {
os.Setenv("MOLECULE_PROVISION_CONCURRENCY", "-1")
defer os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
val := resolveProvisionConcurrency()
assert.Equal(t, defaultProvisionConcurrency, val)
}
func TestResolveProvisionConcurrency_NonIntegerFallsBack(t *testing.T) {
os.Setenv("MOLECULE_PROVISION_CONCURRENCY", "not-a-number")
defer os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
val := resolveProvisionConcurrency()
assert.Equal(t, defaultProvisionConcurrency, val)
}
func TestResolveProvisionConcurrency_WhitespaceOnly(t *testing.T) {
os.Setenv("MOLECULE_PROVISION_CONCURRENCY", " ")
defer os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
val := resolveProvisionConcurrency()
assert.Equal(t, defaultProvisionConcurrency, val)
}
func TestResolveProvisionConcurrency_LargeValue(t *testing.T) {
os.Setenv("MOLECULE_PROVISION_CONCURRENCY", "10000")
defer os.Unsetenv("MOLECULE_PROVISION_CONCURRENCY")
val := resolveProvisionConcurrency()
assert.Equal(t, 10000, val)
}
// errString tests — nil-safe error-to-string wrapper.
func TestErrString_NilError(t *testing.T) {
result := errString(nil)
assert.Equal(t, "", result)
}
func TestErrString_WithError(t *testing.T) {
err := errors.New("something went wrong")
result := errString(err)
assert.Equal(t, "something went wrong", result)
}
func TestErrString_EmptyError(t *testing.T) {
err := errors.New("")
result := errString(err)
assert.Equal(t, "", result)
}
@@ -487,11 +487,13 @@ func (h *OrgHandler) createWorkspaceTree(ws OrgWorkspace, parentID *string, absX
// timeout (caught 2026-05-08 right after dev-only org/import).
loadPersonaEnvFile(ws.FilesDir, envVars)
if orgBaseDir != "" {
// 1. Org root .env (shared defaults)
parseEnvFile(filepath.Join(orgBaseDir, ".env"), envVars)
// 2. Workspace-specific .env (overrides)
if ws.FilesDir != "" {
parseEnvFile(filepath.Join(orgBaseDir, ws.FilesDir, ".env"), envVars)
// Load org root and workspace-specific .env files. loadWorkspaceEnv
// applies resolveInsideRoot to ws.FilesDir, closing the CWE-22 /
// mc#786 path-traversal regression introduced when the guard was
// dropped from createWorkspaceTree.
workspaceEnv := loadWorkspaceEnv(orgBaseDir, ws.FilesDir)
for k, v := range workspaceEnv {
envVars[k] = v // workspace-specific overrides org root
}
}
// Store as workspace secrets via DB (encrypted if key is set, raw otherwise)
@@ -0,0 +1,294 @@
package handlers
import "testing"
// Tests for the pure layout helpers in org.go:
// childSlot, sizeOfSubtree, childSlotInGrid. These compute the canvas
// grid positions for org-import workspace trees and mirror the TypeScript
// layout functions in canvas-topology.ts (defaultChildSlot, parentMinSize,
// childSlotInGrid). The two sides use slightly different default sizes
// (Go: 240×130, TS: 210×120) so they are tested independently.
// childSlot — 2-column fixed-size grid, one row of child cards.
func TestChildSlot_ZeroIndex(t *testing.T) {
x, y := childSlot(0)
// col=0, row=0
// x = 16 + 0*(240+14) = 16
// y = 130 + 0*(130+14) = 130
if x != 16.0 {
t.Errorf("slot 0 x: got %v, want 16.0", x)
}
if y != 130.0 {
t.Errorf("slot 0 y: got %v, want 130.0", y)
}
}
func TestChildSlot_SecondColumn(t *testing.T) {
x, y := childSlot(1)
// col=1, row=0
// x = 16 + 1*(240+14) = 16+254 = 270
// y = 130
if x != 270.0 {
t.Errorf("slot 1 x: got %v, want 270.0", x)
}
if y != 130.0 {
t.Errorf("slot 1 y: got %v, want 130.0", y)
}
}
func TestChildSlot_SecondRow(t *testing.T) {
x, y := childSlot(2)
// col=0, row=1
// x = 16
// y = 130 + 1*(130+14) = 130+144 = 274
if x != 16.0 {
t.Errorf("slot 2 x: got %v, want 16.0", x)
}
if y != 274.0 {
t.Errorf("slot 2 y: got %v, want 274.0", y)
}
}
func TestChildSlot_ThirdRowFirstColumn(t *testing.T) {
x, y := childSlot(4)
// col=0, row=2
// x = 16
// y = 130 + 2*(130+14) = 130+288 = 418
if x != 16.0 {
t.Errorf("slot 4 x: got %v, want 16.0", x)
}
if y != 418.0 {
t.Errorf("slot 4 y: got %v, want 418.0", y)
}
}
// sizeOfSubtree — bounding-box computation for org-import layout.
func TestSizeOfSubtree_Leaf(t *testing.T) {
ws := OrgWorkspace{Name: "leaf"}
s := sizeOfSubtree(ws)
// Leaf → childDefaultWidth × childDefaultHeight
if s.width != 240.0 {
t.Errorf("leaf width: got %v, want 240.0", s.width)
}
if s.height != 130.0 {
t.Errorf("leaf height: got %v, want 130.0", s.height)
}
}
func TestSizeOfSubtree_OneChild(t *testing.T) {
ws := OrgWorkspace{Name: "parent", Children: []OrgWorkspace{{Name: "child"}}}
s := sizeOfSubtree(ws)
// 1 child → cols=1, rows=1
// child subtree = (240, 130)
// width = 16*2 + 240*1 + 14*0 = 272
// height = 130 + 130 + 14*0 + 16 = 276
if s.width != 272.0 {
t.Errorf("1-child width: got %v, want 272.0", s.width)
}
if s.height != 276.0 {
t.Errorf("1-child height: got %v, want 276.0", s.height)
}
}
func TestSizeOfSubtree_TwoChildren(t *testing.T) {
ws := OrgWorkspace{Name: "parent", Children: []OrgWorkspace{
{Name: "c0"}, {Name: "c1"},
}}
s := sizeOfSubtree(ws)
// 2 children → cols=2, rows=1
// maxColW = 240, totalRowH = 130
// width = 16*2 + 240*2 + 14*1 = 32+480+14 = 526
// height = 130 + 130 + 14*0 + 16 = 276
if s.width != 526.0 {
t.Errorf("2-child width: got %v, want 526.0", s.width)
}
if s.height != 276.0 {
t.Errorf("2-child height: got %v, want 276.0", s.height)
}
}
func TestSizeOfSubtree_ThreeChildren(t *testing.T) {
ws := OrgWorkspace{Name: "parent", Children: []OrgWorkspace{
{Name: "c0"}, {Name: "c1"}, {Name: "c2"},
}}
s := sizeOfSubtree(ws)
// 3 children → cols=2 (< 3 so capped at 2), rows=2
// each child = (240, 130), maxColW=240, rowHeights=[130,130]
// totalRowH = 130+130 = 260
// width = 16*2 + 240*2 + 14*1 = 526
// height = 130 + 260 + 14*1 + 16 = 420
if s.width != 526.0 {
t.Errorf("3-child width: got %v, want 526.0", s.width)
}
if s.height != 420.0 {
t.Errorf("3-child height: got %v, want 420.0", s.height)
}
}
func TestSizeOfSubtree_FourChildren(t *testing.T) {
ws := OrgWorkspace{Name: "parent", Children: []OrgWorkspace{
{Name: "c0"}, {Name: "c1"}, {Name: "c2"}, {Name: "c3"},
}}
s := sizeOfSubtree(ws)
// 4 children → cols=2, rows=2
// width = 16*2 + 240*2 + 14*1 = 526
// height = 130 + 260 + 14*1 + 16 = 420
if s.width != 526.0 {
t.Errorf("4-child width: got %v, want 526.0", s.width)
}
if s.height != 420.0 {
t.Errorf("4-child height: got %v, want %v", s.height, 420.0)
}
}
func TestSizeOfSubtree_FiveChildren(t *testing.T) {
ws := OrgWorkspace{Name: "parent", Children: []OrgWorkspace{
{Name: "c0"}, {Name: "c1"}, {Name: "c2"}, {Name: "c3"}, {Name: "c4"},
}}
s := sizeOfSubtree(ws)
// 5 children → cols=2, rows=3
// rowHeights = [130, 130, 130], totalRowH = 390
// width = 16*2 + 240*2 + 14*1 = 526
// height = 130 + 390 + 14*2 + 16 = 564
if s.width != 526.0 {
t.Errorf("5-child width: got %v, want 526.0", s.width)
}
if s.height != 564.0 {
t.Errorf("5-child height: got %v, want 564.0", s.height)
}
}
func TestSizeOfSubtree_NestedTree(t *testing.T) {
// Grandparent → [Parent(→ child), leaf]
// parent subtree (1 child): width=272, height=276
// grandparent:
// children = [parent, leaf]
// maxColW = max(272, 240) = 272
// cols=2, rows=1
// width = 16*2 + 272*2 + 14*1 = 590
// height = 130 + max(276, 130) + 14*0 + 16 = 422
parent := OrgWorkspace{Name: "parent", Children: []OrgWorkspace{{Name: "grandchild"}}}
ws := OrgWorkspace{Name: "grandparent", Children: []OrgWorkspace{parent, {Name: "leaf"}}}
s := sizeOfSubtree(ws)
if s.width != 590.0 {
t.Errorf("nested width: got %v, want 590.0", s.width)
}
if s.height != 422.0 {
t.Errorf("nested height: got %v, want 422.0", s.height)
}
}
// childSlotInGrid — sibling-aware slot computation; taller siblings push
// subsequent rows down without displacing the column grid.
func TestChildSlotInGrid_EmptySiblings(t *testing.T) {
x, y := childSlotInGrid(0, nil)
x2, y2 := childSlotInGrid(0, []nodeSize{})
// Both nil and empty slice return the top-left padded origin.
got1, got2 := struct{ x, y float64 }{x, y}, struct{ x, y float64 }{x2, y2}
for _, g := range []struct{ x, y float64 }{got1, got2} {
if g.x != 16.0 || g.y != 130.0 {
t.Errorf("empty siblings: got (%.0f, %.0f), want (16, 130)", g.x, g.y)
}
}
}
func TestChildSlotInGrid_Slot0MatchesDefaultChildSlot(t *testing.T) {
// With uniform 240×130 siblings, slot 0 should equal childSlot(0).
sizes := []nodeSize{{width: 240, height: 130}, {width: 240, height: 130}}
x, y := childSlotInGrid(0, sizes)
cx, cy := childSlot(0)
if x != cx || y != cy {
t.Errorf("uniform siblings slot 0: got (%.0f, %.0f), want childSlot (%.0f, %.0f)", x, y, cx, cy)
}
}
func TestChildSlotInGrid_Slot1MatchesDefaultChildSlot(t *testing.T) {
sizes := []nodeSize{{width: 240, height: 130}, {width: 240, height: 130}}
x, y := childSlotInGrid(1, sizes)
cx, cy := childSlot(1)
if x != cx || y != cy {
t.Errorf("uniform siblings slot 1: got (%.0f, %.0f), want childSlot (%.0f, %.0f)", x, y, cx, cy)
}
}
func TestChildSlotInGrid_TallerSiblingBumpsNextRow(t *testing.T) {
// Sibling at index 1 is taller (height=300 vs 130).
// Slot 0: col=0, row=0 → x=16, y=130
// Slot 1: col=1, row=0 → x=270, y=130
// Slot 2: col=0, row=1 → x=16, y = 130 + 300 + 14 = 444
sizes := []nodeSize{
{width: 240, height: 130},
{width: 240, height: 300}, // taller — pushes row 2 down
{width: 240, height: 130},
}
x0, y0 := childSlotInGrid(0, sizes)
if x0 != 16.0 || y0 != 130.0 {
t.Errorf("slot 0: got (%.0f, %.0f), want (16, 130)", x0, y0)
}
x1, y1 := childSlotInGrid(1, sizes)
if x1 != 270.0 || y1 != 130.0 {
t.Errorf("slot 1: got (%.0f, %.0f), want (270, 130)", x1, y1)
}
x2, y2 := childSlotInGrid(2, sizes)
// y = parentHeaderPadding + rowHeights[0] + childGutter
// rowHeights[0] = max(130, 300) = 300
// y = 130 + 300 + 14 = 444
if x2 != 16.0 || y2 != 444.0 {
t.Errorf("slot 2: got (%.0f, %.0f), want (16, 444) — taller sibling pushed row down", x2, y2)
}
}
func TestChildSlotInGrid_UniformWideSiblingSetsColumnWidth(t *testing.T) {
// Sibling at index 0 is wider (300 vs 240).
// Slot 0: x=16, y=130
// Slot 1: col=1 → x = 16 + 300 + 14 = 330 (NOT 270 = 16+240+14)
// y=130
sizes := []nodeSize{
{width: 300, height: 130}, // wider — sets column width
{width: 240, height: 130},
}
x1, y1 := childSlotInGrid(1, sizes)
if x1 != 330.0 || y1 != 130.0 {
t.Errorf("slot 1: got (%.0f, %.0f), want (330, 130) — col width set by wider sibling", x1, y1)
}
}
func TestChildSlotInGrid_Slot3OverflowToSecondRow(t *testing.T) {
// 4 siblings in 2-column grid → rows=2
// Slot 0: col=0, row=0
// Slot 1: col=1, row=0
// Slot 2: col=0, row=1
// Slot 3: col=1, row=1
sizes := []nodeSize{
{width: 240, height: 130},
{width: 240, height: 130},
{width: 240, height: 130},
{width: 240, height: 130},
}
x3, y3 := childSlotInGrid(3, sizes)
// y = 130 + 130 + 14 = 274
if x3 != 270.0 || y3 != 274.0 {
t.Errorf("slot 3: got (%.0f, %.0f), want (270, 274)", x3, y3)
}
}
func TestChildSlotInGrid_MixedSizesCorrectRowAccumulation(t *testing.T) {
// 3 siblings: [short(130), tall(300), medium(200)]
// cols=2, rows=2
// rowHeights[0] = max(130, 300) = 300
// rowHeights[1] = max(200, 0) = 200
// slot 0: col=0, row=0 → x=16, y=130
// slot 1: col=1, row=0 → x=330, y=130
// slot 2: col=0, row=1 → x=16, y=130+300+14=444
sizes := []nodeSize{
{width: 240, height: 130},
{width: 240, height: 300},
{width: 240, height: 200},
}
x2, y2 := childSlotInGrid(2, sizes)
if x2 != 16.0 || y2 != 444.0 {
t.Errorf("slot 2: got (%.0f, %.0f), want (16, 444)", x2, y2)
}
}
+3 -33
View File
@@ -354,39 +354,9 @@ func TestExpandWithEnv_UnsetVar(t *testing.T) {
}
}
func TestHasUnresolvedVarRef_NoVars(t *testing.T) {
if hasUnresolvedVarRef("plain text", "plain text") {
t.Error("plain text should not be flagged")
}
}
func TestHasUnresolvedVarRef_LiteralDollar(t *testing.T) {
// "$5" is a literal price, not a var ref — should NOT be flagged
if hasUnresolvedVarRef("price: $5", "price: $5") {
t.Error("literal $5 should not be flagged as unresolved")
}
}
func TestHasUnresolvedVarRef_Resolved(t *testing.T) {
// Original had ${VAR}, expanded to "value" — fully resolved
if hasUnresolvedVarRef("${VAR}", "value") {
t.Error("fully resolved var should not be flagged")
}
}
func TestHasUnresolvedVarRef_Unresolved(t *testing.T) {
// Original had ${VAR}, expanded to "" — unresolved
if !hasUnresolvedVarRef("${VAR}", "") {
t.Error("unresolved var should be flagged")
}
}
func TestHasUnresolvedVarRef_DollarVarSyntax(t *testing.T) {
// $VAR syntax (no braces) — also a real ref
if !hasUnresolvedVarRef("$MISSING_VAR", "") {
t.Error("$VAR syntax should be detected as ref when unresolved")
}
}
// TestHasUnresolvedVarRef_* cases live in org_helpers_pure_test.go to keep
// pure-helper tests in their own file. Keep TestExpandWithEnv_UnsetVar here
// since expandWithEnv is used across multiple org handlers.
func eqStringSlice(a, b []string) bool {
if len(a) != len(b) {
@@ -0,0 +1,80 @@
package handlers
import (
"testing"
"github.com/stretchr/testify/assert"
)
// supportsRuntime tests — plugin runtime compatibility checking.
func TestSupportsRuntime_EmptyRuntimes(t *testing.T) {
// Empty runtimes = unspecified, try it → always compatible.
info := pluginInfo{Name: "test", Runtimes: nil}
assert.True(t, info.supportsRuntime("claude_code"))
assert.True(t, info.supportsRuntime("any_runtime"))
}
func TestSupportsRuntime_ExactMatch(t *testing.T) {
info := pluginInfo{Name: "test", Runtimes: []string{"claude_code", "anthropic"}}
assert.True(t, info.supportsRuntime("claude_code"))
assert.True(t, info.supportsRuntime("anthropic"))
}
func TestSupportsRuntime_NoMatch(t *testing.T) {
info := pluginInfo{Name: "test", Runtimes: []string{"claude_code"}}
assert.False(t, info.supportsRuntime("openai"))
}
func TestSupportsRuntime_HyphenUnderscoreNormalized(t *testing.T) {
// "claude-code" and "claude_code" are considered equal.
info := pluginInfo{Name: "test", Runtimes: []string{"claude-code"}}
assert.True(t, info.supportsRuntime("claude_code"))
assert.True(t, info.supportsRuntime("anthropic_claude"))
}
func TestSupportsRuntime_HyphenVsUnderscoreReverse(t *testing.T) {
// Plugin declares underscore form; runtime uses hyphen.
info := pluginInfo{Name: "test", Runtimes: []string{"claude_code"}}
assert.True(t, info.supportsRuntime("claude-code"))
}
func TestSupportsRuntime_EmptyStringRuntime(t *testing.T) {
info := pluginInfo{Name: "test", Runtimes: []string{"claude_code"}}
// Empty runtime string: should not match any plugin.
assert.False(t, info.supportsRuntime(""))
}
func TestSupportsRuntime_SingleRuntimeMatch(t *testing.T) {
// Multiple declared runtimes: only matching one is sufficient.
info := pluginInfo{Name: "test", Runtimes: []string{"python", "nodejs", "claude_code"}}
assert.True(t, info.supportsRuntime("claude_code"))
assert.False(t, info.supportsRuntime("ruby"))
}
func TestSupportsRuntime_AllHyphenForms(t *testing.T) {
// Both plugin and runtime use hyphen form.
info := pluginInfo{Name: "test", Runtimes: []string{"claude-code"}}
assert.True(t, info.supportsRuntime("claude-code"))
}
func TestSupportsRuntime_MultipleHyphenNormalization(t *testing.T) {
// Mixed hyphen/underscore forms normalize to the same.
info := pluginInfo{Name: "test", Runtimes: []string{"some-runtime-name"}}
assert.True(t, info.supportsRuntime("some_runtime_name"))
assert.True(t, info.supportsRuntime("some-runtime-name"))
}
func TestSupportsRuntime_EmptyPluginRuntimesWithAnyInput(t *testing.T) {
// Empty Runtimes on plugin = try it regardless of runtime.
info := pluginInfo{Name: "test", Runtimes: []string{}}
assert.True(t, info.supportsRuntime(""))
assert.True(t, info.supportsRuntime("any"))
assert.True(t, info.supportsRuntime("unknown"))
}
func TestSupportsRuntime_ZeroLengthRuntimes(t *testing.T) {
// Empty slice vs nil: both should be treated as "unspecified".
info := pluginInfo{Name: "test"}
assert.True(t, info.supportsRuntime("anything"))
}
@@ -0,0 +1,165 @@
package handlers
// workspace_crud_helpers_test.go — tests for pure-logic helpers in workspace_crud.go.
//
// Covered helpers:
// validateWorkspaceDir — bind-mount path safety (CWE-22 defence-in-depth)
import "testing"
// ─────────────────────────────────────────────────────────────────────────────
// validateWorkspaceDir
// ─────────────────────────────────────────────────────────────────────────────
func TestValidateWorkspaceDir_AcceptsValidAbsolutePath(t *testing.T) {
cases := []string{
"/home/ubuntu/workspace",
"/opt/myapp/data",
"/tmp/molecule-workspace",
"/Users/admin/workspace",
"/workspace",
"/mnt/volumes/data",
"/srv/molecule",
"/nix/store",
}
for _, dir := range cases {
err := validateWorkspaceDir(dir)
if err != nil {
t.Errorf("validateWorkspaceDir(%q) returned error: %v; want nil", dir, err)
}
}
}
func TestValidateWorkspaceDir_RejectsRelativePath(t *testing.T) {
cases := []string{
"relative/path",
"./local",
"../sibling",
"workspace",
"",
}
for _, dir := range cases {
err := validateWorkspaceDir(dir)
if err == nil {
t.Errorf("validateWorkspaceDir(%q) = nil; want error (relative path)", dir)
}
}
}
func TestValidateWorkspaceDir_RejectsTraversalSequence(t *testing.T) {
cases := []string{
"/etc/../../../etc/passwd",
"/home/user/../../root",
"/workspace/../../../sibling",
"/foo/bar/..%2f..%2fetc",
"/valid/../etc/passwd",
}
for _, dir := range cases {
err := validateWorkspaceDir(dir)
if err == nil {
t.Errorf("validateWorkspaceDir(%q) = nil; want error (traversal)", dir)
}
}
}
func TestValidateWorkspaceDir_RejectsSystemPaths(t *testing.T) {
// System paths must be rejected outright — a workspace binding /etc or
// /proc would let the agent read host secrets or inspect kernel state.
systemPaths := []string{
"/etc",
"/var",
"/proc",
"/sys",
"/dev",
"/boot",
"/sbin",
"/bin",
"/usr",
}
for _, dir := range systemPaths {
err := validateWorkspaceDir(dir)
if err == nil {
t.Errorf("validateWorkspaceDir(%q) = nil; want error (system path)", dir)
}
}
}
func TestValidateWorkspaceDir_RejectsDescendantsOfSystemPaths(t *testing.T) {
// A descendant of a system path must also be rejected — /etc/shadow,
// /proc/1/cmdline, /dev/null all fall in this category.
descendants := []string{
"/etc/passwd",
"/etc/shadow",
"/etc/ssh/sshd_config",
"/var/log/syslog",
"/proc/self/environ",
"/sys/kernel/version",
"/dev/null",
"/boot/grub/grub.cfg",
"/sbin/init",
"/bin/bash",
"/usr/bin/python3",
}
for _, dir := range descendants {
err := validateWorkspaceDir(dir)
if err == nil {
t.Errorf("validateWorkspaceDir(%q) = nil; want error (descendant of system path)", dir)
}
}
}
func TestValidateWorkspaceDir_AcceptsPathsSimilarToSystemPaths(t *testing.T) {
// Paths that LOOK like system paths but are NOT exact matches or
// descendants should be accepted. These are valid workspace directories.
valid := []string{
"/etcworkspace",
"/varworkspace",
"/procworkspace",
"/sysworkspace",
"/devworkspace",
"/bootworkspace",
"/sbinworkspace",
"/binworkspace",
"/usrworkspace",
"/etx", // typo of /etc but a different path
"/vartmp", // /var/tmp is different from /var
"/usrr", // typo of /usr but a different path
"/workspace/etc",
"/workspace/var",
"/home/user/etc",
"/opt/etc",
}
for _, dir := range valid {
err := validateWorkspaceDir(dir)
if err != nil {
t.Errorf("validateWorkspaceDir(%q) returned error: %v; want nil", dir, err)
}
}
}
func TestValidateWorkspaceDir_ErrorMessages(t *testing.T) {
// Error messages must be descriptive enough for operators to self-diagnose.
relErr := validateWorkspaceDir("relative")
if relErr == nil {
t.Fatal("relative path: want error, got nil")
}
if relErr.Error() == "" {
t.Error("relative path error message is empty")
}
travErr := validateWorkspaceDir("/etc/../../../etc/passwd")
if travErr == nil {
t.Fatal("traversal: want error, got nil")
}
if travErr.Error() == "" {
t.Error("traversal error message is empty")
}
sysErr := validateWorkspaceDir("/etc")
if sysErr == nil {
t.Fatal("system path: want error, got nil")
}
if sysErr.Error() == "" {
t.Error("system path error message is empty")
}
}
@@ -0,0 +1,268 @@
package handlers
import (
"testing"
)
// ── validateWorkspaceID ─────────────────────────────────────────────────────────
func TestValidateWorkspaceID_Valid(t *testing.T) {
cases := []string{
"550e8400-e29b-41d4-a716-446655440000",
"00000000-0000-0000-0000-000000000000",
"ffffffff-ffff-ffff-ffff-ffffffffffff",
}
for _, id := range cases {
t.Run(id, func(t *testing.T) {
if err := validateWorkspaceID(id); err != nil {
t.Errorf("validateWorkspaceID(%q) returned error: %v", id, err)
}
})
}
}
func TestValidateWorkspaceID_Invalid(t *testing.T) {
cases := []struct {
name string
id string
}{
{"empty", ""},
{"not a UUID", "not-a-uuid"},
{"traversal attack", "../../etc/passwd"},
{"SQL injection", "'; DROP TABLE workspaces;--"},
{"UUID too short", "550e8400-e29b-41d4-a716"},
{"UUID with invalid hex chars", "550e8400-e29b-41d4-a716-44665544000g"},
// Note: "UUID all zeros" (nil UUID) is accepted by google/uuid.Parse
// as a valid RFC 4122 nil UUID, so it passes validateWorkspaceID.
// If nil UUIDs should be rejected, validateWorkspaceID must be updated.
}
for _, tc := range cases {
t.Run(tc.name, func(t *testing.T) {
if err := validateWorkspaceID(tc.id); err == nil {
t.Errorf("validateWorkspaceID(%q): expected error, got nil", tc.id)
}
})
}
}
// ── validateWorkspaceDir ───────────────────────────────────────────────────────
func TestValidateWorkspaceDir_Valid(t *testing.T) {
cases := []string{
"/opt/molecule/workspaces/dev",
"/home/user/.molecule/workspaces",
// Note: /var/data/workspace-abc-123 is NOT in this list because
// /var is blocked as a system path prefix — /var/data is correctly
// rejected by validateWorkspaceDir. Use /tmp or /srv for non-system paths.
"/opt/services/molecule/tenant-workspaces",
"/tmp/molecule/workspaces/dev",
}
for _, dir := range cases {
t.Run(dir, func(t *testing.T) {
if err := validateWorkspaceDir(dir); err != nil {
t.Errorf("validateWorkspaceDir(%q) returned error: %v", dir, err)
}
})
}
}
func TestValidateWorkspaceDir_RelativeRejected(t *testing.T) {
cases := []string{
"relative/path",
"./myworkspace",
"~/workspaces/dev",
}
for _, dir := range cases {
t.Run(dir, func(t *testing.T) {
if err := validateWorkspaceDir(dir); err == nil {
t.Errorf("validateWorkspaceDir(%q): expected error (relative path), got nil", dir)
}
})
}
}
func TestValidateWorkspaceDir_TraversalRejected(t *testing.T) {
cases := []string{
"/opt/molecule/../../../etc",
"/workspaces/dev/../../root",
"/opt/../opt/../etc",
}
for _, dir := range cases {
t.Run(dir, func(t *testing.T) {
if err := validateWorkspaceDir(dir); err == nil {
t.Errorf("validateWorkspaceDir(%q): expected error (traversal), got nil", dir)
}
})
}
}
func TestValidateWorkspaceDir_SystemPathsRejected(t *testing.T) {
cases := []string{
"/etc",
"/etc/molecule",
"/var",
"/var/log",
"/proc",
"/proc/self",
"/sys",
"/sys/kernel",
"/dev",
"/dev/null",
"/boot",
"/sbin",
"/bin",
"/lib",
"/usr",
"/usr/local",
}
for _, dir := range cases {
t.Run(dir, func(t *testing.T) {
if err := validateWorkspaceDir(dir); err == nil {
t.Errorf("validateWorkspaceDir(%q): expected error (system path), got nil", dir)
}
})
}
}
func TestValidateWorkspaceDir_PrefixMatchesBlocked(t *testing.T) {
// The blocklist checks prefix so /etc/foo must also be rejected.
cases := []string{
"/etc/molecule-config",
"/var/log/workspace",
"/usr/local/bin",
"/usr/bin/molecule",
}
for _, dir := range cases {
t.Run(dir, func(t *testing.T) {
if err := validateWorkspaceDir(dir); err == nil {
t.Errorf("validateWorkspaceDir(%q): expected error (prefix of blocked path), got nil", dir)
}
})
}
}
// ── validateWorkspaceFields ────────────────────────────────────────────────────
func TestValidateWorkspaceFields_AllEmpty(t *testing.T) {
// All empty → valid (creation uses defaults; empty is allowed)
if err := validateWorkspaceFields("", "", "", ""); err != nil {
t.Errorf("validateWorkspaceFields with all empty: expected nil, got %v", err)
}
}
func TestValidateWorkspaceFields_Valid(t *testing.T) {
if err := validateWorkspaceFields("My Workspace", "Backend Engineer", "gpt-4o", "langgraph"); err != nil {
t.Errorf("validateWorkspaceFields with valid args: expected nil, got %v", err)
}
}
func TestValidateWorkspaceFields_NameTooLong(t *testing.T) {
longName := make([]byte, 256)
for i := range longName {
longName[i] = 'a'
}
if err := validateWorkspaceFields(string(longName), "", "", ""); err == nil {
t.Error("name > 255 chars: expected error, got nil")
}
// Exactly 255 chars is OK
validName := make([]byte, 255)
for i := range validName {
validName[i] = 'a'
}
if err := validateWorkspaceFields(string(validName), "", "", ""); err != nil {
t.Errorf("name exactly 255 chars: expected nil, got %v", err)
}
}
func TestValidateWorkspaceFields_RoleTooLong(t *testing.T) {
longRole := make([]byte, 1001)
for i := range longRole {
longRole[i] = 'x'
}
if err := validateWorkspaceFields("", string(longRole), "", ""); err == nil {
t.Error("role > 1000 chars: expected error, got nil")
}
}
func TestValidateWorkspaceFields_ModelTooLong(t *testing.T) {
longModel := make([]byte, 101)
for i := range longModel {
longModel[i] = 'x'
}
if err := validateWorkspaceFields("", "", string(longModel), ""); err == nil {
t.Error("model > 100 chars: expected error, got nil")
}
}
func TestValidateWorkspaceFields_RuntimeTooLong(t *testing.T) {
longRuntime := make([]byte, 101)
for i := range longRuntime {
longRuntime[i] = 'x'
}
if err := validateWorkspaceFields("", "", "", string(longRuntime)); err == nil {
t.Error("runtime > 100 chars: expected error, got nil")
}
}
func TestValidateWorkspaceFields_NewlineInName(t *testing.T) {
if err := validateWorkspaceFields("My\nWorkspace", "", "", ""); err == nil {
t.Error("name with \\n: expected error, got nil")
}
}
func TestValidateWorkspaceFields_CRLFInRole(t *testing.T) {
if err := validateWorkspaceFields("", "Backend\r\nEngineer", "", ""); err == nil {
t.Error("role with \\r\\n: expected error, got nil")
}
}
func TestValidateWorkspaceFields_NewlineInModel(t *testing.T) {
if err := validateWorkspaceFields("", "", "gpt-\n4o", ""); err == nil {
t.Error("model with \\n: expected error, got nil")
}
}
func TestValidateWorkspaceFields_NewlineInRuntime(t *testing.T) {
if err := validateWorkspaceFields("", "", "", "lang\rgraph"); err == nil {
t.Error("runtime with \\r: expected error, got nil")
}
}
func TestValidateWorkspaceFields_YAMLSpecialChars(t *testing.T) {
// yamlSpecialChars = "{}[]|>*&!"
// These must be rejected in name and role.
dangerous := []string{
"Workspace{evil}",
"Workspace[evil]",
"Workspace]evil[",
"Workspace|evil",
"Workspace>evil",
"Workspace*evil",
"Workspace&evil",
"Workspace!evil",
"Name{}",
"Role[]",
}
for _, v := range dangerous {
t.Run(v, func(t *testing.T) {
if err := validateWorkspaceFields(v, "", "", ""); err == nil {
t.Errorf("name %q: expected error (YAML special char), got nil", v)
}
})
}
}
func TestValidateWorkspaceFields_YAMLCharsAllowedInModelRuntime(t *testing.T) {
// YAML special chars are only blocked in name/role, not model/runtime.
if err := validateWorkspaceFields("", "", "model{}[]", "runtime*&!"); err != nil {
t.Errorf("model/runtime with YAML chars: expected nil, got %v", err)
}
}
func TestValidateWorkspaceFields_YAMLCharsAllowedInEmptyName(t *testing.T) {
// Empty name is fine; YAML char restriction is only on non-empty values.
if err := validateWorkspaceFields("", "Backend Engineer", "", ""); err != nil {
t.Errorf("empty name with valid role: expected nil, got %v", err)
}
}
@@ -24,7 +24,7 @@ func makeTestOpts(t *testing.T) *LocalBuildOptions {
RepoPrefix: "https://git.test/molecule-ai/molecule-ai-workspace-template-",
Platform: "linux/amd64",
HTTPClient: &http.Client{},
preflightLocalBuild: func() error {
checkShellDeps: func() error {
return nil // tests bypass the real PATH check
},
remoteHeadSha: func(ctx context.Context, opts *LocalBuildOptions, runtime string) (string, error) {
@@ -46,10 +46,7 @@ func makeTestOpts(t *testing.T) *LocalBuildOptions {
dockerTag: func(ctx context.Context, src, dst string) error {
return nil
},
// Stub the shell-dep pre-flight so tests run without docker/git on PATH.
checkShellDeps: func() error {
return nil
},
}
}
@@ -677,10 +674,10 @@ func TestProvisionerStartUsesLocalBuild_LocalMode(t *testing.T) {
// caught by this test.
}
// TestEnsureLocalImage_Hooks preflightLocalBuild — when preflight fails,
// TestEnsureLocalImage_Hooks checkShellDeps — when preflight fails,
func TestEnsureLocalImage_PreflightFailsIfDockerMissing(t *testing.T) {
opts := makeTestOpts(t)
opts.preflightLocalBuild = func() error {
opts.checkShellDeps = func() error {
return fmt.Errorf(
"local-build mode requires `docker` and `git` on PATH in the platform container; " +
"found: docker=<missing>, git=<missing>. " +
@@ -702,7 +699,7 @@ func TestEnsureLocalImage_PreflightFailsIfDockerMissing(t *testing.T) {
// nil, execution proceeds normally.
func TestEnsureLocalImage_PreflightOKPassesThrough(t *testing.T) {
opts := makeTestOpts(t)
opts.preflightLocalBuild = func() error { return nil }
opts.checkShellDeps = func() error { return nil }
tag, err := ensureLocalImageWithOpts(context.Background(), "claude-code", opts)
if err != nil {
t.Fatalf("unexpected error: %v", err)
+3 -1
View File
@@ -127,7 +127,9 @@ func (h *Hub) Close() {
count := len(h.clients)
for client := range h.clients {
close(client.Send)
client.Conn.Close()
if client.Conn != nil {
client.Conn.Close()
}
delete(h.clients, client)
}
log.Printf("WebSocket hub closed (%d clients disconnected)", count)
+386
View File
@@ -0,0 +1,386 @@
package ws
import (
"sync"
"testing"
"time"
"github.com/Molecule-AI/molecule-monorepo/platform/internal/models"
)
// ─── helpers ────────────────────────────────────────────────────────────────
// mockClient returns a Client with a buffered send channel of the given size
// and a nil WebSocket connection. Nil Conn is safe for our tests because we
// never call WritePump (which uses Conn) — we only test the hub's send channel
// and broadcast logic.
func mockClient(workspaceID string, bufSize int) *Client {
return &Client{
WorkspaceID: workspaceID,
Send: make(chan []byte, bufSize),
// Conn is nil — safe: WritePump (which uses Conn) is never called in tests.
}
}
// ─── NewHub ────────────────────────────────────────────────────────────────
func TestNewHub_NilChecker(t *testing.T) {
// nil AccessChecker is accepted (hub allows all workspace→workspace broadcasts
// when canCommunicate is unset — the gating is purely advisory).
h := NewHub(nil)
if h == nil {
t.Fatal("NewHub(nil) returned nil")
}
if h.canCommunicate != nil {
t.Error("canCommunicate should be nil")
}
}
func TestNewHub_AccessCheckerWired(t *testing.T) {
called := false
checker := func(callerID, targetID string) bool {
called = true
return callerID == targetID // only self-communication allowed
}
h := NewHub(checker)
if h.canCommunicate == nil {
t.Fatal("canCommunicate not wired")
}
// Invoke the wired function directly
allowed := h.canCommunicate("ws-1", "ws-1")
if !called {
t.Error("checker was not called")
}
if !allowed {
t.Error("self-communication should be allowed")
}
if h.canCommunicate("ws-1", "ws-2") {
t.Error("cross-workspace communication should be blocked by checker")
}
}
// ─── safeSend ─────────────────────────────────────────────────────────────
func TestSafeSend_OpenChannel_Sends(t *testing.T) {
c := mockClient("ws-1", 10)
data := []byte(`{"type":"ping"}`)
ok := safeSend(c, data)
if !ok {
t.Error("safeSend should return true for open channel")
}
select {
case got := <-c.Send:
if string(got) != string(data) {
t.Errorf("got %q, want %q", got, data)
}
case <-time.After(100 * time.Millisecond):
t.Error("no message received on channel")
}
}
func TestSafeSend_ClosedChannel_ReturnsFalse(t *testing.T) {
c := mockClient("ws-1", 10)
close(c.Send) // close before safeSend
ok := safeSend(c, []byte("data"))
if ok {
t.Error("safeSend should return false for closed channel")
}
}
func TestSafeSend_FullChannel_ReturnsFalse(t *testing.T) {
c := mockClient("ws-1", 1) // buffer size 1
// Fill the channel
c.Send <- []byte("first")
// Channel is now full
ok := safeSend(c, []byte("second"))
if ok {
t.Error("safeSend should return false when channel buffer is full")
}
// Drain to leave clean state
<-c.Send
}
// ─── Broadcast ────────────────────────────────────────────────────────────
func TestBroadcast_CanvasAlwaysReceives(t *testing.T) {
h := NewHub(nil) // nil checker: canvas always gets messages
// Canvas client (no workspaceID) + two workspace clients
canvas := mockClient("", 10)
ws1 := mockClient("ws-1", 10)
ws2 := mockClient("ws-2", 10)
// Manually register clients into hub state
h.mu.Lock()
h.clients[canvas] = true
h.clients[ws1] = true
h.clients[ws2] = true
h.mu.Unlock()
msg := models.WSMessage{Event: "test", Payload: []byte(`"hello"`)}
h.Broadcast(msg)
// Canvas must receive
select {
case got := <-canvas.Send:
t.Logf("canvas received: %s", got)
case <-time.After(100 * time.Millisecond):
t.Error("canvas client did not receive broadcast")
}
}
func TestBroadcast_WorkspaceCanCommunicateGating(t *testing.T) {
// Only ws-1 can receive messages for ws-2
checker := func(callerID, targetID string) bool {
return callerID == targetID
}
h := NewHub(checker)
ws1 := mockClient("ws-1", 10)
ws2 := mockClient("ws-2", 10)
canvas := mockClient("", 10)
h.mu.Lock()
h.clients[ws1] = true
h.clients[ws2] = true
h.clients[canvas] = true
h.mu.Unlock()
// Broadcast addressed to ws-2
msg := models.WSMessage{Event: "test", WorkspaceID: "ws-2"}
h.Broadcast(msg)
// ws-1 should NOT receive (not the target, checker says no)
select {
case <-ws1.Send:
t.Error("ws-1 should not receive broadcast for ws-2")
case <-time.After(50 * time.Millisecond):
t.Log("ws-1 correctly blocked — no message")
}
// ws-2 should receive
select {
case <-ws2.Send:
t.Log("ws-2 correctly received broadcast")
case <-time.After(100 * time.Millisecond):
t.Error("ws-2 did not receive broadcast")
}
// Canvas always receives
select {
case <-canvas.Send:
t.Log("canvas correctly received broadcast")
case <-time.After(100 * time.Millisecond):
t.Error("canvas did not receive broadcast")
}
}
func TestBroadcast_DropsOnClosedChannel(t *testing.T) {
h := NewHub(nil)
c := mockClient("", 10)
close(c.Send) // pre-close so safeSend returns false
h.mu.Lock()
h.clients[c] = true
h.mu.Unlock()
// Broadcast must not panic; closed client should be dropped silently.
msg := models.WSMessage{Event: "ping"}
h.Broadcast(msg) // should not panic
}
func TestBroadcast_DropsOnFullChannel(t *testing.T) {
h := NewHub(nil)
c := mockClient("", 1)
c.Send <- []byte("blocker") // fill buffer
h.mu.Lock()
h.clients[c] = true
h.mu.Unlock()
msg := models.WSMessage{Event: "ping"}
h.Broadcast(msg) // safeSend returns false; no panic
// Drain to leave clean state
<-c.Send
}
func TestBroadcast_EmptyHubNoPanic(t *testing.T) {
h := NewHub(nil)
msg := models.WSMessage{Event: "ping"}
h.Broadcast(msg) // must not panic with no clients
}
func TestBroadcast_MultiClient(t *testing.T) {
h := NewHub(nil)
clients := make([]*Client, 5)
h.mu.Lock()
for i := 0; i < 5; i++ {
clients[i] = mockClient("", 10)
h.clients[clients[i]] = true
}
h.mu.Unlock()
msg := models.WSMessage{Event: "multi", Payload: []byte(`"all receive"`)}
h.Broadcast(msg)
for i, c := range clients {
select {
case <-c.Send:
t.Logf("client %d received", i)
case <-time.After(100 * time.Millisecond):
t.Errorf("client %d did not receive broadcast", i)
}
}
}
func TestBroadcast_CanvasIgnoresChecker(t *testing.T) {
// Strict checker that blocks ALL cross-workspace (never returns true for different IDs)
strictChecker := func(callerID, targetID string) bool {
return callerID == targetID
}
h := NewHub(strictChecker)
canvas := mockClient("", 10)
h.mu.Lock()
h.clients[canvas] = true
h.mu.Unlock()
msg := models.WSMessage{Event: "ping", WorkspaceID: "ws-1"}
h.Broadcast(msg)
select {
case <-canvas.Send:
t.Log("canvas received message even though checker blocks ws-1")
case <-time.After(100 * time.Millisecond):
t.Error("canvas must always receive — checker should be bypassed")
}
}
// ─── Close ────────────────────────────────────────────────────────────────
func TestClose_DisconnectsAllClients(t *testing.T) {
h := NewHub(nil)
clients := make([]*Client, 3)
h.mu.Lock()
for i := 0; i < 3; i++ {
clients[i] = mockClient("", 10)
h.clients[clients[i]] = true
}
h.mu.Unlock()
// Start Run goroutine so Close can drain Unregister channel
go h.Run()
defer h.Close()
// Unregister all clients so the mutex is released before Close() tries to lock it
for _, c := range clients {
h.Unregister <- c
}
time.Sleep(50 * time.Millisecond)
// Now close — mutex is free, Close() should succeed
h.Close()
// All client channels should be closed
for i, c := range clients {
select {
case _, ok := <-c.Send:
if ok {
t.Errorf("client %d channel still open after Close", i)
}
case <-time.After(100 * time.Millisecond):
// Channel drained and closed
}
}
}
func TestClose_Idempotent(t *testing.T) {
h := NewHub(nil)
c := mockClient("", 10)
h.mu.Lock()
h.clients[c] = true
h.mu.Unlock()
// Close twice — must not panic or deadlock
h.Close()
h.Close() // second call also fine
}
func TestClose_ClosesDoneChannel(t *testing.T) {
h := NewHub(nil)
// Start Run goroutine
done := make(chan struct{})
go func() {
h.Run()
close(done)
}()
h.Close()
select {
case <-done:
t.Log("Run exited after Close")
case <-time.After(200 * time.Millisecond):
t.Error("Run did not exit after Close")
}
}
// ─── Run goroutine (Unregister) ──────────────────────────────────────────
func TestRun_UnregisterClosesClientSend(t *testing.T) {
h := NewHub(nil)
c := mockClient("ws-1", 10)
// Start Run() BEFORE sending to Register — Register is unbuffered,
// so Run() must be ready to receive before the send can complete.
go h.Run()
defer h.Close()
// Register the client
h.Register <- c
// Give Run a moment to register the client
time.Sleep(20 * time.Millisecond)
// Unregister client
h.Unregister <- c
select {
case _, ok := <-c.Send:
if ok {
t.Error("client send channel should be closed after Unregister")
}
case <-time.After(500 * time.Millisecond):
t.Error("client send channel not closed within timeout")
}
}
// ─── Concurrent access ────────────────────────────────────────────────────
func TestBroadcast_ConcurrentSafe(t *testing.T) {
h := NewHub(nil)
clients := make([]*Client, 10)
h.mu.Lock()
for i := 0; i < 10; i++ {
clients[i] = mockClient("", 100)
h.clients[clients[i]] = true
}
h.mu.Unlock()
var wg sync.WaitGroup
for i := 0; i < 5; i++ {
wg.Add(1)
go func(id int) {
defer wg.Done()
for j := 0; j < 20; j++ {
h.Broadcast(models.WSMessage{Event: "ping", Payload: []byte(`"concurrent"`)})
}
}(i)
}
wg.Wait() // should not deadlock or panic
}
+10 -11
View File
@@ -9,6 +9,13 @@ import uuid
import httpx
# OFFSEC-003: peer-controlled text MUST be wrapped with sanitize_a2a_result
# before being returned to the LLM. This module's delegate_task() is one of
# the trust-boundary entry points where peer output crosses into our agent's
# context — same surface as a2a_tools_delegation.py:325 (fixed via #492).
# Issue #537.
from _sanitize_a2a import sanitize_a2a_result
PLATFORM_URL = os.environ.get("PLATFORM_URL", "http://host.docker.internal:8080")
WORKSPACE_ID = os.environ.get("WORKSPACE_ID", "")
@@ -69,12 +76,12 @@ async def delegate_task(workspace_id: str, task: str) -> str:
result = data["result"]
parts = result.get("parts", []) if isinstance(result, dict) else []
if parts and isinstance(parts[0], dict):
return parts[0].get("text", "(no text)")
return sanitize_a2a_result(parts[0].get("text", "(no text)"))
# Empty parts list (e.g. {"parts": []}) should return str(result),
# not "(no text)" — preserves pre-fix behavior (#279 regression fix).
if isinstance(result, dict) and result.get("parts") == []:
return str(result)
return str(result) if isinstance(result, str) else "(no text)"
return sanitize_a2a_result(str(result))
return sanitize_a2a_result(str(result) if isinstance(result, str) else "(no text)")
elif "error" in data:
err = data["error"]
# Handle both string-form errors ("error": "some string")
@@ -87,14 +94,6 @@ async def delegate_task(workspace_id: str, task: str) -> str:
else:
msg = str(err)
return f"Error: {msg}"
msg = ""
if isinstance(err, dict):
msg = err.get("message", "")
elif isinstance(err, str):
msg = err
else:
msg = str(err)
return f"Error: {msg}"
return str(data)
except Exception as e:
return f"Error sending A2A message: {e}"