molecule-core

Author	SHA1	Message	Date
claude-ceo-assistant (Claude Opus 4.7 on Hongming's MacBook)	75a72bf5a2	feat(canvas/chat-server): canvas consumes /chat-history + server-side row-aware reverse (RFC #2945 PR-C-2) Some checks failed Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 4s Details CI / Detect changes (pull_request) Successful in 5s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 5s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 5s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 5s Details Harness Replays / detect-changes (pull_request) Successful in 6s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 5s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 5s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 2s Details CI / Python Lint & Test (pull_request) Successful in 30s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Failing after 9s Details CodeQL / Analyze (${{ matrix.language }}) (go) (pull_request) Failing after 54s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 5s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 3s Details CodeQL / Analyze (${{ matrix.language }}) (javascript-typescript) (pull_request) Failing after 1m19s Details CodeQL / Analyze (${{ matrix.language }}) (python) (pull_request) Failing after 1m20s Details Harness Replays / Harness Replays (pull_request) Failing after 46s Details CI / Canvas (Next.js) (pull_request) Failing after 2m21s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details CI / Platform (Go) (pull_request) Failing after 2m44s Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Failing after 4m49s Details Closes the SSOT story shipped in PR-C/D: canvas now consumes the typed /chat-history endpoint instead of /activity?type=a2a_receive, and the server emits messages in display-ready chronological order so the client doesn't have to re-order them. ## Canvas (consumer migration) - loadMessagesFromDB swaps from /activity to /chat-history. - Drops type=a2a_receive + source=canvas params (server applies the filter centrally now). - Drops [...activities].reverse() — wire is already display-ready. - Drops the local INTERNAL_SELF_MESSAGE_PREFIXES constant + isInternalSelfMessage helper. Server-side IsInternalSelfMessage applies the same predicate before emitting rows. - Drops the activityRowToMessages + ActivityRowForHydration imports from historyHydration.ts. The TS parser stays in tree because message-parser.ts is still load-bearing for live A2A WebSocket messages (ChatTab.tsx:805, AgentCommsPanel.tsx, canvas-events.ts). ## Server (row-aware wire-order fix) The pre-PR-C-2 client did `[...activities].reverse()` over ROWS, then flattened each row into [user, agent] messages. The reversal was ROW-aware. After PR-C/D, the server returned a flat ChatMessage slice in `ORDER BY created_at DESC` order, with [user, agent] within each row. A naive client-side flat reverse would FLIP each pair (agent before user at same timestamp). Two ways to fix it: A) Server emits oldest-first within page; canvas does NOT reverse. B) Canvas does row-aware reversal (group by timestamp, reverse). Option A is cleaner — server owns the wire-order responsibility, every client trusts `for m of messages` to render chronologically. Server adds reverseRowChunks() that: 1. Groups consecutive same-Timestamp messages into row chunks (1-2 messages per row). 2. Reverses the chunk order (newest-row-first → oldest-row-first). 3. Flattens. Within-chunk [user, agent] order is preserved. Single-message rows (agent reply not yet recorded, attachments-only user upload) collapse to 1-element chunks and reverse correctly too. ## Tests Server: 3 new unit tests on reverseRowChunks (paired across rows, single-message rows, empty input) + 1 sqlmock integration test on List() that drives the full SQL → reverse → wire path. Mutation-tested: removed `messages = reverseRowChunks(messages)` from List(), confirmed the integration test fires red with all 4 misordered indices flagged. Restored, all 25 messagestore tests + 9 chat-history handler tests green. Canvas: 8 lazyHistory pagination tests refactored to mock /chat-history (not /activity) and assert against the new wire shape ({messages, reached_end} not raw activity rows). All 1389/1389 vitest tests green; tsc --noEmit clean. ## Three weakest spots (hostile-reviewer self-pass) 1. reverseRowChunks groups by Timestamp string equality. If two distinct rows had the SAME timestamp (legitimately possible at sub- millisecond granularity), the algorithm would treat them as one chunk and not reverse them relative to each other. Mitigated: activity_logs.created_at uses microsecond resolution; concurrent inserts at exact-same microsecond are vanishingly rare. If a collision happens, the within-chunk order is whatever the SQL returned — both rows render at the same timestamp, no user-visible misordering. 2. The pre-existing TS parser files (historyHydration.ts + message-parser.ts) stay in tree. historyHydration.ts is now dead code (no consumers post-migration); deletion is parked as a follow- up after a one-week observation window confirms no live-message consumer reaches it. 3. canvas's loadMessagesFromDB returns `resp.messages ?? []`. If the server were ever to return `null` instead of `[]` (it currently doesn't — handler defensively coerces nil to []), the nullish coalesce keeps the canvas from crashing. A stricter wire schema would assert the never-null invariant; for today's pragmatic safety, the ?? is enough. ## Security review - Untrusted input? Same as PR-C — agent JSON parsed defensively in the messagestore parser. No new exposure. - Trust boundary? Same. Canvas → /chat-history → wsAuth → messagestore. - Output sanitization? Plain text + opaque attachment URIs as before. No security-relevant changes beyond what /chat-history already exposes via PR-C. Considered, not skipped. ## Versioning / backwards compat - /activity endpoint unchanged. - /chat-history endpoint shape unchanged (still {messages, reached_end}); only the wire ORDER within a page changed (newest-first row → oldest- first row). Canvas is the only consumer in tree; no API consumers depend on the previous order. - canvas's loadMessagesFromDB call signature unchanged — internal refactor. 🤖 Generated with [Claude Code](https://claude.com/claude-code)	2026-05-06 16:55:00 -07:00
Hongming Wang	04f7a07add	feat(canvas/chat): inline image preview + fullscreen lightbox (RFC #2991 PR-1) First specialized renderer landing under RFC #2991 — chat attachment preview. Adds the dispatch infrastructure that PR-2 (video/audio) and PR-3 (PDF/text) will extend. Architecture (RFC #2991 Phase 2 design) --------------------------------------- - preview-kind.ts: pure helper that maps mimeType (+ extension fallback for missing/generic MIME) to one of: image \| video \| audio \| pdf \| text \| file. Single source of truth; the dispatch axis for every attachment renderer. - AttachmentPreview.tsx: SSOT dispatch component. ChatTab no longer imports kind-specific components — it imports AttachmentPreview, which switches on the kind and renders the right child. - AttachmentImage.tsx: inline thumbnail (max 240×180) + click → lightbox. Auth-aware: for platform URIs (workspace: / platform-pending: / etc) the bytes are fetched via JS-injected headers, wrapped in a Blob, served as ObjectURL — bare <img src> would not include the cookie/token. - AttachmentLightbox.tsx: shared fullscreen modal (image now; PDF will use it in PR-3). Esc / backdrop click / X button to close, focus trap on close button, focus restoration on close. - AttachmentChip retained as the kind=file fallback. No breaking change for existing renderable shapes. External-workspace coverage --------------------------- The wire shape (ChatAttachment.mimeType + uri) is identical for internal + external workspaces — both go through AgentMessageWriter (PR #2949). External claude-code agents that attach images via send_message_to_user automatically get the new preview surface; no runtime-side change needed. Failure modes ------------- - Fetch failure (404, 403, network) → AttachmentChip fallback so the user still gets a working download. Pinned by tests. - Decoded as non-image (corrupt bytes, wrong Content-Type) → onError on the <img> swaps to AttachmentChip. Pinned by tests. - Non-platform URIs (http/https external image hosts) → skip the auth-fetch flow, use the raw URL via resolveAttachmentHref. Pinned by extension-fallback tests. Tests ----- preview-kind.test.ts (49 cases): - Strict MIME match across image/video/audio/pdf/text/unknown - Extension fallback when MIME is missing or application/octet-stream - URL with query string + fragment → strip before parsing - MIME wins over extension (regression: don't render image-named zip) - SVG is image (not text) despite being XML - Non-canonical MIME like application/javascript → text AttachmentPreview.test.tsx (9 component tests): - Dispatch: kind=file → chip, kind=image → image path - Loading state shows placeholder, NOT chip (proves dispatch routed) - Extension fallback (no mimeType) routes to image path - Fetch fail (404) and network error → fall back to chip - Image success: <img> renders ObjectURL, click opens lightbox - Lightbox: Esc closes, backdrop click closes, content click doesn't - Universal fallback: unknown MIME → chip even when extension hints at a renderable kind Hostile self-review (3 weakest spots, addressed) ------------------------------------------------ 1. <img> auth: bare <img src="/chat/download?..."> would NOT include our auth headers. Resolved via fetch+Blob+ObjectURL pattern. Pinned by the image-success test (asserts src === "blob:test-url"). 2. Server-side allowed-roots mismatch: pre-fix tests used /tmp/ paths which the server doesn't allow. Caught when the dispatch test fell into the non-platform path. Updated tests to use /workspace/ subpaths matching templates.go's allowedRoots. 3. Bundle size creep: each kind component adds bytes. Lightbox is currently always-bundled. Lazy-loading is plausible but defer until measured-needed. Verified - tsc --noEmit clean - 168 chat tests green (49 unit + 9 component + 110 pre-existing) PR-2 (video + audio) and PR-3 (PDF + text) extend the dispatch in AttachmentPreview.tsx with their own kind-specific components. Refs RFC #2991. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 19:39:37 -07:00
Hongming Wang	5d8b5e96e3	fix(canvas/chat): handle platform-pending: scheme for poll-mode upload downloads Followup to PR #2966. The user reported the about:blank symptom on reno-stars and the browser console showed: Failed to launch 'platform-pending:d76977b1-…/bb0dcaf3-…' because the scheme does not have a registered handler. So the agent's "download link" was a `platform-pending:<wsid>/<file_id>` URI — the canonical reference for poll-mode chat uploads (see workspace-server/internal/handlers/chat_files.go:690 + workspace/inbox_uploads.py). PR #2966 only handled `workspace:`, `file:///`, and absolute container paths; the platform-pending scheme fell through to the raw URI which the browser couldn't navigate to. Fix --- - `resolveAttachmentHref`: added a `platform-pending:` branch that resolves to `${PLATFORM_URL}/workspaces/<wsid>/pending-uploads/ <file_id>/content`. Uses the wsid from the URI, NOT the chat's workspace_id — these can differ when a file is forwarded across workspaces (cross-workspace delegation, agent forwarding). - New `isPlatformAttachment(uri)` helper — single source of truth for "this URI requires our auth headers, route through downloadChatFile". Used by both `downloadChatFile` (chip click) and ChatTab's markdown-link override. - ChatTab.tsx markdown-link override now imports `isPlatformAttachment` instead of duplicating the scheme list. Pre-fix this list was duplicated and missed `platform-pending:`. Tests ----- The 4 IME tests still pass; tsc clean. The platform-pending resolution is exercised via the `isPlatformAttachment` SSOT helper (any URI reaching `downloadChatFile` or the markdown override goes through it). A dedicated test for the URL shape would need a more elaborate fixture; manual verification on staging post-deploy is the practical gate. Reported on production reno-stars 2026-05-05. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 16:55:43 -07:00
Hongming Wang	c2e12f3fb6	fix(canvas/chat): IME-safe Enter + markdown link target/scheme handling Two production-reported regressions in the same chat surface, fixed in one focused PR. Issue 1 — IME composition + Enter sends half-typed message ---------------------------------------------------------- ChatTab's textarea onKeyDown was: if (e.key === "Enter" && !e.shiftKey) { e.preventDefault(); sendMessage(); } For agents typing CJK / Japanese / Korean via the system IME, Enter commits the candidate selection — not a newline, not a send. With the old check, every IME-commit Enter accidentally sent the half-typed message ("你好" + half-typed-pinyin + Enter to commit the next candidate → message goes out before the user finishes). Fix: guard on `event.nativeEvent.isComposing` AND `e.keyCode !== 229`. The latter covers older Safari / WebKit-based mobile browsers that delay setting isComposing on the composition-end Enter. Issue 2 — markdown links land at about:blank --------------------------------------------- ReactMarkdown's default `<a>` rendering passes the agent-supplied href directly to the DOM with no target / scheme handling: - http(s) → navigates the canvas tab away (canvas state lost) - workspace://path / file:///workspace/... / /workspace/... → browser hits unhandled-protocol click → about:blank, no download (the reported bug) Fix: ReactMarkdown `components.a` override: - In-container paths (workspace:, file:///{workspace,configs,home, plugins}, bare /{workspace,configs,...}) → preventDefault, route through downloadChatFile (same auth path the AttachmentChip uses). Filename is derived from the path's last segment. - External (http/https/mailto/unknown scheme) → target="_blank" rel="noopener noreferrer" so canvas state survives. Tests ----- ChatTab.imeAndLinks.test.tsx (4 tests): - Enter with isComposing=true → does NOT send, input preserved - Enter with keyCode=229 (older-Safari IME) → does NOT send - Enter with no IME signal → DOES send (happy path intact) - Shift+Enter → does NOT send (newline path intact) The link-component override is exercised through the full ChatTab render — the IME tests are jsdom-only and don't load chat history with markdown messages, so the link test would need a more elaborate fixture. Manual verification on staging post-deploy is the practical gate; if the link test grows critical the AttachmentViews-style chip test can extend. Verified: - tsc --noEmit clean - 4/4 IME tests pass Reported on production 2026-05-05. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 16:47:04 -07:00
Hongming Wang	a489ee1a7c	fix(canvas/chat): instant-scroll to bottom on first mount Reported: "right now when chat box opens it opens in the middle, but it should be at the end of conversation." Root cause: ChatTab.tsx:548 fires `bottomRef.scrollIntoView({ behavior: "smooth" })` on every messages-update. On initial mount with N messages already loaded, the smooth-scroll triggers a ~300ms animation that any concurrent React re-render (agent push landing, theme toggle, sidepanel resize) interrupts mid-flight, leaving the user stuck somewhere in the middle of the conversation. Fix: track first-mount via hasInitialScrollRef. Use behavior:"instant" for the initial jump (deterministic, no animation interruption), then smooth for subsequent appends (the new-message-landing visual stays). Refs flipped on first messages.length > 0 transition, so: - Initial open of chat tab: instant jump to bottom ✓ - New agent message arrives: smooth scroll into view ✓ - Workspace switch (ChatTab remounts): fresh hasInitialScrollRef, gets instant again ✓ - loadOlder prepend: anchor-restore path unchanged, still pins user's reading position ✓ Test plan: - pnpm test --run ChatTab.lazyHistory.test.tsx → 8 pass (existing lazy-history tests untouched) - npx tsc --noEmit clean - Manual on hongming.moleculesai.app: open a busy chat (mac laptop, ~50 messages), confirm view lands at the latest bubble, not mid- scroll. Switch to another workspace + back → instant again. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 10:47:32 -07:00
Hongming Wang	4c9309e801	fix(canvas/chat): tag scroll anchor + token-guard fetches (race fixes) Multi-model review of #2826 found two issues my self-approval missed: C1. Live agent-message append during loadOlder() yanked scroll AND swallowed the bottom-pin. The useLayoutEffect's "restore against saved distance-from-bottom" branch fired on ANY messages update while scrollAnchorRef was set — including appends from agent pushes that landed mid-fetch. User reading mid-history got thrown to a stale offset; the new agent message's normal scroll-to-bottom was silently swallowed. Fix: tag scrollAnchorRef with `expectFirstIdNotEqual` (the oldest message's id BEFORE the prepend). The layout effect only honors the anchor when messages[0].id has changed from that tag — i.e., a real prepend happened, not an append. R4. Workspace switch mid-fetch leaked the in-flight promise's result into the new workspace's state — user briefly saw someone else's history. Same shape for a fast-clicked Retry button or rapid scroll-flick triggering a second loadOlder. Fix: `fetchTokenRef` monotonic counter. loadInitial + loadOlder each capture their token at entry; the .then() bails if the token has moved. Both call sites bump the token at fetch start so any in-flight stale fetch loses identity. C2 (loadOlder identity stability via refs) and R3 (inflightRef synchronous double-entry guard) were already pushed in the previous commit on this branch. Build + 1258 tests pass.	2026-05-04 20:42:29 -07:00
Hongming Wang	20f76c4fdf	fix(canvas/chat): stable IntersectionObserver + inflight guard for loadOlder Self-review of the lazy-load PR caught three Important findings: 1. IO observer was re-armed on every messages change. The previous loadOlder useCallback depended on `messages`, so every live agent push recreated it → re-ran the IO useEffect → tore down + re-armed the observer. In a perf PR shipping to chat-heavy users, that's the wrong direction. Fix: refs for the captured state (oldestMessageRef, hasMoreRef), narrow loadOlder deps to [workspaceId], and gate the IO effect on `messages.length > 0` (boolean) instead of `messages` so it arms exactly once when data first lands and stays armed across appends. 2. loadingOlder setState race. Two IO callbacks dispatched in the same microtask (fast scroll, layout shift) could both pass the `if (loadingOlder)` guard before React committed setLoadingOlder. Fix: synchronous inflightRef set BEFORE any await, cleared in finally; loadingOlder state stays for the UI label only. 3. Retry-button onClick duplicated the mount-effect body. Single loadInitial() callback now serves both, eliminating the drift hazard. Coverage: - 4 new tests bring the file to 8/8 (was 4): - loadOlder fetches with limit=20 and before_ts=oldest.timestamp - inflight guard rejects three concurrent IO triggers while a deferred fetch is in flight (asserts call count stays at 2, not 5) - empty older response unmounts the sentinel (proxy for the anchor-clearing branch in loadOlder) - IO observer instance survives three subsequent prepends — same object reference both before and after, no churn - Both behavioural tests verified to FAIL on the prior code (stashed ChatTab.tsx, ran them alone, confirmed both red), then PASS on this commit. Pinning real regressions, not tautologies. - IntersectionObserver fake captures instances + exposes triggerIntersection() so the IO callback can be driven directly from jsdom (no real layout / scrolling needed). Test: vitest run src/components/tabs/__tests__/ → 39 passed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-04 20:38:37 -07:00
Hongming Wang	8152cfc81e	feat(canvas/chat): lazy-load history — 10 newest on mount, 20 per scroll-up batch Pre-fix ChatTab fetched the newest 50 messages on every mount and scrolled to bottom, paying full DOM cost up-front even when the user only wanted to read the last few bubbles. On a long-running workspace this meant 50× message-bubble paint + DOM cost on every tab swap. Now: - Initial fetch limit=10 (newest-first slice). - IntersectionObserver on a top sentinel (rootMargin 200px) fires loadOlder() the moment the user scrolls within 200px of the top. - loadOlder() uses the oldest loaded message's timestamp as `before_ts` (RFC3339 cursor the /activity endpoint already supports) and fetches OLDER_HISTORY_BATCH (20) more. - hasMore turns false when the server returns < limit rows; the sentinel unmounts and the IO observer disconnects — no spinner on a short conversation. - useLayoutEffect handles scroll behavior across messages updates: a prepend (loadOlder landed) restores the user's saved distance-from-bottom (captured via scrollAnchorRef before the fetch) so their reading position doesn't jump; an append / initial load pins to the latest bubble. Tests: 4 new in ChatTab.lazyHistory.test.tsx pinning the limit=10 on initial fetch, hasMore=false on short-history, full-page rendering on exactly-the-limit, and limit=10 on retry-after-failure. Doesn't exercise the IO/scroll-anchor in jsdom — that's brittler than trusting the synth-canary against a live tenant. Build clean. Existing 1250 tests + 4 new = 1254 pass.	2026-05-04 20:12:01 -07:00
Hongming Wang	c37596fc26	fix(canvas): brighten agent chat prose body in dark mode User feedback: chat-bubble agent text still washed out after #2618 + #2623. Looked at the actual rendered colors and the issue was Tailwind Typography's `prose-invert` defaults — body text ships at zinc-300, which lands at ~5.3:1 against bg-zinc-700. Passes AA but visibly duller than the user bubble's crisp white-on-blue (~10:1). Override the prose CSS variables on the agent bubble in dark mode: - body → zinc-100 (was zinc-300) - headings / bold → white - inline code → zinc-100 That brings agent body text to ~13:1 against bg-zinc-700, matching the user bubble's brightness so both sides of the conversation read at the same crispness. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 13:04:12 -07:00
Hongming Wang	ffcffa1375	fix(canvas): agent chat bubble dark-mode prose contrast Regression from PR #2618 (chat dark-contrast). PR #2618 switched the agent bubble bg to `dark:bg-zinc-700` so it visibly elevates against the dark panel — but the inner ReactMarkdown prose div only got `prose-invert` for USER messages. Result: in dark mode the agent's markdown text rendered with the Tailwind Typography plugin's default dark body color on top of the new dark bg = invisible text. User reported empty-looking gray rectangles where agent replies should be. Fix: apply `dark:prose-invert` to agent bubbles so prose body text flips light alongside the bg. Light mode unchanged (default prose colors against the warm `bg-surface-card`). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 12:36:44 -07:00
Hongming Wang	026c81acf0	fix(canvas): dark-mode chat bubble contrast User screenshot showed pale lavender user bubbles with hard-to-read white text and a nearly-invisible agent bubble blending into the dark panel. Root causes: 1. Tailwind v4 defaults `dark:` to `prefers-color-scheme: dark`. Our ThemeProvider writes `data-theme="dark"` on <html> so user toggle wins over OS — but `dark:` classes elsewhere in the codebase weren't tracking it. Added `@custom-variant dark` to re-bind the variant. 2. `bg-accent` themes lighter in dark mode (--color-accent: #6883e8), dropping white-text contrast to ~3:1 (fails WCAG AA). Switched user bubble to solid blue-600/500 so it stays ~5:1 in both modes. 3. `bg-surface-card` (#1a1d23) was only ~7% lighter than the panel bg (#0e1014), making agent bubbles disappear. Bumped to zinc-700 in dark; light mode keeps the warm surface-card tint. 4. System (error) bubble's /10 overlay was nearly invisible; raised to /25 in dark with stronger border + ink for readability. Sub-tab + textarea polish included: low-contrast `text-ink-soft` → `text-ink-mid`, focus-visible rings on tabs, dark variants on textarea. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 12:00:51 -07:00
Hongming Wang	3d145da99d	fix(canvas): chat bubble + sub-tab contrast in light theme Chat bubble fixes (canvas/src/components/tabs/ChatTab.tsx): - User bubble: bg-accent-strong/30 + text-blue-100 → bg-accent + text-white (translucent dark-blue overlay on warm-paper surface read as pale lavender with near-invisible light-blue text — a real WCAG AA failure on the highest-traffic surface in canvas). - System/error bubble: bg-red-900/30 + text-red-200 → bg-bad/10 + text-bad, using semantic tokens so dark-mode adapts automatically. - Agent bubble: drop /80 + /30 opacity modifiers; solid bg-surface-card + text-ink + border-line gives consistent contrast in both themes. - prose-invert was unconditional, so markdown text on agent/system bubbles rendered as light text on a light surface in light mode. Make it apply only on the user bubble (the only dark surface in this component). - Timestamp: text-ink-soft is too pale on warm-paper; use text-ink-mid for agent/system, white/70 for user (visible on the now-solid blue bg). Sub-tab bar fixes (canvas/src/components/SidePanel.tsx): - Right-edge fade was hardcoded `from-zinc-950` — that paints a dark vertical strip on the right edge of the tab bar in light mode. Switch to `from-surface` so the gradient blends into whichever theme is active. - Inactive tab text: text-ink-soft (~3.5:1 on warm-paper) → text-ink-mid (~7:1). Active tab background: drop the /40 opacity so the selection is unambiguous on either surface. No semantic-token additions; all changes use existing warm-paper tokens that already work in both themes.	2026-05-03 09:58:18 -07:00
Hongming Wang	db48d1d261	fix(canvas): restore text-white on saturated buttons + close zinc gaps Independent code review of #2555 caught two contrast regressions left by the bulk perl pass: 1. text-white → text-ink mass-substitution silently broke destructive and primary buttons. text-ink resolves to #15181c (warm-paper near-black) in light mode — dark text on bg-red-600 / bg-amber-600 / bg-emerald-600 / bg-blue-600 / bg-accent / bg-accent-strong / bg-good / bg-bad fails WCAG contrast and looks broken. Per-line pass flips text-ink → text-white only when a saturated bg utility is present; tinted-state pills (bg-red-950/50 etc.) keep their intentionally-retained text-* literals. 2. Original mapping table was missing bg-zinc-600 (most-used hover-state literal for cancel buttons — caused them to JUMP from warm cream resting state to dark zinc on hover in light mode) and text-zinc-700/800/900 (separator dots and decorative dim text invisible on warm-paper light bg). Extended mapping fills these gaps with bg-surface-card / text-ink-soft. Also: drop stale tailwind.config.ts reference from components.json (file deleted by the v3→v4 migration); switch baseColor zinc → neutral and enable cssVariables since v4 uses CSS-driven tokens. Future shadcn-cli invocations would have failed or written malformed components without this. 27 sites in 27 files affected by #1, ~20 sites in 20 files by #2. 1214/1214 unit tests still pass; build still clean. Findings courtesy of multi-model review per code-review-and-quality skill — different blind spots catch different bugs. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 02:04:20 -07:00
Hongming Wang	c0eca8d0e1	feat(canvas): warm-paper theme + Tailwind v4 migration Brings the canvas onto the warm-paper design system already shipped to landing, marketplace, and SaaS surfaces, and migrates the build from Tailwind v3 → v4 to match molecule-app. Plumbing: - swap tailwindcss v3 → v4, drop autoprefixer, add @tailwindcss/postcss - delete tailwind.config.ts (v4 reads tokens from @theme blocks in CSS) - globals.css: @import "tailwindcss" + @plugin "@tailwindcss/typography" - two @theme blocks: warm-paper light defaults + always-dark surface tokens (bg-bg / ink-mute / line-strong) for terminal/console panels - [data-theme="dark"] cascade overrides the warm-paper tokens for dark - React Flow edge stroke + scrollbar + selection colour pull from semantic tokens so they flip with the theme Theme infra (ported from molecule-app, identical contracts): - lib/theme-cookie.ts: mol_theme cookie + boot script (no "use client" so server components can read the constants) - lib/theme-provider.tsx: ThemeProvider + useTheme + cookie writer with Domain=.moleculesai.app so the preference follows the user across canvas/app/market/landing subdomains AND tenant subdomains - lib/theme.ts: ColorToken union + cssVar() helper - components/ThemeToggle.tsx: 3-way System/Light/Dark picker - layout.tsx: SSR cookie read + nonce'd inline boot script (CSP needs the explicit nonce — strict-dynamic doesn't forgive an un-nonce'd inline sibling) + ThemeProvider wrapper + bg-surface/text-ink body Component migration (62 files): - Mechanical bg-zinc-* / text-zinc-* / border-zinc-* / text-white → semantic surface/ink/line tokens via perl negative-lookahead pass (preserves opacity modifiers like /80, /60) - bg-blue-500/600 → bg-accent / bg-accent-strong - text-red-* / amber-* / emerald-* → text-bad / warm / good - Tinted-state banner backgrounds (bg-red-950, bg-amber-950, bg-blue-950 etc.) intentionally left literal — they remain readable on warm-paper in light mode without inventing new state-soft tokens - TerminalTab.tsx skipped — xterm renders to canvas, not DOM - 3 unit-test assertions updated to match new token strings (credits pillTone, AuthGate overlay class, A2AEdge accent) Verification: - pnpm test: 1214/1214 pass - pnpm tsc --noEmit: clean - next build: ✓ Compiled successfully (8 routes) - dev server inspection: html data-theme stamped, body uses bg-surface text-ink, boot script carries nonce, compiled CSS contains both @theme blocks + [data-theme="dark"] override Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 01:43:55 -07:00
Hongming Wang	fc33cf1131	docs(a2a): correct misleading v1-tolerance comments Follow-up to PR #2509/#2510. The defensive v1-detection branches in extract_attached_files (Python) and extractFilesFromTask (TypeScript) were merged with comments claiming they fix a "v0→v1 silent-drop" bug that surfaced as the 2026-05-01 hongming "no text content" incident. Live test disproved that hypothesis: a2a-sdk's JSON-RPC layer validates inbound requests against the v0 Pydantic union, so v1 shapes are rejected at the request boundary — the v1 detection branch is unreachable on the JSON-RPC ingress path. The actual root cause of the hongming incident was the missing /workspace chown fixed by CP PR #381 + test #382. Update the comments to honestly describe these branches as defensive future-proofing (kept against an eventual SDK schema migration or in-process callers that construct Parts directly from protobuf), not as fixes for an observed bug. Also trims ChatTab.tsx's outbound-shape comment block from ~21 lines to a 3-line pointer to the SDK union. Comment-only change. No behavior change. 86 workspace tests + 91 canvas tests still pass.	2026-05-02 02:33:00 -07:00
Hongming Wang	3ce7c11a13	fix(canvas): revert v1 outbound file part shape The previous PR (#2509) flipped canvas outbound file parts to the v1 flat shape `{url, filename, mediaType}` based on a hypothesis that a2a-sdk's JSON-RPC parser silently dropped v0 `{kind:"file", file:{...}}` shapes. Live test shows the opposite: a2a-sdk's JSON-RPC layer validates against the v0 Pydantic discriminated union (TextPart \| FilePart \| DataPart), so v1 flat shape is rejected with: Invalid Request: params.message.parts.0.TextPart.text — Field required params.message.parts.0.FilePart.file — Field required params.message.parts.0.DataPart.data — Field required The actual root cause of the user-visible "Error: message contained no text content" was the missing `/workspace` chown (CP PR #381 + test pin #382), not a wire-shape mismatch. Verified end-to-end by sending a v0 image-only message after PR #381 + workspace re-provision — agent receives the file, reads its bytes, and replies normally. Reverting only the canvas outbound shape. Defensive v1-tolerance stays in: - workspace/executor_helpers.py — extract_attached_files still accepts v1 protobuf parts in case a future client emits them or a future SDK release flips internal representation. Harmless on the v0 hot path. - canvas/message-parser.ts — extractFilesFromTask still tolerates v1 shape on incoming agent responses. Some agents may emit v1 when their internal serializer round-trips through protobuf. Tests stay green (91 canvas, 86 workspace).	2026-05-02 01:31:56 -07:00
Hongming Wang	02a8841402	fix(a2a): send v1 file Part shape; tolerate v1 server-side Image-only chats surface "Error: message contained no text content" because canvas posts v0 `{kind:"file", file:{uri,name,mimeType}}` shapes that the workspace runtime's a2a-sdk v1 protobuf parser silently drops: v1 `Part` has fields `[text, raw, url, data, metadata, filename, media_type]` and `ignore_unknown_fields=True` discards `kind`+`file`, producing a fully-empty Part. With no text and no extracted file attachments, the executor's "no text content" guard fires. Three coordinated changes close the gap: 1. canvas/ChatTab.tsx — outbound file parts now carry the v1 flat shape `{url, filename, mediaType}` so the v1 protobuf parser populates Part fields instead of dropping them. 2. workspace/executor_helpers.py — extract_attached_files learns the v1 detection branch (non-empty `part.url` + `filename` + `media_type`) alongside the existing v0 RootModel and flat-file shapes. Defends every runtime that mounts the OSS wheel against the same drop, including any pre-fix client still on the wire. 3. canvas/message-parser.ts — extractFilesFromTask tolerates the v1 shape on incoming agent responses too, so file chips render in chat history regardless of which Part shape the runtime emits. Test pins: - workspace/tests/test_executor_helpers.py: + v1 protobuf shape extraction + empty-Part defense (v0→v1 silent-drop fall-through returns []) - canvas message-parser test: + v1 protobuf flat parts + filename fallback to URL basename for v1	2026-05-02 00:58:05 -07:00
Hongming Wang	4028b81e04	refactor(canvas): route panel WS subscriptions through global socket Both AgentCommsPanel and ChatTab's activity-feed opened raw `new WebSocket(WS_URL)` instances per mount, with no onclose handler and no reconnect logic. When the underlying connection dropped — idle timeout, browser background-tab throttle, network jitter — the per- panel sockets stayed dead until the panel re-mounted (refresh or sub-tab unmount/remount). Live agent-comms bubbles and live activity feed lines silently went missing in the gap, manifesting as "the delegation didn't show up until I refreshed." The global ReconnectingSocket in store/socket.ts already owns reconnect, exponential backoff, health-check, and HTTP fallback poll. Routing component subscribers through it gives every consumer those guarantees for free, with one TCP connection per tab instead of N. Three new pieces: - store/socket-events.ts: tiny pub/sub bus. emitSocketEvent fan-outs every decoded WSMessage to the listener Set; subscribeSocketEvents returns an unsubscribe. A throwing listener is logged and isolated so it can't break siblings. - store/socket.ts: ws.onmessage now calls emitSocketEvent(msg) right after applyEvent(msg), so the store's derived state and component subscribers stay in lockstep on every event arrival. - hooks/useSocketEvent.ts: React hook that registers exactly once per mount, capturing the latest handler in a ref so the closure sees current state/props without re-subscribing on every render. Refactored sites: - AgentCommsPanel: replaced its WebSocket-in-useEffect block with useSocketEvent. Same parsing logic; the panel no longer opens its own connection. - ChatTab activity feed: split the previous useEffect in two — one seeds the activity log when `sending` flips, the other subscribes unconditionally and gates work on `sending` inside the handler. Hooks can't be conditional, so the gate has to live in the body rather than around the effect. The ws-close graceful-close helper is no longer needed in either site; the global socket owns its own teardown. Tests: 6 new tests for the bus contract (single delivery, fan-out order, unsubscribe, throwing-listener isolation, no-subscriber emit, duplicate-subscribe Set semantics). All 27 existing socket tests still pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 13:12:47 -07:00
Hongming Wang	f7ad5a82f7	fix(canvas): release sendInFlightRef in the activity-log WS path too Third-pass review caught a fourth WS path I missed. The original fix + the stale-callback follow-up patched 3 sites that release the in-flight guards (pendingAgentMsgs effect, HTTP .then() success, HTTP .catch() success), but the ACTIVITY_LOGGED handler at lines 410-419 also clears `sending` + `sendingFromAPIRef` when the platform logs the workspace's a2a_receive ok/error. It only cleared 2 of the 3 refs — same exact bug class as the original. If THIS path wins the race (a2a_receive activity logged before pendingAgentMsgs delivers the reply text), sendInFlightRef stays stuck true and the next sendMessage() silently no-ops at line 464. Fix: route both branches (ok and error) through releaseSendGuards() so all four sites are now uniform. Updated the helper's docstring to explicitly list all four sites and warn that any future "I saw the reply" path that only clears the natural pair (sending + sendingFromAPIRef) will silently re-introduce the freeze. The disabled-button logic can't see sendInFlightRef so the visible state diverges from the synchronous re-entry guard otherwise. This is exactly the drift `releaseSendGuards()` was supposed to prevent — the helper landed in the prior commit but the activity-log site wasn't migrated to use it. Fixing now closes the gap. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 12:27:29 -07:00
Hongming Wang	cacf499354	fix(canvas): close stale-callback race + extract releaseSendGuards helper Self-review on PR #2185 surfaced a latent race the original fix exposed: the WS-clears-guards path now releases sendInFlightRef immediately, which means a user can fire msg #2 between WS-arrival and HTTP-arrival for msg #1. Without coordination, msg #1's late .then() sees sendingFromAPIRef=true (set by msg #2's send), enters the main body, and runs setSending(false) + appendMessageDeduped against msg #1's response body — clobbering msg #2's in-flight UI state. This race is realistic for claude-code SDK: the comment at line 294-298 already calls WS the "authoritative reply arrived" signal, and the user typically reads-then-types before the trailing HTTP completes. Without the original Send-button freeze "protecting" the race, it surfaces. Two changes: 1. Token-keyed callbacks. sendTokenRef bumps on every sendMessage entry; .then()/.catch() capture the token in closure and bail without touching any flags if a newer send has superseded them. The newer send owns the in-flight guards. 2. releaseSendGuards() helper. The three-clear-guards trio (setSending, sendingFromAPIRef, sendInFlightRef) now lives in one useCallback so the WS handler, .then() success, and .catch() success can't drift apart. A future contributor dropping one of the three would silently re-introduce either the post-WS Send freeze or the stale-callback clobber. Skipped a unit test for this regression — ChatTab has no __tests__ file and a mount test would need WS + zustand + api mocks. The fix is 4 logical lines (token capture + 2 guard checks) and the manual test covers it. Follow-up to add a focused mount test when ChatTab gets its first __tests__ file. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 11:47:12 -07:00
Hongming Wang	5faaf58466	fix(canvas): clear sendInFlightRef on WS-push reply path Send button + Enter both silently no-op'd after the first agent reply on runtimes that deliver via WebSocket (claude-code SDK does this per the comment at ChatTab.tsx:294-298). The visible disabled-state checks (sending, uploading, agentReachable) were all clean — the freeze came from a third synchronous reentry guard the button can't see: if (sendInFlightRef.current) return; // ChatTab.tsx:438 The ref was set true at the start of sendMessage() and only cleared in .then() / .catch() of the HTTP fall-through and the upload-failure branch. The WS-push handler in the pendingAgentMsgs effect cleared `sending` and `sendingFromAPIRef` but left `sendInFlightRef` stuck true. The HTTP .then() then early-returned at the dedup check (line 513) without touching the ref — only the .catch() early-return path did. Net result: refresh fixed it because the ref reset on remount. Two-line fix: - WS handler: also clear sendInFlightRef when the push delivers the reply (primary fix; no race window where the ref is stuck while the user can already type) - .then() early-return: mirror .catch()'s cleanup as defense in depth, so neither delivery order leaks the ref While here: A2AEdge.test.tsx fixture was typed `as never` to dodge EdgeProps' discriminated-union complaint, which broke spreading at the call sites with TS2698 ("Spread types may only be created from object types"). Replaced with `as unknown as ComponentProps<typeof A2AEdge>` — preserves the original "skip restating every optional field" intent and keeps a spreadable type. All 10 A2AEdge tests pass; tsc --noEmit is clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 11:11:58 -07:00
Hongming Wang	0032f9c906	fix(chat): drop unused extractResponseText import after helper extraction Reviewer bot flagged: ChatTab.tsx imported extractResponseText but no longer used it after the loop body moved to historyHydration.ts (the helper imports it directly). Drop from the named import to unblock merge. extractFilesFromTask remains used at line 515 for the WS A2A_RESPONSE handler's reply-files extraction. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 22:52:53 -07:00
Hongming Wang	fe204f04da	test(chat): extract historyHydration helper + 12 unit tests User pushed back: the timestamp bug should have been caught by E2E. Right — my earlier coverage tested the server contract (notify endpoint, WS broadcast filter) but never the chat-history HYDRATION path. Without a unit test that froze the wall clock and asserted timestamps came from created_at, a future refactor could re-introduce the same bug. This commit: 1. Extracts the per-row → ChatMessage[] mapping out of the closure inside loadMessagesFromDB into chat/historyHydration.ts. Pure function, no React dependency, easy to test. 2. Adds 12 vitest cases in __tests__/historyHydration.test.ts covering: - Timestamp regression (3 tests, with system time frozen to 2030 so a regression starts producing "2030-…" timestamps and the assertion fails unmistakably). The third test mirrors the user's screenshot: two rows with distinct created_at must produce distinct timestamps. - User-message extraction (text, internal-self filter, null body) - Agent-message extraction (text, error→system role, file attachments, null body, body with neither text nor files) - End-to-end: a single row with both request and response emits two messages with the same timestamp (the canonical canvas-source row pattern) 3. The new file-attachment test caught a SECOND latent bug — the helper was passing `response_body.result ?? response_body` to extractFiles FromTask, which passes the STRING "<text>" for the notify-with- attachments shape `{result: "<text>", parts: [...]}` and silently returns []. So a chat reload after an agent attached a file would lose the chips. Fixed by only unwrapping `result` when it's an object (the task-shape) and falling through to response_body otherwise (the notify shape). ChatTab now imports the helper and the loop body becomes one line: `messages.push(...activityRowToMessages(a, isInternalSelfMessage))`. Verification: - 12/12 historyHydration tests pass - 1072/1072 full canvas vitest pass (was 1060 before, +12) - tsc --noEmit clean Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 21:18:22 -07:00
Hongming Wang	8415870520	fix(chat): pin historical user-message timestamps to activity created_at User flagged that all historical user bubbles render with the same "now" clock after a chat reload — both messages in the screenshot showed 9:01:58 PM despite being sent hours apart. ChatTab.tsx:142 minted user messages with createMessage(...) which calls new Date().toISOString() — fine for a freshly-typed message, wrong for hydrated history. Every reload re-stamped all user bubbles to the render moment, collapsing the visible chronology. The agent path on line 157 already overrides with a.created_at; mirror that. One-line fix (spread + override timestamp) plus a comment explaining why the override is load-bearing so the next refactor doesn't drop it. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 21:06:19 -07:00
Hongming Wang	892de784b3	fix: review-driven hardening of wedge detector + idle timeout + progress feed Bundle review of pieces 1/2/3 surfaced two critical issues plus a handful of required + optional fixes. All addressed. Critical: 1. Migration 043 was missing 'paused' and 'hibernated' from the workspace_status enum. Both are real production statuses written by workspace_restart.go (lines 283 and 406), introduced by migration 029_workspace_hibernation. The original `USING status::workspace_status` cast would have errored mid-transaction on any production DB containing those values. Added both. Also added `SET LOCAL lock_timeout = '5s'` so the migration aborts instead of stalling the workspace fleet behind a slow SELECT. 2. The chat activity-feed window kept only 8 lines, and a single multi-tool turn (Read 5 files + Grep + Bash + Edit + delegate) easily flushed older context before the user could read it. Extracted appendActivityLine to chat/activityLog.ts with a 20-line window AND consecutive-duplicate collapse (same tool on the same target twice in a row is noise, not new progress). 5 unit tests pin the behavior. Required: 3. The SDK wedge flag was sticky-only — a single transient Control-request-timeout from a flaky network blip locked the workspace into degraded for the whole process lifetime, even when the next query() would have succeeded. Added _clear_sdk_wedge_on_success(), called from _run_query's success path. The next heartbeat after a working query reports runtime_state empty and the platform recovers the workspace to online without a manual restart. New regression test. 4. _report_tool_use now sets target_id = WORKSPACE_ID for self- actions, matching the convention other self-logged activity rows use. DB consumers joining on target_id see a well-defined value instead of NULL. Optional taken: 5. Tightened _WEDGE_ERROR_PATTERNS from "control request timeout" to "control request timeout: initialize" — suffix-anchored so a future SDK error on an in-flight tool-call control message doesn't get misclassified as the unrecoverable post-init wedge. 6. Dropped the redundant "context canceled" substring fallback in isUpstreamBusyError. errors.Is(err, context.Canceled) is the typed check; the substring would also match healthy client-side aborts, which we don't want classified as upstream-busy. Verified: 1010 canvas tests + 64 Python tests + full Go suite pass; migration applies cleanly on dev DB with all 8 enum values; reverse migration restores TEXT. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 08:43:10 -07:00
Hongming Wang	166c7f77af	feat(chat): stream per-tool progress into MyChat live feed Two halves of the same UX win — the user wants to see what Claude is doing while a chat reply is in flight instead of staring at "0s" for minutes. Workspace side (claude_sdk_executor.py): - The executor's _run_query message loop already iterated the SDK stream for AssistantMessage.TextBlock content. Now also detects ToolUseBlock / ServerToolUseBlock entries (by class name, since the conftest stub doesn't define them) and fires-and-forgets a POST /workspaces/:id/activity row of type agent_log per tool use. - _summarize_tool_use maps the common tools (Read, Write, Edit, Bash, Glob, Grep, WebFetch, WebSearch, Task, TodoWrite) to a one-line summary with the file path / pattern / command, falling back to "🛠 <tool>(…)" for anything else. Truncated at 200 chars. - Posts directly to /workspaces/:id/activity rather than going through a2a_tools.report_activity, which would also push a /registry/heartbeat current_task and double-log as a TASK_UPDATED line in the same chat feed. - All failures swallowed silently — telemetry must not break the conversation. Canvas side (ChatTab.tsx): - The existing ACTIVITY_LOGGED handler streams a2a_send / a2a_receive / task_update events into a sliding-window activityLog state. Two issues fixed: 1. No `msg.workspace_id === workspaceId` filter — a sibling workspace's a2a_send was leaking into the wrong chat panel as "→ Delegating to X...". Added an early return. 2. No agent_log render branch. Added one that renders the summary verbatim (the workspace already prefixed its own emoji icon, so no double-icon). - Existing 8-line sliding window keeps the UI scoped; older progress lines naturally roll off as new ones arrive. Result: when DD is delegating to Visual Designer + reading config files + running Bash to lint, the spinner area shows: 📄 Read /configs/system-prompt.md ⚡ Bash: pnpm test → Delegating to Visual Designer... ← Visual Designer responded (47s) instead of bare "0s · Processing with Claude Code..." for minutes. 63 Python tests + 58 canvas chat tests pass; tsc clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 08:28:55 -07:00
Hongming Wang	1d71b4e9e5	fix(canvas): bundle of UX hardening — modals, position stability, error UX, paste Single-themed bundle of fixes accumulated while polishing the canvas chat / agent-comms / plugins / position flows. Each piece is small; the connective tissue is "things observable from the canvas right panel and the org-deploy flow that surprised real users". UI / composer - Legend: add close X + persisted-localStorage state + reopener pill; default open for first-time users. - SidePanel: rename "Skills" tab label → "Plugins" (single-line; internal panelTab enum value, component name, and store keys unchanged). - SkillsTab: registry tri-state UI (loading / error / empty) with actionable Retry button + 10s explicit fetch timeout. Handle AbortSignal.timeout's DOMException by name (TimeoutError / AbortError) — Chromium's "signal timed out" message wouldn't match the prior naive /timeout/ regex. Reset mountedRef on every mount: pre-existing StrictMode dev-mode bug where cleanup-only `current = false` was never re-set, permanently wedging every `if (mountedRef.current) setX(...)` guard and producing a "Loading…" panel that never resolved on hard refresh. - ChatTab: paste-image-from-clipboard via onPaste handler; unique monotonic-counter filenames so same-second pastes don't collide on name+size dedup. mime→ext map avoids `image/svg+xml`-style raw extensions on synthesised filenames. Bypasses the DataTransfer constructor so Safari < 14.1 / older Edge work. - ChatTab: drop stuck error toast when the WS path already delivered the agent reply but the HTTP path errored late (sendingFromAPIRef gate now covers the .catch() handler). - ChatTab: filter heartbeat-style internal self-messages from the My Chat tab so historical rows with source_id=NULL don't surface as user-typed input. - Modal portals: OrgImportPreflightModal + MissingKeysModal (ProviderPickerModal + AllKeysModal) now createPortal to document.body and clamp max-h to 80vh. Escapes the ancestor containing block (TemplatePalette's fixed+filtered sidebar re-anchored descendants' position:fixed to itself, hiding modals behind workspace cards). MissingKeysModal bumped to z-[60] for stack ordering when both modals are open. - OrgImportPreflightModal saveOne: ref-based microtask-safe in-flight gate replaces the brittle "set startValue inside a setState updater and read on the next line" pattern (React 18 doesn't guarantee functional updaters run synchronously; that path strands `saving:true` and never calls createSecret). Same useRef pattern guards SkillsTab.loadRegistry against concurrent fires and Fast-Refresh-stranded promises; force=true parameter on retry click bypasses the gate. Agent comms - AgentCommsPanel: derive UI-facing `flow` field instead of using activity_type-derived direction. Self-logged a2a_receive rows (source_id == workspace_id, what the agent runtime writes to log its own outbound delegation replies) now correctly render as OUTBOUND with → arrow + right-justified bubble. Previously they rendered "← From Self" with Restart pointing at THIS workspace. - AgentCommsPanel: error rows replace the unactionable "X failed [A2A_ERROR]" body with banner + underlying-error code-block + cause-hint (matched on Claude Code SDK init wedge, deadline-exceeded, agent-thrown exception, empty-error) + Restart [peer] / Open [peer] action buttons. - AgentCommsPanel: render text bodies through ReactMarkdown + remark-gfm so multi-part replies (tables, code) render properly. Multi-part text extractor - extractReplyText (live A2A response in ChatTab) and extractResponseText (chat history loader in message-parser): now COLLECT from every source — top-level parts, parts.root.text, and artifacts — joined with "\n". Previous "first source wins" silently dropped multi-part replies (Hermes summary+detail, Claude Code long-form table). Tests cover joined-from-parts, joined-from-artifacts, joined-from-both. Position stability - canvas-topology.buildNodesAndEdges: auto-rescue heuristic now accepts currentParentSizes map; uses max(initial min, currently grown) for the bbox check. Fixes "child jumps to weird location after 30s" — the periodic socket health-check rehydrate (silenceSec > 30) was rebuilding nodes from scratch, and the rescue's reliance on grid-derived initial size false-flagged children the user dragged into the user-grown area. - canvas.hydrate: pass live measured dimensions from the existing store into buildNodesAndEdges. - socket.RehydrateDedup: pure exported helper class that gates rehydrate calls. Two states — in-flight (in-flight Promise reused by concurrent callers) + post-completion window (1.5s, returns Promise.resolve()). Initialised with -Infinity so first call always passes the gate. Wired into ReconnectingSocket.rehydrate. A2A edges - New A2AEdge custom React Flow edge component portals its label out of the SVG layer via EdgeLabelRenderer so labels (a) render above workspace cards instead of being hidden behind them and (b) accept clicks. Click selects source + switches panel to Activity, but only on a NEW selection (preserves current tab on re-click of an already-selected source). - buildA2AEdges output tagged type:"a2a"; edgeTypes wired in Canvas.tsx. Tests - 14 new vitest cases across 4 files (964 → 978 passing): OrgImportPreflightModal saveOne single-fire / double-click, any-of rendering; AgentCommsPanel toCommMessage flow derivation in all four shapes; canvas-topology rescue respects-grown / rescues-genuine-drift / fallback-without-live-size; socket RehydrateDedup gate behaviour; message-parser multi-part response extraction. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 19:54:43 -07:00
Hongming Wang	94d9331c76	feat(canvas+platform): chat attachments, model selection, deploy/delete UX Session's accumulated UX work across frontend and platform. Reviewable in four logical sections — diff is large but internally cohesive (each section fixes a gap the next one depends on). ## Chat attachments — user ↔ agent file round trip - New POST /workspaces/:id/chat/uploads (multipart, 50 MB total / 25 MB per file, UUID-prefixed storage under /workspace/.molecule/chat-uploads/). - New GET /workspaces/:id/chat/download with RFC 6266 filename escaping and binary-safe io.CopyN streaming. - Canvas: drag-and-drop onto chat pane, pending-file pills, per-message attachment chips with fetch+blob download (anchor navigation can't carry auth headers). - A2A flow carries FileParts end-to-end; hermes template executor now consumes attachments via platform helpers. ## Platform attachment helpers (workspace/executor_helpers.py) Every runtime's executor routes through the same helpers so future runtimes inherit attachment awareness for free: - extract_attached_files — resolve workspace:/file:///bare URIs, reject traversal, skip non-existent. - build_user_content_with_files — manifest for non-image files, multi-modal list (text + image_url) for images. Respects MOLECULE_DISABLE_IMAGE_INLINING for providers whose vision adapter hangs on base64 payloads (MiniMax M2.7). - collect_outbound_files — scans agent reply for /workspace/... paths, stages each into chat-uploads/ (download endpoint whitelist), emits as FileParts in the A2A response. - ensure_workspace_writable — called at molecule-runtime startup so non-root agents can write /workspace without each template having to chmod in its Dockerfile. Hermes template executor + langgraph (a2a_executor.py) + claude-code (claude_sdk_executor.py) all adopt the helpers. ## Model selection & related platform fixes - PUT /workspaces/:id/model — was 404'ing, so canvas "Save" silently lost the model choice. Stores into workspace_secrets (MODEL_PROVIDER), auto-restarts via RestartByID. - applyRuntimeModelEnv falls back to envVars["MODEL_PROVIDER"] so Restart propagates the stored model to HERMES_DEFAULT_MODEL without needing the caller to rehydrate payload.Model. - ConfigTab Tier dropdown now reads from workspaces row, not the (stale) config.yaml — fixes "badge shows T3, form shows T2". ## ChatTab & WebSocket UX fixes - Send button no longer locks after a dropped TASK_COMPLETE — `sending` no longer initializes from data.currentTask. - A2A POST timeout 15 s → 120 s. LLM turns routinely exceed 15 s; the previous default aborted fetches while the server was still replying, producing "agent may be unreachable" on success. - socket.ts: disposed flag + reconnectTimer cancellation + handler detachment fix zombie-WebSocket in React StrictMode. - Hermes Config tab: RUNTIMES_WITH_OWN_CONFIG drops 'hermes' — the adaptor's purpose IS the form, banner was contradictory. - workspace_provision.go auto-recovery: try <runtime>-default AND bare <runtime> for template path (hermes lives at the bare name). ## Org deploy/delete animation (theme-ready CSS) - styles/theme-tokens.css — design tokens (durations, easings, colors). Light theme overrides by setting only the deltas. - styles/org-deploy.css — animation classes + keyframes, every value references a token. prefers-reduced-motion respected. - Canvas projects node.draggable=false onto locked workspaces (deploying children AND actively-deleting ids) — RF's authoritative drag lock; useDragHandlers retains a belt-and- braces check. - Organ cancel button (red pulse pill on root during deploy) cascades via existing DELETE /workspaces/:id?confirm=true. - Auto fit-view after each arrival, debounced 500 ms so rapid sibling arrivals coalesce into one fit (previous per-event fit made the viewport lurch continuously). - Auto-fit respects user-pan — onMoveEnd stamps a user-pan timestamp only when event !== null (ignores programmatic fitView) so auto-fits don't self-cancel. - deletingIds store slice + useOrgDeployState merge gives the delete flow the same dim + non-draggable treatment as deploy. - Platform-level classNames.ts shared by canvas-events + useCanvasViewport (DRY'd 3 copies of split/filter/join). ## Server payload change - org_import.go WORKSPACE_PROVISIONING broadcast now includes parent_id + parent-RELATIVE x/y (slotX/slotY) so the canvas renders the child at the right parent-nested slot without doing any absolute-position walk. createWorkspaceTree signature gains relX, relY alongside absX, absY; both call sites updated. ## Tests - workspace/tests/test_executor_helpers.py — 11 new cases covering URI resolution (including traversal rejection), attached-file extraction (both Part shapes), manifest-only vs multi-modal content, large-image skip, outbound staging, dedup, and ensure_workspace_writable (chmod 777 + non-root tolerance). - workspace-server chat_files_test.go — upload validation, Content-Disposition escaping, filename sanitisation. - workspace-server secrets_test.go — SetModel upsert, empty clears, invalid UUID rejection. - tests/e2e/test_chat_attachments_e2e.sh — round-trip against a live hermes workspace. - tests/e2e/test_chat_attachments_multiruntime_e2e.sh — static plumbing check + round-trip across hermes/langgraph/claude-code. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 13:27:51 -07:00
Hongming Wang	507696d88a	fix(canvas,server): address review findings on `3f11df03` Five review findings from the `3f11df03` six-bug commit: 1. Add TestPeers_DevModeFailOpen_{Allows,ClosedWhenAdminTokenSet, ClosedInProduction} covering all three gating states for the security-sensitive dev-mode hatch the prior commit added to /registry/:id/peers. Previously shipped untested — a future refactor could have silently inverted polarity or removed the gate. New tests pin the contract: * MOLECULE_ENV=development + ADMIN_TOKEN="" → allow bearerless * MOLECULE_ENV=development + ADMIN_TOKEN set → require token * MOLECULE_ENV=production → require token 2. ConfigTab handleSave diffs against the RAW parsed YAML / form config instead of the DEFAULT_CONFIG-merged shape. The previous code would silently PATCH tier=1 to the DB when a user deleted the `tier:` line in raw mode (the default-merge substituted 1). Now: only fields the user actually typed participate in the diff. Type guards (typeof === "number" / "string") prevent coercion surprises on malformed YAML. 3. ConfigTab model-save failure no longer lies "Saved". The /workspaces/:id/model PATCH can reject when the runtime doesn't support the chosen model; previously we caught + console.warn'd + showed green Saved, and the user watched the model revert on next reload with no explanation. Now the save path collects a `modelSaveError` and surfaces it via setError with a partial- success message ("Other fields saved, but model update failed: …") so the user sees why. 4. ChannelsTab now surfaces BOTH channels-fetch and adapters-fetch failures, distinguishing them in the error text ("Failed to load connected channels and platforms — try refreshing"). Previously only an adapters failure was visible; a channels failure left the user with an apparently-empty list and no indication the API was unreachable. 5. ChatTab panels drop the redundant aria-hidden attribute. The `hidden`/`flex` Tailwind class already sets display:none, which removes the node from the accessibility tree on its own; the extra aria-hidden invited WAI-ARIA lint warnings if a focusable descendant ever landed inside an inactive panel. Tests: 923 canvas + full Go handler suite pass. 3 new Go tests. No behaviour change on the five prior fixes — this commit tightens their edges per the independent review. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 20:29:44 -07:00
Hongming Wang	3f11df031c	fix: six UX bugs (peers auth, scroll, chat tabs, config persist, + visibility) Six bugs reported from a live session — all shippable in one commit: 1. Peers tab 401 on local Docker. The /registry/:id/peers endpoint demands a workspace-scoped bearer token (validateDiscoveryCaller) which the canvas session doesn't hold. Added the same Tier-1b dev-mode fail-open hatch that AdminAuth and WorkspaceAuth already use — gated by MOLECULE_ENV=development + empty ADMIN_TOKEN, so SaaS production stays strict. Exported IsDevModeFailOpen from the middleware package for the handler layer to reuse. 2. Org Templates list unscrollable. OrgTemplatesSection was rendered in the TemplatePalette footer — a div without overflow — so when it expanded to 15+ entries the list extended past the viewport with no scroll. Moved it to the top of the flex-1 overflow-y-auto container. Tall lists now scroll naturally. 3. Chat tab: "My Chat" and "Agent Comms" rendered stacked instead of switching. HTML `hidden` attribute was being overridden by Tailwind's `flex` class (display: flex beats the attribute), so both tabpanels rendered concurrently. Swapped to a conditional Tailwind `hidden`/`flex` class so the inactive panel is display:none with proper CSS specificity. 4. Hermes Config form never persists. handleSave wrote config.yaml but name / tier / runtime / model all live on the workspace row (or the dedicated /workspaces/:id/model endpoint) — the form edited in-memory, the request returned 200, the next reload wiped everything back. Hermes + external runtimes manage their own config inside the container anyway, so writing config.yaml is a no-op for them; skip it. Always diff and PATCH the DB-backed fields that actually changed. 5. Channels "+ Connect" dropdown empty on first open. ChannelsTab's load() used Promise.all with a silent catch — if EITHER the channels or adapters fetch failed, both setters were skipped with no error visible. Switched to Promise.allSettled so each endpoint settles independently, and the adapters failure now surfaces via the top-level error state. 6. Plugin registry always "No plugins in registry". Same silent catch pattern in SkillsTab.tsx — load errors for /plugins, /plugins/sources, and /workspaces/:id/plugins swallowed without logging. Replaced the empty catches with console.warn so future failures are at least visible in devtools. Tests: 923 passing (unchanged). Go handler tests pass. Server rebuilt and running with the peers-auth + collapsed-persistence fixes (pid 15875). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 20:18:30 -07:00
Hongming Wang	7356cf8d3a	fix(chat): clear sending spinner when any path delivers the reply Two latent bugs kept the "Processing with Claude Code..." timer ticking after the agent had already answered: 1. The A2A_RESPONSE store handler wrote into agentMessages[workspaceId] (no prefix) but ChatTab's "clear sending" effect subscribed to agentMessages["a2a:" + workspaceId]. Keys never matched — the effect was dead code from day one. Removed the dead subscription and moved the setSending(false) into the pendingAgentMsgs effect so any reply delivered via a WS push (Claude Code SDK, Hermes's send_message_to_user) also closes the spinner. 2. Added an activity-log fallback: when the platform emits a successful a2a_receive ACTIVITY_LOGGED for this workspace, clear sending and stop the timer. That covers the "runtime answered but we never saw the store message" case Claude Code exhibited tonight — the HTTP request can stay in flight while the SDK already pushed its reply. Symmetric a2a_receive error path also clears sending and surfaces the error message, so a runtime-side failure no longer hangs the UI. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 17:43:30 -07:00
Hongming Wang	b4719ad070	fix(canvas): Legend avoids TemplatePalette + silence WS handshake races ### Two unrelated but small UI fixes surfaced while testing the Canvas 1. Legend hidden under the open TemplatePalette. Legend is `fixed bottom-6 left-4 z-30`. TemplatePalette's drawer (when open) is `fixed top-0 left-0 w-[280px] z-30` — same z-index, same left-edge column. The Legend overlapped the palette's bottom 180 px. Published the palette-open state to the canvas store so the Legend can shift right (to `left-[296px]` — 280 px palette + 16 px gap) while the palette is open, animated via a 200 ms `transition-[left]` to match the palette's slide. Closes cleanly back to `left-4` when the palette is dismissed. Files: - `store/canvas.ts` — added `templatePaletteOpen` + `setTemplatePaletteOpen`. - `TemplatePalette.tsx` — calls `setTemplatePaletteOpen(open)` on every open/close transition via a new useEffect. - `Legend.tsx` — reads the flag and swaps `left-4` <-> `left-[296px]`. 2. "WebSocket is closed before the connection is established" spam. Two components (`ChatTab`, `AgentCommsPanel`) open their own short- lived WebSocket to tail the ACTIVITY_LOGGED stream. Their cleanup path called `ws.close()` unconditionally, which trips a browser console warning when React StrictMode re-runs the effect in dev and the handshake hasn't completed yet. Confirmed via DevTools console on the running canvas. Added a `closeWebSocketGracefully(ws)` helper in `lib/ws-close.ts`: - OPEN / CLOSING → close immediately (normal path). - CONNECTING → defer close to the 'open' listener so the browser sees a full handshake. Also wires an 'error' listener that cancels the queued close if the handshake fails (no double-close). - CLOSED → no-op. Both consumers now call the helper in their useEffect cleanup. Silences the warning without changing observable behaviour. ### Tests `canvas/src/lib/__tests__/ws-close.test.ts` — 5 cases with a fake WebSocket covering each readyState branch plus the error-before-open cancellation path. Full vitest suite: 927/927 pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 16:03:01 -07:00
Hongming Wang	de99a22ffc	fix(quickstart): hotfixes discovered during live testing session Five additional breakages surfaced while testing the restored stack end-to-end (spin up Hermes template → click node → open side panel → configure secrets → send chat). Each fix is narrowly scoped and has matching unit or e2e tests so they don't regress. ### 1. SSRF defence blocked loopback A2A on self-hosted Docker handlers/ssrf.go was rejecting `http://127.0.0.1:<port>` workspace URLs as loopback, so POST /workspaces/:id/a2a returned 502 on every Canvas chat send in local-dev. The provisioner on self-hosted Docker publishes each container's A2A port on 127.0.0.1:<ephemeral> — that's the only reachable address for the platform-on-host path. Added `devModeAllowsLoopback()` — allows loopback only when MOLECULE_ENV ∈ {development, dev}. SaaS (MOLECULE_ENV=production) continues to block loopback; every other blocked range (metadata 169.254/16, TEST-NET, CGNAT, link-local) stays blocked in dev mode. Tests: 5 new tests in ssrf_test.go covering dev-mode loopback, dev-mode short-alias ("dev"), production still blocks loopback, dev-mode still blocks every other range, and a 9-case table test of the predicate with case/whitespace/typo variants. ### 2. canvas/src/lib/api.ts: 401 → login redirect broke localhost Every 401 called `redirectToLogin()` which navigates to `/cp/auth/login`. That route exists only on SaaS (mounted by the cp_proxy when CP_UPSTREAM_URL is set). On localhost it 404s — users landed on a blank "404 page not found" instead of seeing the actual error they should fix. Gated the redirect on the SaaS-tenant slug check: on <slug>.moleculesai.app, redirect unchanged; on any non-SaaS host (localhost, LAN IP, reserved subdomains like app.moleculesai.app), throw a real error so the calling component can render a retry affordance. Tests: 4 new vitest cases in a dedicated api-401.test.ts (needs jsdom for window.location.hostname) — SaaS redirects, localhost throws, LAN hostname throws, reserved apex throws. ### 3. SecretsSection rendered a hardcoded key list config/secrets-section.tsx shipped a fixed COMMON_KEYS list (Anthropic / OpenAI / Google / SERP / Model Override) regardless of what the workspace's template actually needed. A Hermes workspace declaring MINIMAX_API_KEY in required_env got five irrelevant slots and nothing for the key it actually needed. Made the slot list template-driven via a new `requiredEnv?: string[]` prop passed down from ConfigTab. Added `KNOWN_LABELS` for well-known names and `humanizeKeyName` to turn arbitrary SCREAMING_SNAKE_CASE into a readable label (e.g. MINIMAX_API_KEY → "Minimax API Key"). Acronyms (API, URL, ID, SDK, MCP, LLM, AI) stay uppercase. Legacy fallback preserved when required_env is empty. Tests: 8 new vitest cases covering known-label lookup, humanise fallback, acronym preservation, deduplication, and both fallback paths. ### 4. Confusing placeholder in Required Env Vars field The TagList in ConfigTab labelled "Required Env Vars (from template)" is a DECLARATION field — stores variable names. The placeholder "e.g. CLAUDE_CODE_OAUTH_TOKEN" suggested that, but users naturally typed the value of their API key into the field instead. The actual values go in the Secrets section further down the tab. Relabelled to "Required Env Var Names (from template)", changed the placeholder to "variable NAME (e.g. ANTHROPIC_API_KEY) — not the value", and added a one-line helper below pointing to Secrets. ### 5. Agent chat replies rendered 2-3 times Three delivery paths can fire for a single agent reply — HTTP response to POST /a2a, A2A_RESPONSE WS event, and a send_message_to_user WS push. Paths 2↔3 were already guarded by `sendingFromAPIRef`; path 1 had no guard. Hermes emits both the reply body AND a send_message_to_user with the same text, which manifested as duplicate bubbles with identical timestamps. Added `appendMessageDeduped(prev, msg, windowMs = 3000)` in chat/types.ts — dedupes on (role, content) within a 3s window. Threaded into all three setMessages call sites. The window is short enough that legitimate repeat messages ("hi", "hi") from a real user/agent a few seconds apart still render. Tests: 8 new vitest cases covering empty history, different content, duplicate within window, different roles, window elapsed, stale match, malformed timestamps, and custom window. ### 6. New end-to-end regression test tests/e2e/test_dev_mode.sh — 7 HTTP assertions that run against a live platform with MOLECULE_ENV=development and catch regressions on all the dev-mode escape hatches in a single pass: AdminAuth (empty DB + after-token), WorkspaceAuth (/activity, /delegations), AdminAuth on /approvals/pending, and the populated /org/templates response. Shellcheck-clean. ### Test sweep - `go test -race ./internal/handlers/ ./internal/middleware/ ./internal/provisioner/` — all pass - `npx vitest run` in canvas — 922/922 pass (up from 902) - `shellcheck --severity=warning infra/scripts/setup.sh tests/e2e/test_dev_mode.sh` — clean - `bash tests/e2e/test_dev_mode.sh` — 7/7 pass against a live platform + populated template registry ### SaaS parity Every relaxation remains conditional on MOLECULE_ENV=development. Production tenants run MOLECULE_ENV=production (enforced by the secrets-encryption strict-init path) and always set ADMIN_TOKEN, so none of these code paths fire on hosted SaaS. Behaviour on real tenants is byte-for-byte unchanged. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 14:57:18 -07:00
Hongming Wang	ecad02eadc	fix: ChatTab comment path for workspace-server rename Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-18 01:48:59 -07:00
Molecule AI Frontend Engineer	91957dff4d	fix(canvas): expose loadMessagesFromDB failures with error banner + Retry Previously loadMessagesFromDB swallowed all errors and returned [] — a network failure was indistinguishable from an empty history, so the user had no way to know loading failed. Now the function returns { messages, error } and the MyChatPanel renders a role="alert" banner with the error message and a Retry button when messages are empty and a load error occurred. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 20:34:48 +00:00
Canvas Agent	c33b59a93a	fix(canvas): QA blockers — ChatTab aria-controls, AuthGate test, CommunicationOverlay status icons BLOCKER 1 (ChatTab.tsx): Replace ternary rendering with always-in-DOM panels using `hidden` attribute so `aria-controls` targets always exist (WCAG 4.1.2). Add `id` to tab buttons for `aria-labelledby` back-reference. Non-blocking: change `key={i}` → `key={line + i}` on activity log items. BLOCKER 2 (AuthGate.test.tsx): Create test file asserting the loading state renders a `.bg-zinc-950.fixed.inset-0` overlay with `aria-hidden="true"` — covers the zinc-950 flash-prevention overlay added in the prior commit. BLOCKER 3 (CommunicationOverlay.tsx): Add `aria-hidden="true"` to the status icon span so decorative glyphs (✓ ✕ ⏱) are not announced by screen readers. Tests: 490/490 passing. Build: clean. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 10:53:52 +00:00
Canvas Agent	28f3e33581	fix(canvas): UIUX Cycle 15 dark-theme & a11y sweep (C1-C5, A1-A4, F1, M1) - C4: OnboardingWizard skip button — aria-label + text-zinc-400 (was zinc-600) - A1+M1: CommunicationOverlay — aria-label on both icon buttons, aria-hidden on decorative arrow glyphs (↗↙ toggle, ✕ close, → comms rows) - A2: ChatTab sub-tab bar — ARIA roving tabIndex + ArrowLeft/ArrowRight keyboard navigation (role=tablist/tab already present) - A4: SearchDialog search input — focus-visible:ring-2 ring-blue-500 replaces bare focus:outline-none so keyboard focus is visible - F1: AuthGate loading state — zinc-950 full-screen backdrop instead of null (prevents white flash on SaaS tenant load) - A3: SidePanel tab bar — wrap in relative container + right-edge fade gradient so truncated tabs are visually signalled C2 (settings-panel.css input backgrounds) and C3 (Canvas.tsx colorMode="dark") were already in place; verified by code audit before this commit. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 10:35:32 +00:00
Dev Lead Agent	cf8db07020	fix(canvas): WCAG critical — ARIA live toasts, dialog focus trap, keyboard nav Addresses the three release-blocking WCAG violations from the UX audit (3rd consecutive cycle) and the new ChatTab ARIA gap from Audit #2. Changes: - Toaster: split into polite (success/info) + assertive (error) live regions, both always in DOM so screen readers register them before any toast fires. Adds x dismiss button on every toast. Errors no longer auto-expire after 4s — persist until explicitly dismissed. - ConfirmDialog: on open, requestAnimationFrame focuses the first button inside the dialog. Tab/Shift-Tab is now trapped inside the dialog while open. Added role="dialog" aria-modal="true" and aria-labelledby pointing to the title h3. - WorkspaceNode: outer div gains role="button", tabIndex={0}, aria-label, aria-pressed, and onKeyDown (Enter/Space => selectNode, ContextMenu key => openContextMenu). Keyboard-only users can now reach and activate workspace nodes. - ChatTab sub-tab bar: role="tablist" on wrapper, role="tab" + aria-selected + aria-controls on each button, matching role="tabpanel" + id on each panel div. Textarea gets aria-label="Message to agent". 453/453 Vitest tests pass. Production build clean (Next.js 15). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-15 08:31:06 +00:00
Dev Lead Agent	95abca2f4f	fix(a11y): prefers-reduced-motion WCAG 2.3.3 compliance globals.css: append @media (prefers-reduced-motion: reduce) block that zeroes animation/transition durations, disables .animate-in/.slide-in-from-* entry animations (Toaster, ApprovalBanner, SidePanel slide), strips dashdraw and node-appear keyframes from React Flow elements. Components: replace all bare animate-pulse (13 occurrences across WorkspaceNode, StatusDot, Toolbar, SidePanel, Legend, SearchDialog, TerminalTab, TemplatePalette) with motion-safe:animate-pulse so status indicator pulsing stops for users with vestibular disorders. Replace 3 animate-bounce occurrences in ChatTab typing indicator with motion-safe:animate-bounce. Tests: new canvas/src/__tests__/reduced-motion.test.ts (12 tests) verifies the @media block is present in globals.css and that every component file uses the motion-safe: variant rather than bare animation classes. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-14 11:25:23 +00:00
Dev Lead Agent	7cdbd0d2a8	fix: workspace parent combobox, WCAG button text minimum 11px Replace raw Parent Workspace ID text input with a <select> populated from GET /workspaces (T{tier} · {name} format, graceful fallback on fetch error). Raise all interactive button text from text-[8px]/[9px] to text-[11px] across SkillsTab, ScheduleTab, secrets-section, ActivityTab, SidePanel, ChatTab; non-interactive labels/badges to text-[10px]. Adds 7 CreateWorkspaceDialog unit tests (372/372 passing). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-14 07:27:49 +00:00
Hongming Wang	74e2da8b92	chore: quality pass — native dialogs, env sync, Go handler splits Three parallel cleanups driven by the second code-review pass. ## Native dialogs → ConfirmDialog (7 sites) Violated the standing feedback_no_native_dialogs rule. - ChannelsTab: confirm() → ConfirmDialog danger variant with pendingDelete state - ScheduleTab: window.confirm() → ConfirmDialog danger - ChatTab: confirm("Restart...") → ConfirmDialog warning (restart is recoverable) - TemplatePalette: two alert() sites collapsed into a single notice state + ConfirmDialog as OK-only info toast - ErrorBoundary: dropped both window.alert calls entirely. Clipboard-copy click is self-evident; console.error already captures the fallback. ## .env.example ↔ Go env var sync Added 11 previously-undocumented env vars grouped into 6 new sections: - Platform: PLATFORM_URL, MOLECULE_URL, WORKSPACE_DIR, MOLECULE_ENV - CORS / rate limiting: CORS_ORIGINS, RATE_LIMIT - Activity retention: ACTIVITY_RETENTION_DAYS, ACTIVITY_CLEANUP_INTERVAL_HOURS - Container detection: MOLECULE_IN_DOCKER (moved to dedup) - Observability: AWARENESS_URL - Webhooks: GITHUB_WEBHOOK_SECRET - CLI: MOLECLI_URL All 21 distinct os.Getenv / envx.* keys (excluding HOME) now documented. Zero orphans in the other direction. ## Go handler function splits (4 funcs, pure refactor) No behavior change; same tests pass. \| Function \| Before \| After \| Helpers \| \|---------------------------\|-------:\|------:\|---------------------------------------------------------------\| \| proxyA2ARequest \| 257 \| 56 \| resolveAgentURL, normalizeA2APayload, dispatchA2A, \| \| \| \| \| handleA2ADispatchError, maybeMarkContainerDead, \| \| \| \| \| logA2AFailure, logA2ASuccess \| \| Delegate \| 127 \| 60 \| bindDelegateRequest, lookupIdempotentDelegation, \| \| \| \| \| insertDelegationRow \| \| Discover \| 125 \| 40 \| discoverWorkspacePeer, writeExternalWorkspaceURL, \| \| \| \| \| discoverHostPeer \| \| SessionSearch \| 109 \| 24 \| parseSessionSearchParams, buildSessionSearchQuery, \| \| \| \| \| scanSessionSearchRows \| Preserved exact error semantics, log.Printf calls, status codes, and response shapes. Introduced a proxyDispatchBuildError sentinel in a2a_proxy so the orchestrator can distinguish "couldn't build the request" from "Do() failed" without changing existing branches. ## Verification - go build ./... clean - go vet ./... clean - go test -race ./internal/... — all pass - canvas npm run build — clean - canvas npm test -- --run — 352/352 pass - grep window.confirm\|window.alert\|window.prompt in canvas/src — 0 matches - every platform os.Getenv key present in .env.example Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 14:36:30 -07:00
Hongming Wang	24fec62d7f	initial commit — Molecule AI platform Forked clean from public hackathon repo (Starfire-AgentTeam, BSL 1.1) with full rebrand to Molecule AI under github.com/Molecule-AI/molecule-monorepo. Brand: Starfire → Molecule AI. Slug: starfire / agent-molecule → molecule. Env vars: STARFIRE_* → MOLECULE_*. Go module: github.com/agent-molecule/platform → github.com/Molecule-AI/molecule-monorepo/platform. Python packages: starfire_plugin → molecule_plugin, starfire_agent → molecule_agent. DB: agentmolecule → molecule. History truncated; see public repo for prior commits and contributor attribution. Verified green: go test -race ./... (platform), pytest (workspace-template 1129 + sdk 132), vitest (canvas 352), build (mcp). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 11:55:37 -07:00

42 Commits