The TemplatePalette deploy modal (MissingKeysModal → ProviderPickerModal)
let the model field and provider radio drift apart. When a hermes
template defaulted the model to "MiniMax-M2.7-highspeed" but the radio
defaulted to providers[0] (Anthropic), the env-var input below asked
for ANTHROPIC_API_KEY. A user pasting their MINIMAX_API_KEY there (or
just dismissing the dialog) ended up with a workspace whose
runtime_config.model=MiniMax + ANTHROPIC_API_KEY env — the hermes
adapter then crashed during boot before /registry/register, surfacing
as WORKSPACE_PROVISION_FAILED 12 minutes later.
Caught 2026-05-02 on hongming/Hermes Agent (workspace 95ed3ff2-…
ended with: "container started but never called /registry/register").
Sibling of the ConfigTab cascade fix in PR #2516 (task #236) — same
pattern, different surface. Plumbs the template's full ModelSpec[]
(with required_env per model) into the picker. When the typed model
matches a registry entry, the radio snaps to that entry's provider so
the env-var fields underneath match what the model actually needs.
Free-text models (typed slug not in the registry) and models with no
required_env (local/self-hosted endpoints) leave the radio alone — the
user can still pick a provider manually. Backwards-compat: callers
that don't pass `models` get the pre-cascade behavior, pinned by a
regression test.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
PR #32 (workspace template) merged 2026-05-02; image rebuild
succeeded. Plugin baked in. Local full-chain E2E green; caught + fixed
a real KeyError in upstream hermes_cli/tools_config.py. Upstream PR
#18775 still OPEN/CONFLICTING — not on critical path.
Also rewrites hermes-platform-plugins-upstream-pr.md to reflect the
final landing shape (existing hermes_cli/plugins.py, not a new
plugins/platforms/ system).
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
The harness needs Authorization + X-Molecule-Org-Id (per-tenant, NOT
CP_ADMIN_API_TOKEN) when targeting *.moleculesai.app subdomains.
The existing single-Origin-header form silently failed with a 404 against
staging tenants because the SaaS edge WAF rewrites unauthenticated
/workspaces calls to Next.js (per
reference_saas_waf_origin_header.md).
Switch to a headers array so multiple -H flags compose cleanly with
curl arg-quoting, and document the env var contract at the top of
the script.
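The headers-array shape, roughly (a sketch: variable names here are
illustrative, the real env-var names are the ones documented at the top of
the script, and /workspaces stands in for whatever endpoint is being hit):

  # Sketch only; env-var names are illustrative.
  auth_headers=(
    -H "Authorization: Bearer ${TENANT_API_TOKEN:?}"
    -H "X-Molecule-Org-Id: ${TENANT_ORG_ID:?}"
    -H "Origin: ${PLATFORM:?}"
  )
  # Each -H stays its own array element, so curl receives every flag intact
  # regardless of spaces or colons in the header values.
  curl -fsS "${auth_headers[@]}" "$PLATFORM/workspaces"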
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Adds two scripts:
scripts/test-all-runtimes-a2a-e2e.sh
Provisions one workspace per runtime (claude-code, hermes, codex,
openclaw), sets provider keys, waits for each to come online, and sends two A2A messages
per workspace. First message validates round-trip; second message
validates session continuity. Cleans up via trap on EXIT.
scripts/test-hermes-plugin-e2e.sh
Hermes-only variant focused on the plugin /a2a/inbound path.
Proof-point: session continuity between turns (the plugin path's
deliverable; old chat-completions path lost context per turn).
Both honor SKIP_<runtime> env vars for incremental testing and tolerate
the SaaS edge WAF Origin header requirement (per
reference_saas_waf_origin_header.md).
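The SKIP_<runtime> guard is roughly this shape (illustrative sketch, not
the scripts' exact loop body):

  for runtime in claude-code hermes codex openclaw; do
    skip_var="SKIP_$(echo "$runtime" | tr 'a-z-' 'A-Z_')"  # e.g. SKIP_CLAUDE_CODE
    if [ -n "${!skip_var:-}" ]; then
      echo "skipping $runtime ($skip_var is set)"
      continue
    fi
    # ... provision workspace, set provider keys, wait online,
    # send the two A2A messages (round-trip, then session continuity) ...
  done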
Run:
PLATFORM=https://demo-tenant.staging.moleculesai.app \
./scripts/test-all-runtimes-a2a-e2e.sh
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
The hourly "Sweep stale Cloudflare Tunnels" job got cancelled mid-cleanup
on 2026-05-02 (run 25248788312, killed at the 5-minute cap after deleting
424 of 672 stale tunnels). A second manual dispatch finished the remaining 254
fine, so the immediate backlog cleared, but two underlying bugs would
re-trip on the next big cleanup.
Bug 1: serial delete loop. The execute branch was a `while read; do
curl -X DELETE; done` pipeline at ~0.7s/tunnel — fine for the
steady-state cleanup of a handful, but a 600+ backlog needs ~7-8min.
This commit fans out to $SWEEP_CONCURRENCY (default 8) workers via
`xargs -P 8 -L 1 -I {} bash -c '...' _ {} < "$DELETE_PLAN"`. With 8x
parallelism the same 600+ list drains in ~60s. Notes:
- We use stdin (`<`) not GNU's `xargs -a FILE` so the script stays
portable to BSD xargs (matters for local-runner testing on macOS).
- We pass ONLY the tunnel id on argv. xargs tokenizes on whitespace
by default; tab-separating id+name on argv risks mangling. The
name is kept in a side-channel id->name map ($NAME_MAP) and looked
up by the worker only on failure, for FAIL_LOG readability.
- Workers print exactly `OK` or `FAIL` on stdout; tally with
`grep -c '^OK$' / '^FAIL$'`.
- On non-zero FAILED, log the first 20 lines of $FAIL_LOG as
"Failure detail (first 20):" — same diagnostic surface as before
but consolidated so we don't spam logs on a flaky CF API.
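Shape of the fan-out, roughly (illustrative sketch: the real worker checks
the CF API's JSON success field rather than trusting curl's exit code, and
the auth header is omitted here):

  export CF_API_BASE NAME_MAP FAIL_LOG
  xargs -P "${SWEEP_CONCURRENCY:-8}" -L 1 -I {} bash -c '
    id="$1"
    if curl -fsS -X DELETE "$CF_API_BASE/tunnels/$id" >/dev/null 2>&1; then
      echo OK
    else
      # failure path: recover the tunnel name from the side-channel map
      grep -m1 "^$id" "$NAME_MAP" >> "$FAIL_LOG" || echo "$id" >> "$FAIL_LOG"
      echo FAIL
    fi
  ' _ {} < "$DELETE_PLAN" > "$RESULT_LOG"
  DELETED=$(grep -c "^OK$" "$RESULT_LOG" || true)
  FAILED=$(grep -c "^FAIL$" "$RESULT_LOG" || true)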
Bug 2: the workflow's 5-minute cap was meant as a hang detector but in
practice tripped on a legitimately slow job. Raised to 30 min — generous
headroom for the ~60s steady-state run while still surfacing genuine
hangs (and in line with the sweep-cf-orphans companion job).
Bug 3 (drive-by): the existing trap was `trap 'rm -rf "$PAGES_DIR"'
EXIT`, which would have been silently overwritten by any later trap
registration. Replaced with a single `cleanup()` function that wipes
PAGES_DIR + all four new tempfiles (DELETE_PLAN, NAME_MAP, FAIL_LOG,
RESULT_LOG), called once via `trap cleanup EXIT`.
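In shape:

  # Single consolidated cleanup; later trap registrations no longer clobber it.
  cleanup() {
    rm -rf "$PAGES_DIR"
    rm -f "$DELETE_PLAN" "$NAME_MAP" "$FAIL_LOG" "$RESULT_LOG"
  }
  trap cleanup EXIT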
Verification:
- bash -n scripts/ops/sweep-cf-tunnels.sh: clean
- shellcheck -S warning scripts/ops/sweep-cf-tunnels.sh: clean
- python3 yaml.safe_load on the workflow: clean
- Synthetic 30-line delete plan with every 7th id sentinel'd to
return {"success":false}: TEST PASS, DELETED=26 FAILED=4, FAIL_LOG
side-channel name lookup verified.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Follow-up to PR #2509/#2510. The defensive v1-detection branches in
extract_attached_files (Python) and extractFilesFromTask (TypeScript)
were merged with comments claiming they fix a "v0→v1 silent-drop"
bug that surfaced as the 2026-05-01 hongming "no text content"
incident. Live test disproved that hypothesis: a2a-sdk's JSON-RPC
layer validates inbound requests against the v0 Pydantic union, so
v1 shapes are rejected at the request boundary — the v1 detection
branch is unreachable on the JSON-RPC ingress path. The actual root
cause of the hongming incident was the missing /workspace chown
fixed by CP PR #381 + test #382.
Update the comments to honestly describe these branches as
defensive future-proofing (kept against an eventual SDK schema
migration or in-process callers that construct Parts directly from
protobuf), not as fixes for an observed bug. Also trims
ChatTab.tsx's outbound-shape comment block from ~21 lines to a
3-line pointer to the SDK union.
Comment-only change. No behavior change. 86 workspace tests + 91
canvas tests still pass.
Adds the OpenAI Codex CLI as a Molecule workspace runtime and lands
the design docs that drove the runtime-native MCP parity push across
claude-code, hermes, openclaw, and codex.
manifest.json:
- Adds `codex` workspace_template entry pointing at the new
Molecule-AI/molecule-ai-workspace-template-codex repo (initial
commit landed there in parallel; 14 files / 1411 LOC). The
workspace-server runtime registry already had `codex` in its
fallback set — this entry makes it manifest-reachable in prod.
docs/integrations/:
- runtime-native-mcp-status.md — index across all four runtime streams
- codex-app-server-adapter-design.md — full design including v2 RPC
sequence, executor skeleton, schema-vs-runtime drift findings
(real codex 0.72 returns thread.id, schema says thread.threadId)
- hermes-platform-plugins-upstream-pr.md — pre-submission draft of
the hermes-agent upstream PR
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Recurring failure pattern in redeploy-tenants-on-staging:
##[error]redeploy-fleet returned HTTP 500
##[error]Process completed with exit code 1.
with the per-tenant breakdown in the response body showing the failures
were on ephemeral e2e-* tenants (saas/canvas/ext) whose parent E2E run
tore them down mid-redeploy — SSM exit=2 because the EC2 was already
terminating, or healthz timeout because the CF tunnel was already gone.
The actual operator-facing tenants (dryrun-98407, demo-prep, etc) all
rolled fine in the same call.
This shape repeats every staging push that overlaps an active E2E run.
The downstream `Verify each staging tenant /buildinfo matches published
SHA` step ALREADY distinguishes STALE vs UNREACHABLE for exactly this
reason (per #2402); only the top-level `if HTTP_CODE != 200; exit 1`
gate misclassifies the race.
Filter: HTTP 500 + every failed slug matches `^e2e-` → soft-warn and
fall through to verify. Any non-e2e-* failure or non-500 HTTP remains
a hard fail, with the failed non-e2e slugs surfaced in the error so
the operator doesn't have to dig the response body out of CI.
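Gate shape, roughly (a sketch: the response field names below are
illustrative and the workflow's actual jq paths may differ):

  if [ "$HTTP_CODE" != "200" ]; then
    failed_slugs=$(jq -r '.results[] | select(.ok == false) | .slug' response.json)
    non_e2e=$(printf '%s\n' "$failed_slugs" | grep -v '^e2e-' || true)
    if [ "$HTTP_CODE" = "500" ] && [ -n "$failed_slugs" ] && [ -z "$non_e2e" ]; then
      echo "::warning::redeploy-fleet 500 but only e2e-* tenants failed; continuing to verify"
    else
      echo "::error::redeploy-fleet failed (HTTP $HTTP_CODE); non-e2e failures: ${non_e2e:-none}"
      exit 1
    fi
  fi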
Verified the gate logic with 6 synthetic CP responses (happy / e2e-only
race / mixed real+e2e fail / non-200 / 200+ok=false / all-real-fail) —
all behave correctly.
prod's redeploy-tenants-on-main is intentionally NOT touched: prod CP
serves no e2e-* tenants, so the race can't occur there and the strict
gate is the right behavior.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
The previous PR (#2509) flipped canvas outbound file parts to the v1
flat shape `{url, filename, mediaType}` based on a hypothesis that
a2a-sdk's JSON-RPC parser silently dropped v0 `{kind:"file", file:{...}}`
shapes. Live test shows the opposite: a2a-sdk's JSON-RPC layer
validates against the v0 Pydantic discriminated union (TextPart |
FilePart | DataPart), so the v1 flat shape is rejected with:
Invalid Request:
params.message.parts.0.TextPart.text — Field required
params.message.parts.0.FilePart.file — Field required
params.message.parts.0.DataPart.data — Field required
The actual root cause of the user-visible "Error: message contained
no text content" was the missing `/workspace` chown (CP PR #381 +
test pin #382), not a wire-shape mismatch. Verified end-to-end by
sending a v0 image-only message after PR #381 + workspace re-provision
— agent receives the file, reads its bytes, and replies normally.
Reverting only the canvas outbound shape. Defensive v1-tolerance
stays in:
- workspace/executor_helpers.py — extract_attached_files still
accepts v1 protobuf parts in case a future client emits them or
a future SDK release flips internal representation. Harmless on
the v0 hot path.
- canvas/message-parser.ts — extractFilesFromTask still tolerates
v1 shape on incoming agent responses. Some agents may emit v1
when their internal serializer round-trips through protobuf.
Tests stay green (91 canvas, 86 workspace).
Image-only chats surface "Error: message contained no text content"
because canvas posts v0 `{kind:"file", file:{uri,name,mimeType}}` shapes
that the workspace runtime's a2a-sdk v1 protobuf parser silently drops:
v1 `Part` has fields `[text, raw, url, data, metadata, filename,
media_type]` and `ignore_unknown_fields=True` discards `kind`+`file`,
producing a fully-empty Part. With no text and no extracted file
attachments, the executor's "no text content" guard fires.
Three coordinated changes close the gap:
1. canvas/ChatTab.tsx — outbound file parts now carry the v1 flat
shape `{url, filename, mediaType}` so the v1 protobuf parser
populates Part fields instead of dropping them.
2. workspace/executor_helpers.py — extract_attached_files learns the
v1 detection branch (non-empty `part.url` + `filename` +
`media_type`) alongside the existing v0 RootModel and flat-file
shapes. Defends every runtime that mounts the OSS wheel against
the same drop, including any pre-fix client still on the wire.
3. canvas/message-parser.ts — extractFilesFromTask tolerates the v1
shape on incoming agent responses too, so file chips render in
chat history regardless of which Part shape the runtime emits.
Test pins:
- workspace/tests/test_executor_helpers.py:
+ v1 protobuf shape extraction
+ empty-Part defense (v0→v1 silent-drop fall-through returns [])
- canvas message-parser test:
+ v1 protobuf flat parts
+ filename fallback to URL basename for v1
The page-merge loop passed the entire accumulating tunnel JSON to
python3 -c via argv on every iteration. On a busy account (verified
2026-05-02: 672 tunnels, 14 pages on Hongmingwangrabbit account) this
exceeds the GH Ubuntu runner's combined argv+envp limit (~128 KB) and
dies with `python3: Argument list too long` at exit 126 — the workflow
has been silently failing this way since the very first run that hit a
real account, masked earlier by a missing-CF_ACCOUNT_ID secret check.
Buffer each page response to a file under a temp dir, merge from disk
at the end. Also bumps the page cap from 20 to 40 (1000 → 2000 tunnel
ceiling) so the existing soft-cap warning has headroom; the disk-merge
shape is O(n) in tunnel count rather than the previous O(n^2) so the
larger ceiling is cheap.
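The buffered shape, roughly (endpoint, page size, and helper names are
illustrative; the point is that only file paths ever travel on argv):

  PAGES_DIR=$(mktemp -d)
  page=1
  while [ "$page" -le "$MAX_PAGES" ]; do
    out="$PAGES_DIR/page_$(printf '%03d' "$page").json"
    curl -fsS -H "Authorization: Bearer $CF_API_TOKEN" \
      "$CF_API_BASE/tunnels?per_page=50&page=$page" > "$out"
    # (loop exits early when a page comes back with an empty result set)
    page=$((page + 1))
  done
  # The merge step reads the page files from disk; the accumulated JSON
  # never rides on the python3 command line, only the directory path does.
  python3 "$MERGE_HELPER" "$PAGES_DIR" > "$TUNNELS_JSON"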
Verified locally against the live account (672 tunnels): script now
runs cleanly to the existing MAX_DELETE_PCT safety gate, which trips
at 99% > 90% as designed and surfaces the actual orphan backlog for
operator-driven cleanup.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Canary started flaking 2026-05-01 22:11 with model-refusal replies:
- "I'm unable to do that."
- "I'm unable to fulfill that request. Can I assist you with anything else?"
- "I'm unable to reply with responses that don't allow me to fulfill tasks…"
3 fails / 10 recent runs ≈ 30% flake.
Trigger: 2026-04-30's Platform Capabilities preamble (#2332) added the
directive "Use them proactively" to the top of every system prompt.
Combined with the heavy A2A + HMA tool docs further down, the model
reads the contrived bare-echo prompt ("Reply with exactly: PONG") as
out-of-role and intermittently refuses.
Real user prompts don't hit this — only the synthetic smoke prompt does,
so the right fix is in the canary's prompt phrasing, not the platform's
system prompt (which is correctly priming agents toward tool use). New
phrasing explicitly tells the model "this is a smoke test" and "no
tools or memory are needed" so it has permission to comply.
Also updates the child workspace's CHILD_PONG prompt with the same
framing — the same failure mode would have hit it once full-mode runs again.
No code change to system prompt, no test infra change. Just two prompt
strings + a load-bearing comment so future readers don't trim back to
the brittle phrasing.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>