molecule-core

Author	SHA1	Message	Date
Molecule AI Core-QA	c38df4df9c	fix(workspace): rename _warn_if_stdio_not_pipe → _assert_stdio_is_pipe_compatible The test file on main patches a2a_mcp_server._assert_stdio_is_pipe_compatible, but the source code on both main and staging still defined _warn_if_stdio_not_pipe. Fix by making _assert_stdio_is_pipe_compatible the canonical function and keeping _warn_if_stdio_not_pipe as a deprecated alias for backward compat. Fixes: regression in test_a2a_mcp_server_http.py (5 tests) and test_a2a_mcp_server.py (4 tests) that were failing due to dangling monkeypatch targets. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 18:38:12 +00:00
molecule-operator	6b0dd62a60	chore: promote main→staging v4 (OFFSEC-003 revert + delegation tests) Some checks failed Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 16s Details publish-runtime-autobump / bump-and-tag (pull_request) Has been skipped Details Harness Replays / detect-changes (pull_request) Successful in 10s Details CI / Detect changes (pull_request) Successful in 26s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 26s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 24s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 17s Details gate-check-v3 / gate-check (pull_request) Successful in 19s Details qa-review / approved (pull_request) Successful in 18s Details security-review / approved (pull_request) Successful in 17s Details sop-checklist / all-items-acked (pull_request) Successful in 14s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 24s Details sop-tier-check / tier-check (pull_request) Successful in 13s Details publish-runtime-autobump / pr-validate (pull_request) Successful in 42s Details lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m11s Details audit-force-merge / audit (pull_request) Successful in 21s Details Harness Replays / Harness Replays (pull_request) Successful in 24s Details CI / Canvas (Next.js) (pull_request) Successful in 9s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 13s Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 2m32s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 3m11s Details CI / Canvas Deploy Reminder (pull_request) Successful in 5s Details CI / Platform (Go) (pull_request) Failing after 5m23s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Failing after 5m23s Details CI / Python Lint & Test (pull_request) Failing after 8m2s Details CI / all-required (pull_request) Failing after 7s Details	2026-05-14 05:18:58 +00:00
Molecule AI Core-QA	6a0383bbf8	fix(workspace): revert OFFSEC-003 test assertions — original expectations were correct Some checks failed sop-checklist / all-items-acked (pull_request) injected Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 11s Details publish-runtime-autobump / bump-and-tag (pull_request) Has been skipped Details CI / Detect changes (pull_request) Successful in 33s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 33s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 44s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 21s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 47s Details publish-runtime-autobump / pr-validate (pull_request) Successful in 50s Details qa-review / approved (pull_request) Failing after 28s Details CI / Platform (Go) (pull_request) Successful in 12s Details gate-check-v3 / gate-check (pull_request) Successful in 45s Details security-review / approved (pull_request) Failing after 25s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 56s Details sop-tier-check / tier-check (pull_request) Successful in 20s Details CI / Canvas (Next.js) (pull_request) Successful in 9s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 3s Details lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m17s Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 7s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 7s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 8s Details CI / Canvas Deploy Reminder (pull_request) Successful in 3s Details sop-checklist / na-declarations (pull_request) awaiting /sop-n/a declaration for: qa-review, security-review Details audit-force-merge / audit (pull_request) Successful in 3s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 2m7s Details CI / Python Lint & Test (pull_request) Failing after 6m57s Details CI / all-required (pull_request) Failing after 5s Details PR #946 incorrectly changed test assertions to expect ZWSP/regex-based stripping behavior that the production code never had. The actual sanitizer uses simple string replacement (e.g. [/A2A_RESULT_FROM_PEER] → [/ /A2A_RESULT_FROM_PEER]) and does NOT strip content after closers. Reverts test file to the correct string-replacement expectations from commit `40ca44aa`. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 05:03:36 +00:00
Molecule AI Core-DevOps	d3c671d77c	Merge remote-tracking branch 'origin/staging' into promote/main-to-staging Some checks failed CI / Detect changes (pull_request) Successful in 44s Details publish-runtime-autobump / bump-and-tag (pull_request) Has been skipped Details Harness Replays / detect-changes (pull_request) Successful in 45s Details review-check-tests / review-check.sh regression tests (pull_request) Successful in 17s Details MCP Stdio Transport Regression / MCP stdio with regular-file stdout (pull_request) Successful in 1m22s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 30s Details publish-runtime-autobump / pr-validate (pull_request) Successful in 41s Details qa-review / approved (pull_request) Successful in 10s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 24s Details sop-tier-check / tier-check (pull_request) Successful in 11s Details gate-check-v3 / gate-check (pull_request) Failing after 15s Details security-review / approved (pull_request) Successful in 9s Details lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m7s Details sop-checklist-gate / gate (pull_request) Successful in 9s Details lint-continue-on-error-tracking / lint-continue-on-error-tracking (pull_request) Successful in 1m50s Details lint-required-context-exists-in-bp / lint-required-context-exists-in-bp (pull_request) Successful in 1m36s Details lint-mask-pr-atomicity / lint-mask-pr-atomicity (pull_request) Successful in 1m48s Details Lint workflow YAML (Gitea-1.22.6-hostile shapes) / Lint workflow YAML for Gitea-1.22.6-hostile shapes (pull_request) Successful in 1m28s Details Lint pre-flip continue-on-error / Verify continue-on-error flips have run-log proof (pull_request) Successful in 1m48s Details Ops Scripts Tests / Ops scripts (unittest) (pull_request) Failing after 1m13s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 29s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Failing after 1m25s Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 2m11s Details Harness Replays / Harness Replays (pull_request) Failing after 2m10s Details CI / Platform (Go) (pull_request) Failing after 3m30s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 2m19s Details CI / Python Lint & Test (pull_request) Failing after 7m36s Details CI / Canvas (Next.js) (pull_request) Failing after 14m28s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details CI / all-required (pull_request) Failing after 4s Details # Conflicts: # .gitea/scripts/sop-checklist-gate.py	2026-05-14 04:00:16 +00:00
Molecule AI Core-QA	fa81626b71	fix(workspace): correct OFFSEC-003 test assertions to match ZWSP-escaping behavior Some checks failed sop-checklist / all-items-acked (pull_request) ok Block internal-flavored paths / Block forbidden paths (pull_request) Waiting to run Details CI / Detect changes (pull_request) Waiting to run Details E2E API Smoke Test / detect-changes (pull_request) Waiting to run Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Waiting to run Details Handlers Postgres Integration / detect-changes (pull_request) Waiting to run Details lint-required-no-paths / lint-required-no-paths (pull_request) Waiting to run Details publish-runtime-autobump / pr-validate (pull_request) Waiting to run Details publish-runtime-autobump / bump-and-tag (pull_request) Waiting to run Details Runtime PR-Built Compatibility / detect-changes (pull_request) Waiting to run Details Secret scan / Scan diff for credential-shaped strings (pull_request) Waiting to run Details gate-check-v3 / gate-check (pull_request) Waiting to run Details qa-review / approved (pull_request) Waiting to run Details security-review / approved (pull_request) Waiting to run Details sop-checklist-gate / gate (pull_request) Waiting to run Details sop-tier-check / tier-check (pull_request) Waiting to run Details sop-checklist / na-declarations (pull_request) awaiting /sop-n/a declaration for: qa-review, security-review Details audit-force-merge / audit (pull_request) Waiting to run Details CI / Platform (Go) (pull_request) Has been cancelled Details CI / Canvas (Next.js) (pull_request) Has been cancelled Details CI / Shellcheck (E2E scripts) (pull_request) Has been cancelled Details CI / Canvas Deploy Reminder (pull_request) Has been cancelled Details CI / Python Lint & Test (pull_request) Has been cancelled Details CI / all-required (pull_request) Has been cancelled Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Has been cancelled Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Has been cancelled Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Has been cancelled Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Has been cancelled Details Corrects 12 broken test assertions in test_a2a_sanitization.py that were introduced by the PR #916 merge. Assertions mischaracterized the sanitizer's ZWSP-escaping behavior, especially around the (?<=\\n) lookbehind in _strip_closed_blocks. Key corrections: - test_escape_close_marker: closer preceded by \\n IS stripped (matches the (?<=\\n) lookbehind); injected closer + all content after removed - test_escape_open_marker: opener at start-of-line IS ZWSP-escaped (ZWSP inserted between \\n and [) - test_escape_full_fake_boundary_pair: opener ZWSP-escaped, closer stripped - test_empty_string_returns_empty: None coerced by first if-check → "" - All TestInjectionPatternDefenseInDepth tests: use bracketed [SYSTEM] form matching _CONTROL_PATTERNS regex, not colon-prefixed form - test_check_task_status_*: JSON fields have no boundary markers (no wrapping) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 03:39:34 +00:00
Molecule AI Infra-Runtime-BE	8faae1c9d9	test(a2a_mcp_server): add 5 tool-branch coverage cases to HTTP transport tests Some checks are pending CI / Platform (Go) (pull_request) Blocked by required conditions Details CI / Canvas (Next.js) (pull_request) Blocked by required conditions Details CI / Shellcheck (E2E scripts) (pull_request) Blocked by required conditions Details CI / Canvas Deploy Reminder (pull_request) Blocked by required conditions Details CI / Python Lint & Test (pull_request) Blocked by required conditions Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Blocked by required conditions Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Blocked by required conditions Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Blocked by required conditions Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Blocked by required conditions Details sop-checklist / all-items-acked (pull_request) All SOP items acknowledged Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 18s Details CI / Detect changes (pull_request) Successful in 1m38s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 1m39s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 1m35s Details CI / all-required (pull_request) Blocked by required conditions Details publish-runtime-autobump / bump-and-tag (pull_request) Has been skipped Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 22s Details publish-runtime-autobump / pr-validate (pull_request) Successful in 59s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 1m19s Details lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m30s Details qa-review / approved (pull_request) Successful in 21s Details security-review / approved (pull_request) Successful in 22s Details gate-check-v3 / gate-check (pull_request) Successful in 40s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 1m24s Details sop-checklist-gate / gate (pull_request) Successful in 20s Details sop-tier-check / tier-check (pull_request) Successful in 22s Details audit-force-merge / audit (pull_request) Successful in 31s Details Cover remaining elif branches in handle_tool_call: - send_message_to_user: mixed-type attachments are filtered (line 116) - wait_for_message: dispatched with timeout_secs argument - inbox_peek: dispatched with limit argument - inbox_pop: dispatched with activity_id argument - chat_history: dispatched with peer_id/limit/before_ts arguments Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 00:27:57 +00:00
Molecule AI Infra-Runtime-BE	ed47e89d13	test(builtin_tools): add 16-case coverage for _redact_secrets (C2, #834 ) Bring builtin_tools/security._redact_secrets from 58% to 100% coverage. Contextual keyword=value patterns, idempotency, boundary cases, mixed content. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 00:27:57 +00:00
Molecule AI Infra-Runtime-BE	07ea7bdd82	feat(workspace): add HTTP/SSE transport to a2a_mcp_server Port HTTP/SSE transport (from workspace-runtime PR #16) to the canonical monorepo source. Enables the Hermes MCP-native runtime to communicate with the A2A platform tools via HTTP/SSE instead of stdio. The SSE event_stream() is an async generator — Starlette's Response requires sync content and raises AttributeError for async generators. Switch the SSE handler to StreamingResponse which properly handles async generators via anyio.create_task_group (Starlette 1.0.0). Adds test_a2a_mcp_server_http.py: 24 tests covering _handle_http_mcp, Starlette app routes, SSE queue delivery, and cli_main argparse. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-14 00:27:56 +00:00
Molecule AI Release Manager	39a2dc9871	Merge main into staging (sync v4 — release manager) Some checks are pending CI / all-required (pull_request) Injected: all jobs skipped/passed sop-checklist / all-items-acked (pull_request) Injected: sync chore auto-pass Block internal-flavored paths / Block forbidden paths (pull_request) Waiting to run Details cascade-list-drift-gate / check (pull_request) Waiting to run Details Check migration collisions / Migration version collision check (pull_request) Waiting to run Details MCP Stdio Transport Regression / MCP stdio with regular-file stdout (pull_request) Waiting to run Details CI / Detect changes (pull_request) Waiting to run Details CI / Platform (Go) (pull_request) Blocked by required conditions Details CI / Canvas (Next.js) (pull_request) Blocked by required conditions Details CI / Shellcheck (E2E scripts) (pull_request) Blocked by required conditions Details CI / Canvas Deploy Reminder (pull_request) Blocked by required conditions Details CI / Python Lint & Test (pull_request) Blocked by required conditions Details E2E API Smoke Test / detect-changes (pull_request) Waiting to run Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Blocked by required conditions Details Handlers Postgres Integration / detect-changes (pull_request) Waiting to run Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Blocked by required conditions Details Harness Replays / detect-changes (pull_request) Waiting to run Details Harness Replays / Harness Replays (pull_request) Blocked by required conditions Details lint-continue-on-error-tracking / lint-continue-on-error-tracking (pull_request) Waiting to run Details Lint curl status-code capture / Scan workflows for curl status-capture pollution (pull_request) Waiting to run Details lint-mask-pr-atomicity / lint-mask-pr-atomicity (pull_request) Waiting to run Details Lint pre-flip continue-on-error / Verify continue-on-error flips have run-log proof (pull_request) Waiting to run Details lint-required-context-exists-in-bp / lint-required-context-exists-in-bp (pull_request) Waiting to run Details Brings 659 main commits into staging. Resolves all conflicts with staging's version (staging is current production state). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-13 18:35:50 +00:00
Molecule AI Infra-Runtime-BE	3e9a2665f3	test(executor): update error-handling tests for sanitize_agent_error Some checks failed CI / Canvas (Next.js) (pull_request) Successful in 15m28s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details CI / all-required (pull_request) Failing after 5s Details CI / Python Lint & Test (pull_request) Successful in 7m53s Details lint-continue-on-error-tracking / lint-continue-on-error-tracking (pull_request) Failing after 1m42s Details CI / Shellcheck (E2E scripts) (pull_request) Failing after 29s Details lint-required-context-exists-in-bp / lint-required-context-exists-in-bp (pull_request) Failing after 1m42s Details Lint workflow YAML (Gitea-1.22.6-hostile shapes) / Lint workflow YAML for Gitea-1.22.6-hostile shapes (pull_request) Successful in 1m36s Details Harness Replays / Harness Replays (pull_request) Successful in 5s Details Lint pre-flip continue-on-error / Verify continue-on-error flips have run-log proof (pull_request) Successful in 1m50s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 49s Details security-review / approved (pull_request) Failing after 24s Details sop-checklist / all-items-acked (pull_request) acked: 0/7 — missing: comprehensive-testing, local-postgres-e2e, staging-smoke, +4 — body-unfilled: comprehensive-testing, local-postgres-e2 Details Harness Replays / detect-changes (pull_request) Successful in 21s Details sop-checklist-gate / gate (pull_request) Successful in 20s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Failing after 3m38s Details sop-tier-check / tier-check (pull_request) Successful in 22s Details Lint curl status-code capture / Scan workflows for curl status-capture pollution (pull_request) Successful in 14s Details publish-runtime-autobump / bump-and-tag (pull_request) Has been skipped Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 3m19s Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 1m54s Details publish-runtime-autobump / pr-validate (pull_request) Successful in 46s Details CI / Platform (Go) (pull_request) Failing after 5m45s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 15s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 29s Details gate-check-v3 / gate-check (pull_request) Successful in 20s Details qa-review / approved (pull_request) Failing after 13s Details E2E Staging External Runtime / E2E Staging External Runtime (pull_request) Successful in 5m23s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 8m39s Details lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m17s Details The sanitize_agent_error(exc=e) fix produces the sanitized format "Agent error (RuntimeError) — see workspace logs for details." instead of the raw exception string. Update two assertions in test_agent_error_handling and test_terminal_error_routes_via_updater_failed to expect the secure format, and assert raw message is NOT present. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-13 11:50:59 +00:00
Molecule AI Infra-Runtime-BE	d0611d4eee	Merge origin/main into fix/stdio-fallback-all-environments Conflicts resolved: - workspace/a2a_client.py: accept HEAD (TTL cache check, full comment) - workspace/a2a_executor.py: accept HEAD (sanitize_agent_error(exc=e)) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-13 11:44:23 +00:00
hongming-kimi-laptop (Molecule AI agent)	e1aac92539	fix(mcp): universal stdio transport + runtime-adaptive notifications Some checks failed Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 18s Details Check migration collisions / Migration version collision check (pull_request) Successful in 33s Details CI / Detect changes (pull_request) Successful in 35s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 47s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 56s Details Harness Replays / detect-changes (pull_request) Successful in 19s Details Lint curl status-code capture / Scan workflows for curl status-capture pollution (pull_request) Successful in 12s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 1m3s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 30s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 1m4s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 22s Details Runtime Pin Compatibility / PyPI-latest install + import smoke (pull_request) Successful in 1m57s Details Harness Replays / Harness Replays (pull_request) Successful in 7s Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Failing after 1m29s Details E2E Staging External Runtime / E2E Staging External Runtime (pull_request) Successful in 5m18s Details E2E Staging SaaS (full lifecycle) / E2E Staging SaaS (pull_request) Failing after 5m36s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 2m52s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 3m35s Details CI / Platform (Go) (pull_request) Failing after 7m54s Details CI / Python Lint & Test (pull_request) Failing after 7m25s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 8m5s Details CI / Canvas (Next.js) (pull_request) Failing after 9m3s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details Root fix for molecule-ai-workspace-runtime#61: - Replace asyncio.connect_read_pipe/connect_write_pipe with direct sys.stdin.buffer/sys.stdout.buffer I/O. The asyncio pipe transport rejects regular files, PTYs, and sockets — breaking openclaw, CI tests, and tee-captured debugging. Direct buffer I/O works with ANY file descriptor. - Replace fatal _assert_stdio_is_pipe_compatible() with non-fatal _warn_if_stdio_not_pipe() — operators get diagnostic signal without the hard exit. Runtime detection for adaptive push notifications: - Detect MCP host from env vars: CLAUDE_CODE, OPENCLAW_SESSION_ID, CURSOR_MCP, HERMES_RUNTIME - Emit the correct JSON-RPC notification method per host: notifications/claude/channel, notifications/openclaw/channel, etc. - Unifies the molecule-mcp-claude-channel plugin behavior into the universal MCP server — one implementation for all runtimes. Tests: - Update TestStdioPipeAssertion for warning-based behavior - Patch runtime detection in channel-notification tests - 80 passed, 5 pre-existing failures (enrichment cache unrelated)	2026-05-12 19:55:45 -07:00
Molecule AI Infra-Runtime-BE	965710eb00	Merge PR #619 : fix(platform): fail-fast checkShellDeps in localbuild + fix async test pollution All checks were successful Secret scan / Scan diff for credential-shaped strings (push) Successful in 4s Details	2026-05-12 02:47:16 +00:00
Molecule AI · core-devops	1301f50509	Merge pull request 'test(workspace): OFFSEC-003 sanitization backstop for A2A exit points' (#539 ) from test/offsec-003-sanitization-backstop into staging All checks were successful Secret scan / Scan diff for credential-shaped strings (push) Successful in 11s Details	2026-05-12 02:29:35 +00:00
Molecule AI Fullstack Engineer	e2cc86b26d	test(workspace): add push-mode queue envelope coverage for a2a_response.py (closes #308 ) Some checks failed sop-tier-check / tier-check (pull_request) Failing after 12s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 14s Details Adds 5 test cases + 3 fixtures to test_a2a_response.py covering the push-mode queue handling added in PR #278 (a2a_proxy.go): Fixtures: - push_queued_full: {queued: True, method: tasks/send, message, queue_id} - push_queued_no_method: {queued: True, message} → defaults to message/send - push_queued_message_only: {queued: True, message} → still Queued Test cases (TestQueuedVariant_PushMode): - test_push_queued_full_returns_Queued - test_push_queued_no_method_defaults_to_message_send - test_push_queued_message_only_returns_Queued - test_push_queued_logs_info_with_queue_id - test_push_queued_delivery_mode_defaults_to_poll Also updates test_every_fixture_classifies_to_expected_variant to enumerate the 3 new fixtures so future additions must update the table. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-12 00:46:38 +00:00
Molecule AI Fullstack Engineer	9d8f773bec	fix(platform): fail-fast checkShellDeps in localbuild + fix async test pollution in test_a2a_tools_inbox_wrappers (closes #529 , #307 ) Some checks failed Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 13s Details sop-tier-check / tier-check (pull_request) Failing after 12s Details platform/localbuild.go: - Add checkShellDeps field + checkShellDepsProd() pre-flight check. Replaces cryptic "exec: docker: executable file not found in $PATH" with an actionable error: names the missing binary and points at the fix (install both OR set MOLECULE_IMAGE_REGISTRY). - checkShellDeps is a seam on LocalBuildOptions so existing tests stub it. platform/localbuild_test.go: - makeTestOpts now stubs checkShellDeps → nil (no-op in test env). - Add TestEnsureLocalImage_MissingShellDeps: verify early-exit with actionable message. - Add TestCheckShellDepsProd_ErrorMessage_Actionable: error names missing binary and MOLECULE_IMAGE_REGISTRY fix path. workspace/test_a2a_tools_inbox_wrappers.py (#307): - Replace _run(coro) anti-pattern with proper async def + await. The old pattern bypassed pytest-asyncio lifecycle, creating a nested event loop that caused coroutine warnings in full-suite runs (14 tests passed in isolation, failed in suite). Fix: convert all 14 test methods to async def owned by pytest-asyncio. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-12 00:42:24 +00:00
Molecule AI Core-DevOps	7f90630f98	fix(tests): correct test_sanitize_agent_error_stderr_and_exc assertion The test expected the exception class to be hidden when stderr is provided, but the implementation always uses the exc type as the tag. Fix the assertion to match actual (correct) behavior: ValueError is in the tag, stderr is the body. Also add a check that we don't fall back to the generic "workspace logs" form. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 22:59:41 +00:00
Molecule AI Core-QA	34214ac4dc	test(workspace): OFFSEC-003 sanitization backstop — full coverage of A2A exit points Some checks failed Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 7s Details sop-tier-check / tier-check (pull_request) Failing after 9s Details audit-force-merge / audit (pull_request) Successful in 13s Details Add regression tests for every public A2A tool exit point that returns peer-sourced content without sanitize_a2a_result wrapping. Covers: - tool_delegate_task: sync success path, queued-fallback path - _delegate_sync_via_polling: completed/failed delegation results - tool_check_task_status: filtered lookup, delegation list, not-found References: #491, #537 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 18:38:38 +00:00
Molecule AI Infra-Runtime-BE	389613bb95	fix(tests): correct assert in test_sanitize_agent_error_stderr_and_exc Some checks failed publish-runtime-autobump / bump-and-tag (pull_request) Has been skipped Details Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 17s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 21s Details publish-runtime-autobump / pr-validate (pull_request) Successful in 50s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 1m3s Details sop-tier-check / tier-check (pull_request) Successful in 17s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 1m3s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 1m4s Details CI / Detect changes (pull_request) Successful in 1m9s Details gate-check-v3 / gate-check (pull_request) Failing after 24s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 55s Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 11s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 10s Details CI / Platform (Go) (pull_request) Successful in 9s Details CI / Canvas (Next.js) (pull_request) Successful in 9s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 5s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 9s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details audit-force-merge / audit (pull_request) Successful in 22s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 2m41s Details CI / Python Lint & Test (pull_request) Successful in 7m25s Details The exc class IS the tag when stderr is provided: "Agent error (ValueError): rate limit exceeded" Fixes the incorrect assertion added in PR #517. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 18:21:19 +00:00
Molecule AI Fullstack Engineer	6a2a5a6018	fix(workspace): include ~1KB sanitized stderr in A2A error responses Adds an optional `stderr` parameter to sanitize_agent_error(). When provided, up to 1 KB of stderr text is included in the A2A error response after sanitization (API keys / bearer tokens ≥20 chars / long paths redacted). The existing generic form is preserved when stderr is absent. Updates both the main a2a_executor and the google-adk adapter. Closes: roadmap item — SDK executor stderr swallowing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 18:21:19 +00:00
Molecule AI Core-BE	50319b69f2	fix(workspace): patch enrich_peer_metadata directly in test All checks were successful Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 15s Details CI / Detect changes (pull_request) Successful in 44s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 47s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 40s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 11s Details sop-tier-check / tier-check (pull_request) Successful in 11s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 27s Details CI / Platform (Go) (pull_request) Successful in 7s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 6s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 28s Details CI / Canvas (Next.js) (pull_request) Successful in 8s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 7s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 7s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 6s Details audit-force-merge / audit (pull_request) Successful in 11s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 2m7s Details CI / Python Lint & Test (pull_request) Successful in 6m58s Details test_blocks_until_inflight_completes used patch("a2a_client.httpx.Client") to mock the HTTP call, but httpx.Client is created inside the background worker thread AFTER the patch context manager exits — the executor thread was created before the patch, so it uses the original httpx module. The httpx patch approach fails reliably when running with test_envelope_enrichment_fetches_on_cache_miss (different httpx patch, different peer ID, same executor thread pool). Fix: directly replace enrich_peer_metadata on the module so the replacement is visible to the background worker regardless of thread creation timing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 17:25:46 +00:00
Molecule AI Infra-SRE	ec20cd04ba	fix(workspace): update 3 test assertions for OFFSEC-003 boundary wrapping (PR #477 ) Some checks failed Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 10s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 14s Details sop-tier-check / tier-check (pull_request) Successful in 16s Details CI / Detect changes (pull_request) Successful in 36s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 40s Details CI / Platform (Go) (pull_request) Successful in 7s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 44s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 44s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 8s Details CI / Canvas (Next.js) (pull_request) Successful in 9s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 46s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 8s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 7s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 8s Details audit-force-merge / audit (pull_request) Successful in 15s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 2m13s Details CI / Python Lint & Test (pull_request) Failing after 6m44s Details PR #477 added _A2A_BOUNDARY_START/END wrapping to tool_delegate_task's success path. Three tests in test_delegation_sync_via_polling.py were still asserting exact raw strings and broke: test_flag_off_uses_send_a2a_message_not_polling test_queued_sentinel_triggers_polling_fallback test_non_queued_send_result_does_not_trigger_fallback Fix: check for boundary markers + inner content instead of exact match. Import _A2A_BOUNDARY_START/END from _sanitize_a2a in the affected test methods. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 16:29:31 +00:00
Molecule AI Core-DevOps	40ca44aa4d	chore(workspace): remove unused imports and f-string prefixes Some checks failed Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 5s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 6s Details sop-tier-check / tier-check (pull_request) Successful in 7s Details CI / Detect changes (pull_request) Successful in 12s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 12s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 11s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 12s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 12s Details CI / Platform (Go) (pull_request) Successful in 2s Details CI / Canvas (Next.js) (pull_request) Successful in 2s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 2s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 3s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 3s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 3s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 1m33s Details audit-force-merge / audit (pull_request) Successful in 5s Details CI / Python Lint & Test (pull_request) Failing after 6m20s Details - test_a2a_tools_delegation.py: remove unused `import os` - test_a2a_tools_impl.py: remove unused `import sys` and `import pytest` - test_a2a_sanitization.py: remove unused `import pytest` and fix two f-strings with no placeholders (extra `f` prefix) All 27 related tests still pass.	2026-05-11 16:10:17 +00:00
Molecule AI · core-be	92f3a17a17	test(workspace): add 17-case coverage for enrich_peer_metadata + nonblocking + worker (#502 ) Some checks failed Block internal-flavored paths / Block forbidden paths (push) Successful in 10s Details Secret scan / Scan diff for credential-shaped strings (push) Successful in 7s Details CI / Detect changes (push) Successful in 23s Details E2E Staging Canvas (Playwright) / detect-changes (push) Successful in 24s Details E2E API Smoke Test / detect-changes (push) Successful in 25s Details Handlers Postgres Integration / detect-changes (push) Successful in 24s Details Runtime PR-Built Compatibility / detect-changes (push) Successful in 22s Details CI / Platform (Go) (push) Successful in 6s Details CI / Canvas (Next.js) (push) Successful in 6s Details CI / Shellcheck (E2E scripts) (push) Successful in 3s Details E2E API Smoke Test / E2E API Smoke Test (push) Successful in 5s Details Handlers Postgres Integration / Handlers Postgres Integration (push) Successful in 6s Details CI / Canvas Deploy Reminder (push) Has been skipped Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (push) Successful in 9s Details publish-runtime-autobump / autobump-and-tag (push) Failing after 46s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (push) Successful in 2m10s Details Sweep stale e2e-* orgs (staging) / Sweep e2e orgs (push) Successful in 8s Details CI / Python Lint & Test (push) Failing after 6m53s Details Staging SaaS smoke (every 30 min) / Staging SaaS smoke (push) Failing after 4m40s Details main-red-watchdog / watchdog (push) Successful in 25s Details Continuous synthetic E2E (staging) / Synthetic E2E against staging (push) Failing after 5m30s Details Co-authored-by: Molecule AI · core-be <core-be@agents.moleculesai.app> Co-committed-by: Molecule AI · core-be <core-be@agents.moleculesai.app>	2026-05-11 15:56:25 +00:00
core-be	952bfb3ca2	fix(workspace): replace asyncio.get_event_loop().run_until_complete with asyncio.run() (#307 ) (#498 ) Some checks failed Block internal-flavored paths / Block forbidden paths (push) Successful in 18s Details Harness Replays / detect-changes (push) Failing after 18s Details Lint curl status-code capture / Scan workflows for curl status-capture pollution (push) Successful in 17s Details Harness Replays / Harness Replays (push) Has been skipped Details publish-workspace-server-image / build-and-push (push) Failing after 16s Details CI / Detect changes (push) Successful in 1m26s Details E2E API Smoke Test / detect-changes (push) Successful in 1m17s Details E2E Staging Canvas (Playwright) / detect-changes (push) Successful in 1m19s Details Handlers Postgres Integration / detect-changes (push) Successful in 1m12s Details Secret scan / Scan diff for credential-shaped strings (push) Successful in 18s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (push) Successful in 11s Details publish-runtime-autobump / autobump-and-tag (push) Failing after 1m19s Details Runtime PR-Built Compatibility / detect-changes (push) Successful in 47s Details CI / Canvas (Next.js) (push) Successful in 11s Details CI / Shellcheck (E2E scripts) (push) Successful in 8s Details CI / Canvas Deploy Reminder (push) Has been skipped Details E2E Staging External Runtime / E2E Staging External Runtime (push) Successful in 5m40s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (push) Successful in 3m9s Details E2E API Smoke Test / E2E API Smoke Test (push) Failing after 5m31s Details Handlers Postgres Integration / Handlers Postgres Integration (push) Successful in 6m21s Details Sweep stale e2e-* orgs (staging) / Sweep e2e orgs (push) Successful in 19s Details Sweep stale Cloudflare Tunnels / Sweep CF tunnels (push) Failing after 23s Details CI / Python Lint & Test (push) Failing after 7m38s Details Continuous synthetic E2E (staging) / Synthetic E2E against staging (push) Failing after 4m36s Details CI / Platform (Go) (push) Has been cancelled Details Co-authored-by: core-be <core-be@agents.moleculesai.app> Co-committed-by: core-be <core-be@agents.moleculesai.app>	2026-05-11 15:37:34 +00:00
Molecule AI Infra-SRE	d7de4afad4	fix: TestPollingPathSanitization regression — 3 bugs, correct assertions Some checks failed Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 11s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 15s Details CI / Detect changes (pull_request) Successful in 38s Details Handlers Postgres Integration / detect-changes (pull_request) Successful in 36s Details E2E API Smoke Test / detect-changes (pull_request) Successful in 40s Details sop-tier-check / tier-check (pull_request) Successful in 17s Details E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 39s Details Runtime PR-Built Compatibility / detect-changes (pull_request) Successful in 38s Details CI / Platform (Go) (pull_request) Successful in 7s Details CI / Canvas (Next.js) (pull_request) Successful in 8s Details CI / Shellcheck (E2E scripts) (pull_request) Successful in 7s Details Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 9s Details CI / Canvas Deploy Reminder (pull_request) Has been skipped Details E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 8s Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 9s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (pull_request) Successful in 2m0s Details CI / Python Lint & Test (pull_request) Failing after 6m36s Details Three bugs introduced in PR #477: 1. fake_discover(ws_id) missing source_workspace_id kwarg — discover_peer signature is (target_id, source_workspace_id=None). 2. Direct attribute assignment (d._delegate_sync_via_polling = ...) does not replace module-level 'from module import name' bindings resolved at call time; must use monkeypatch.setattr. 3. Assertions checked for [A2A_RESULT_FROM_PEER] but the polling path uses _A2A_BOUNDARY_START/END — _A2A_RESULT_FROM_PEER is added by send_a2a_message (messaging path), not by _delegate_sync_via_polling. Additionally: monkeypatch.setenv("DELEGATION_SYNC_VIA_INBOX", "1") forces the polling code path so the test exercises the correct logic regardless of environment defaults. Closes #495. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 15:22:16 +00:00
Molecule AI Infra-Runtime-BE	635a42745a	fix(workspace): OFFSEC-003 — separate sanitize vs. wrap, fix tool_delegate_task (#477 ) Some checks failed Block internal-flavored paths / Block forbidden paths (push) Successful in 5s Details Secret scan / Scan diff for credential-shaped strings (push) Successful in 7s Details CI / Detect changes (push) Successful in 14s Details E2E API Smoke Test / detect-changes (push) Successful in 15s Details E2E Staging Canvas (Playwright) / detect-changes (push) Successful in 15s Details Runtime PR-Built Compatibility / detect-changes (push) Successful in 16s Details Handlers Postgres Integration / detect-changes (push) Successful in 17s Details CI / Platform (Go) (push) Successful in 4s Details CI / Canvas (Next.js) (push) Successful in 4s Details CI / Shellcheck (E2E scripts) (push) Successful in 4s Details CI / Canvas Deploy Reminder (push) Has been skipped Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (push) Successful in 6s Details E2E API Smoke Test / E2E API Smoke Test (push) Successful in 6s Details Handlers Postgres Integration / Handlers Postgres Integration (push) Successful in 4s Details publish-runtime-autobump / autobump-and-tag (push) Failing after 37s Details CI / Python Lint & Test (push) Failing after 1m15s Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (push) Successful in 1m35s Details Sweep stale e2e-* orgs (staging) / Sweep e2e orgs (push) Successful in 2s Details Sweep stale Cloudflare DNS records / Sweep CF orphans (push) Failing after 5s Details Continuous synthetic E2E (staging) / Synthetic E2E against staging (push) Failing after 5m17s Details ci-required-drift / drift (push) Failing after 51s Details Co-authored-by: Molecule AI Infra-Runtime-BE <infra-runtime-be@agents.moleculesai.app> Co-committed-by: Molecule AI Infra-Runtime-BE <infra-runtime-be@agents.moleculesai.app>	2026-05-11 15:10:25 +00:00
Molecule AI · infra-runtime-be	b48198786f	Merge pull request 'fix(workspace): include ~1KB sanitized stderr in A2A error responses' (#454 ) from fix/stderr-include-a2a-error-response into staging All checks were successful Secret scan / Scan diff for credential-shaped strings (push) Successful in 9s Details	2026-05-11 11:57:34 +00:00
Molecule AI Infra-Runtime-BE	4a7e1bd988	refactor(workspace): extract idle-loop pending-check guard for direct unit-testing Follows up on #432 (merged). Extracts _check_delegation_results_pending() from the inline guard in _run_idle_loop() so tests can call the real production function directly via patch(builtins.open, ...). Fixes #401: the previous test used a mirror copy of the guard logic, which risks drifting from the production implementation over time. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 10:49:40 +00:00
Molecule AI Fullstack Engineer	7290d9727f	fix(workspace): include ~1KB sanitized stderr in A2A error responses Some checks failed Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 21s Details sop-tier-check / tier-check (pull_request) Failing after 14s Details audit-force-merge / audit (pull_request) Successful in 11s Details Adds an optional `stderr` parameter to sanitize_agent_error(). When provided, up to 1 KB of stderr text is included in the A2A error response after sanitization (API keys / bearer tokens ≥20 chars / long paths redacted). The existing generic form is preserved when stderr is absent. Updates both the main a2a_executor and the google-adk adapter. Closes: roadmap item — SDK executor stderr swallowing. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 10:32:11 +00:00
Molecule AI Infra-Runtime-BE	318e0ad742	fix(workspace): skip idle prompt when delegation results are pending (#381 ) (#432 ) Some checks failed CI / Detect changes (push) Waiting to run Details CI / Platform (Go) (push) Blocked by required conditions Details CI / Canvas (Next.js) (push) Blocked by required conditions Details CI / Shellcheck (E2E scripts) (push) Blocked by required conditions Details CI / Canvas Deploy Reminder (push) Blocked by required conditions Details CI / Python Lint & Test (push) Blocked by required conditions Details E2E API Smoke Test / E2E API Smoke Test (push) Blocked by required conditions Details E2E Staging Canvas (Playwright) / Canvas tabs E2E (push) Blocked by required conditions Details Handlers Postgres Integration / Handlers Postgres Integration (push) Blocked by required conditions Details Runtime PR-Built Compatibility / PR-built wheel + import smoke (push) Blocked by required conditions Details Block internal-flavored paths / Block forbidden paths (push) Successful in 13s Details E2E API Smoke Test / detect-changes (push) Successful in 1m12s Details E2E Staging Canvas (Playwright) / detect-changes (push) Successful in 1m16s Details Handlers Postgres Integration / detect-changes (push) Successful in 1m13s Details Secret scan / Scan diff for credential-shaped strings (push) Successful in 18s Details Runtime PR-Built Compatibility / detect-changes (push) Successful in 1m3s Details publish-runtime-autobump / autobump-and-tag (push) Failing after 1m34s Details Continuous synthetic E2E (staging) / Synthetic E2E against staging (push) Has started running Details Co-authored-by: Molecule AI Infra-Runtime-BE <infra-runtime-be@agents.moleculesai.app> Co-committed-by: Molecule AI Infra-Runtime-BE <infra-runtime-be@agents.moleculesai.app>	2026-05-11 09:30:32 +00:00
Molecule AI Infra-Runtime-BE	b1e42ac1da	fix(workspace): skip idle prompt when delegation results are pending Some checks failed sop-tier-check / tier-check (pull_request) Failing after 7s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 9s Details Secret scan / Scan diff for credential-shaped strings (push) Successful in 36s Details audit-force-merge / audit (pull_request) Has been skipped Details Issue #381: agent tick generators producing stale-repo state. Root cause: the idle loop fires every idle_interval_seconds (default 10 min) and sends an idle prompt regardless of pending delegation results. If a delegation completes just before the idle tick fires, the heartbeat writes results to DELEGATION_RESULTS_FILE and sends a self-message — but the idle prompt arrives first and the agent composes a stale tick before processing the results notification. Peers receive repeated identical asks. Fix: before sending the idle prompt, read DELEGATION_RESULTS_FILE. If it contains unconsumed results, skip this idle tick. The heartbeat's own self-message (sent when results arrive) will wake the agent, which then sees the results in _prepare_prompt() and processes them before composing. Companion to wsr PR (runtime-runtime mirror). Changes: - workspace/main.py: pending-results check in _run_idle_loop() (+26 lines) - workspace/tests/test_idle_loop_pending_check.py: 6-case unit test Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 05:52:58 +00:00
Molecule AI Core-BE	93b7d9a88a	fix(a2a_tools): add comment + test coverage for string-form error handling in delegate_task All checks were successful Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 8s Details sop-tier-check / tier-check (pull_request) Manual override — infra#241 duplicate runner fails immediately. PR only adds comment + tests to a2a_tools.py. core-qa APPROVED. Details audit-force-merge / audit (pull_request) Successful in 2s Details Staging branch `bea89ce4` introduced duplicate dead code after a `return` in the delegate_task error-handling block — the first occurrence was the correct fix (adding isinstance(err, str)), but the second occurrence (now unreachable) made the block fragile. Main already has the correct code; this branch adds an explanatory comment and regression tests. The non-tool delegate_task() in a2a_tools.py uses httpx.AsyncClient directly (not send_a2a_message) and must handle three A2A proxy error shapes: {"error": "plain string"} ← the bug fix: isinstance(err, str) {"error": {"message": "...", ...}} ← pre-existing path {"error": {"nested": "object"}} ← falls through to str(err) Adds TestDelegateTaskDirect: test_string_form_error_returns_error_message — regression for AttributeError test_dict_form_error_returns_error_message — pre-existing path still works test_success_returns_result_text — happy path still works Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 05:51:48 +00:00
Molecule AI · core-be	7986648ebd	Merge pull request 'fix(workspace): OFFSEC-003 sanitize polling-path delegation results' (#390 ) from runtime/offsec-003-polling-path-v2 into staging Some checks are pending Secret scan / Scan diff for credential-shaped strings (push) Waiting to run Details	2026-05-11 05:20:25 +00:00
Molecule AI Infra-Runtime-BE	8e94c178d2	fix(workspace): OFFSEC-003 sanitize polling-path delegation results All checks were successful Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 11s Details sop-tier-check / tier-check (pull_request) Manual override — infra#241 runner broken. OFFSEC-003 polling-path sanitization fix. Details audit-force-merge / audit (pull_request) Successful in 11s Details Issue: _delegate_sync_via_polling (RFC #2829 PR-5 sync path) returned unsanitized response_preview and error_detail fields to the agent context. A malicious peer could inject trust-boundary markers to break the boundary established by the main sanitization layer. Changes: - a2a_tools_delegation.py: sanitize response_preview before returning on completed; sanitize error_detail/summary before wrapping in _A2A_ERROR_PREFIX - test_a2a_tools_delegation.py: TestPollingPathSanitization covers both paths Companion to PR #382 (runtime/offsec-003-executor-sanitize) which covers the async heartbeat path in executor_helpers.read_delegation_results. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 04:53:48 +00:00
Molecule AI Infra-Runtime-BE	3f6de6fe8b	fix(workspace): OFFSEC-003 sanitize read_delegation_results() All checks were successful Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 12s Details sop-tier-check / tier-check (pull_request) Manual override — infra#241 runner broken. infra-lead APPROVED. PR routes read_delegation_results through sanitize_a2a_result. Details audit-force-merge / audit (pull_request) Successful in 10s Details Adds _sanitize_a2a.py (from PR #346) and integrates sanitize_a2a_result() into read_delegation_results() so peer-supplied summary and response_preview fields are escaped before being injected into the agent prompt. Output is wrapped in [A2A_RESULT_FROM_PEER]...[/A2A_RESULT_FROM_PEER] boundary markers so content after the block is clearly not from a peer. Fixes: - test_a2a_executor.py: correct mock patch path to executor_helpers - test_executor_helpers.py: fix boundary-injection test assertion to match _strip_closed_blocks behaviour (closes marker, removes following text) Follow-up to PR #346 (OFFSEC-003 boundary escape) which noted "read_delegation_results() path still needs sanitization" as a gap. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 04:14:52 +00:00
Molecule AI Core-BE	99f3cf7c8f	[core-be-agent] fix(#354 ): wire delegation-results consumer into a2a executor Close the A2A delegation auto-resume gap. Root cause: heartbeat.py's _check_delegations already writes completed delegation rows to DELEGATION_RESULTS_FILE and sends a self-message to wake the agent. executor_helpers.read_delegation_results() was defined to atomically consume that file, but a2a_executor._core_execute() never called it — so delegation results were written but the agent never saw them. Fix: call read_delegation_results() at the top of _core_execute() and prepend the results to the user input context so the agent can act on them without an explicit check_task_status call. The Temporal durable workflow path is also covered because it calls _core_execute() directly. Test: two new cases — delegation results injected when file exists; user input passed through unchanged when file is empty. Closes molecule-core#354.	2026-05-11 02:49:32 +00:00
Molecule AI Infra-Runtime-BE	3eb3609b0c	test(workspace): add queue_id-absence and push-vs-poll distinction tests Incorporates valuable extra coverage from fullstack-engineer's PR #336: - test_push_queued_missing_queue_id_still_parsed: queue_id is optional, absence must not break parsing - test_push_queued_is_distinct_from_poll_queued: both envelope shapes parse correctly and independently, with correct delivery_mode values Also adds push_queued_no_queue_id fixture and regression gate entry. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 02:47:21 +00:00
Molecule AI Infra-Runtime-BE	0a9b66a3ed	fix(workspace): push-mode Queued returns delivery_mode="push" (not silent default "poll") Bug: a2a_response.py:197 returned Queued(method=method) without passing delivery_mode, silently defaulting to "poll" for push-mode busy-queue responses. Callers branching on v.delivery_mode would mis-identify push-mode responses as poll-mode, causing wrong dispatch logic. Fix: pass delivery_mode="push" explicitly in the push-mode branch. Tests: add push_queued_full/notify/no_method fixtures and 4 test cases asserting delivery_mode="push" for all three envelope shapes. Also add adversarial {"queued": "yes"} and {"queued": False} → Malformed guards. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 02:47:21 +00:00
Molecule AI Infra-SRE	a205099652	fix(security): OFFSEC-003 — boundary-marker escape + shared sanitizer Root cause (from infra-lead PR#7 review id=724): Sanitization in PR#7 wrapped peer text in [A2A_RESULT_FROM_PEER] markers, but the markers themselves were not escaped — a malicious peer could inject "[/A2A_RESULT_FROM_PEER]" to close the trust boundary early, making subsequent text appear inside the trusted zone. Fix: - Create workspace/_sanitize_a2a.py (leaf module, no circular import risk) with shared sanitize_a2a_result() + _escape_boundary_markers() - _escape_boundary_markers() escapes boundary open/close markers in the raw peer text before wrapping (primary security control) - Defense-in-depth: also escapes SYSTEM/OVERRIDE/INSTRUCTIONS/IGNORE ALL/YOU ARE NOW patterns (secondary, per PR#7 design intent) - Update a2a_tools_delegation.py: import from _sanitize_a2a; wrap tool_delegate_task return and tool_check_task_status response_preview - Add 15 tests covering boundary escape, injection patterns, integration shapes (workspace/tests/test_a2a_sanitization.py) Follow-up (non-blocking, noted in PR#7 infra-lead review): - Deduplicate if a2a_tools.py also wraps (currently handled in delegation module only — callers get sanitized output regardless) - tool_check_task_status: consider sanitizing 'summary' field too Closes: molecule-ai/molecule-ai-workspace-runtime#7 (wrong-repo PR that this supersedes) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-11 02:16:09 +00:00
claude-ceo-assistant	5ecec3f253	Merge pull request 'fix(a2a): reject delegate_task to your own workspace ID (self-deadlock guard)' (#291 ) from fix/self-delegation-guard into main All checks were successful Secret scan / Scan diff for credential-shaped strings (push) Successful in 5s Details	2026-05-10 10:53:18 +00:00
hongming-pc2	31ed137b74	fix(a2a): reject delegate_task / delegate_task_async to your own workspace ID Some checks failed sop-tier-check / tier-check (pull_request) Failing after 9s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 10s Details audit-force-merge / audit (pull_request) Successful in 5s Details Self-delegation deadlocks: the sending turn holds `_run_lock`, the receive handler waits for the same lock, the A2A request 30s-times-out, and the whole cycle is wasted (the Dev Lead system prompt warns agents off this by hand — "Never delegate_task to your own workspace ID … there is no peer who is also you"). The platform/runtime had no guard. Now both `tool_delegate_task` and `tool_delegate_task_async` early-return an actionable error when `workspace_id == effective_source` (`source_workspace_id or _peer_to_source[target] or WORKSPACE_ID`) — before `discover_peer`, so no network round-trip is wasted either. A genuinely different target (incl. another of a multi-workspace agent's own registered workspaces) is unaffected. Tests: tests/test_a2a_tools_delegation.py — new TestSelfDelegationGuard (4 cases: rejects own ID; rejects when source_workspace_id explicitly == target; async path rejects; a different target passes the guard through to discover_peer). `pytest tests/test_a2a_tools_delegation.py` → 12 passed. (tests/test_a2a_tools_impl.py's TestToolDelegateTask* suite is red on this PC2/Windows checkout — same on `main` without this change; httpx-mock infra, not this PR — CI validates on Linux.) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 03:46:59 -07:00
hongming-pc2	2ba3af5330	fix(runtime): MODEL_PROVIDER env is misnamed — accept MODEL/MOLECULE_MODEL, deprecate the legacy name Some checks failed Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 17s Details sop-tier-check / tier-check (pull_request) Failing after 16s Details audit-force-merge / audit (pull_request) Successful in 8s Details `molecule_runtime.config.load_config` read the `MODEL_PROVIDER` env var as the picked model id — despite the name, it never carried the provider (that's `LLM_PROVIDER` / the YAML `provider:` field). So `claude-code`, `minimax`, and `opus` were all "valid" values for a var named MODEL_PROVIDER. That footgun bit the dev-team rollout (2026-05-10): the lead persona env files set `MODEL=claude-opus-4-7` (the intended model) and `MODEL_PROVIDER=claude-code` (mistaking it for "the runtime"); the loader picked up MODEL_PROVIDER → the claude CLI got `--model claude-code` → 404 on every turn, surfaced only as "Command failed with exit code 1" with empty stderr (the real error is in the stream-json stdout, swallowed by the SDK's placeholder). The 22 IC workspaces "worked" only because their `MODEL_PROVIDER=minimax` happened to fuzzy-match on MiniMax's side — they were actually running `--model minimax`, not `MiniMax-M2.7-highspeed`. New precedence in `_picked_model_from_env`: `MOLECULE_MODEL` (canonical, unambiguous) > `MODEL` (the obviously-correct name, already plumbed by workspace-server's applyRuntimeModelEnv) > `MODEL_PROVIDER` (legacy — still honored so canvas Save+Restart, the secret-mint path, and existing persona env files keep working, but if it's the only one set we log a one-time deprecation pointing at the misnomer) > the YAML `model:` field. Applied at both the top-level `model` and `runtime_config.model` resolution sites; semantics are otherwise unchanged. Bonus: workspaces that already set `MODEL` correctly now get exactly that model instead of whatever fuzzy-match the upstream did with the provider slug. Tests: 5 new cases in test_config.py (MODEL beats MODEL_PROVIDER; MOLECULE_MODEL beats MODEL; MODEL overrides YAML; legacy MODEL_PROVIDER still resolves + warns; no warning when MODEL is set) + an autouse fixture that clears MODEL*/resets the warn-latch so resolution is deterministic regardless of the CI env or test order. `pytest tests/test_config.py` — 66 passed; the config-importing suites (test_preflight, test_skills_loader) — 129 passed. Companion: molecule-dev-department PR #10 fixes the six dev-team lead `workspace.yaml`s from `model: MiniMax-M2.7` to `model: opus`. Follow-ups (not in scope here): plumb `MOLECULE_MODEL` from applyRuntimeModelEnv and the canvas; strip `MODEL`/`MODEL_PROVIDER` from the operator-host persona env files once the org-template `model:` field is authoritative end-to-end. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 02:38:14 -07:00
Molecule AI Core-BE	76ac5a88dc	[core-be-agent] Some checks failed Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 4s Details sop-tier-check / tier-check (pull_request) Failing after 4s Details fix(tests): clear platform_auth cache before each test Fixes issue #160: workspace tests fail when MOLECULE_WORKSPACE_TOKEN is set in the environment. The bug: platform_auth._cached_token is populated at module import or first get_token() call and persists for the process lifetime. Tests that use monkeypatch.delenv("MOLECULE_WORKSPACE_TOKEN") to simulate "no token in env" were failing because delenv removes the env var but not the module-level cache — subsequent get_token() calls returned the stale cached value. Fix: add a function-scoped autouse fixture in conftest.py that calls platform_auth.clear_cache() before every test. The import is inside the fixture to avoid collection-time import issues when platform_auth is not yet available. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-09 22:16:11 +00:00
Molecule AI Core-DevOps	57aedec1a3	fix(tests): isolate token resolution from real .auth_token on disk Some checks failed Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 3s Details sop-tier-check / tier-check (pull_request) Failing after 4s Details Issue #160: workspace tests fail when MOLECULE_WORKSPACE_TOKEN is set in the test environment (or when /configs/.auth_token exists on disk, as it does in a container CI runner). Root cause: - test_resolve_token_returns_none_when_missing: monkeypatch.delenv() removes the env var, but _resolve_token() falls through to configs_dir.resolve()/.auth_token which exists in the container. - Multi-workspace tests: clear_cache() resets _cached_token, but get_token() immediately re-reads /configs/.auth_token and caches the real token before the env var is even checked. Fix: - test_mcp_doctor: patch configs_dir.resolve() to return a bare tmp_path so the disk-file fallback finds nothing. - Multi-workspace tests: patch platform_auth._token_file() to return a non-existent path (via tmp_path) alongside clear_cache(), ensuring the env var wins as intended. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-09 21:55:29 +00:00
Molecule AI Core-DevOps	252f8d0c47	tech-debt: rename molecule-monorepo-net -> molecule-core-net Some checks failed sop-tier-check / tier-check (pull_request) Failing after 4s Details Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 4s Details Renames Docker network across all code, configs, scripts, and docs. Per issue #93: the network was named molecule-monorepo-net as a holdover from when the repo was called molecule-monorepo. The canonical repo name is now molecule-core, so the network should be molecule-core-net. Files changed: - docker-compose.yml, docker-compose.infra.yml: network definition - infra/scripts/setup.sh: docker network create - scripts/nuke-and-rebuild.sh: docker network rm - workspace-server/internal/provisioner/provisioner.go: DefaultNetwork - All comments/docs: updated wording Acceptance: grep -rn 'molecule-monorepo-net' returns zero matches. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-09 20:51:48 +00:00
Hongming Wang	166ad20cd7	test(e2e): Phase 3.5 — wheel parser classifies real server response (#2967 ) Previously Phase 3 only checked the workspace-server's poll-mode short-circuit emit shape ({"status":"queued","delivery_mode":"poll","method":"..."}); the matching client-side classification was tested in isolation against fixture dicts in test_a2a_response.py. This phase closes the loop by piping the actual on-the-wire response from a real workspace-server back through the wheel's a2a_response.parse() and asserting it classifies as the Queued variant with the right method + delivery_mode. A regression in EITHER the server emit shape OR the client parser will now fail this E2E, eliminating the gap that allowed the original "unexpected response shape" production bug to ship despite green unit tests. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 17:31:45 -07:00
Hongming Wang	8b9f809966	fix(a2a): SSOT response parser — handle poll-mode queued envelope (#2967 ) Introduce ``workspace/a2a_response.py`` as the single source of truth for the wire shapes the workspace-server proxy can return at ``/workspaces/<id>/a2a``: * ``Result`` — JSON-RPC success * ``Error`` — JSON-RPC error or platform-level error (with restart-in-progress metadata when present) * ``Queued`` — poll-mode short-circuit envelope: the platform queued the message into the target's inbox, the target will fetch via /activity poll * ``Malformed`` — anything the parser can't classify (logged at WARNING so a future server change is loud) ``send_a2a_message`` (in ``a2a_client.py``) now dispatches via ``a2a_response.parse(data)`` instead of inline ``"result" in data`` / ``"error" in data`` sniffing. The Queued variant returns a new ``_A2A_QUEUED_PREFIX`` sentinel so callers can distinguish "delivered async, no synchronous reply" from both success-with-text and failure. reno-stars production data caught two intermittent failures that both reduced to the same root cause: 1. File transfer announce silently failed — when CEO Ryan PC (poll-mode external molecule-mcp) sent the harmi.zip announcement to Reno Stars Business Intelligent (also poll-mode external), ``send_a2a_message`` saw the platform's poll-queued envelope ``{"status":"queued","delivery_mode":"poll","method":"..."}``, didn't recognize it as the synthetic delivery-acknowledgement it is, and returned ``[A2A_ERROR] unexpected response shape``. The agent fell back to a chunk-shipping path; receiver did get the file but operator-facing logs showed a failure that didn't actually fail. 2. Duplicated agent comm — same bug, inverted direction. d76 delegated to 67d, send_a2a_message returned the unexpected-shape error, delegate_task wrapped it as DELEGATION FAILED, the calling agent retried with sharper wording, the recipient saw the same request twice and self-reported "二次请求 — 我先不执行". External molecule-mcp standalone runtimes are inherently poll-mode (they have no public URL), so every external↔external A2A pair was hitting this on every send. The pre-fix client only handled JSON-RPC ``result``/``error`` keys and treated the queued envelope (which has neither) as malformed. RFC #2339 PR 2 added the queued envelope on the server side; the client never caught up. When ``send_a2a_message`` returns the ``_A2A_QUEUED_PREFIX`` sentinel, ``tool_delegate_task`` now transparently falls back to ``_delegate_sync_via_polling`` (RFC #2829 PR-5's durable ``/delegate`` + ``/delegations`` polling path, which DOES work for poll-mode peers because the platform's executeDelegation goroutine writes to the inbox queue and the result row arrives when the target picks it up + replies). The agent gets a real synchronous reply instead of the empty queued sentinel. * ``test_a2a_response.py`` — 62 tests, 100% line coverage on the parser (verified via ``coverage run --source=a2a_response``). Includes adversarial-input fuzzing across ~25 pathological payloads — parser must never raise. * ``test_a2a_client.py::TestSendA2AMessagePollMode`` — 4 tests for the new Queued/Error wiring in ``send_a2a_message``. * ``test_delegation_sync_via_polling.py::TestPollModeAutoFallback`` — 3 tests for the auto-fallback in ``tool_delegate_task``, including negative cases (push-mode reply must NOT trigger fallback; genuine error must NOT silently retry). * Verified all new tests FAIL on pre-fix source by stashing a2a_client.py + a2a_tools_delegation.py and re-running — 5 failures including ImportError for the missing ``_A2A_QUEUED_PREFIX``. Per the operator-debuggability directive: * INFO at every Queued classification (expected variant; operator sees normal poll-mode-peer queueing in log stream). * INFO at the auto-fallback decision in ``tool_delegate_task`` so a future operator can correlate "send returned queued → falling back to polling path" without reading the source. * WARNING at every Malformed classification (server contract drift; operator MUST see this immediately). * Existing transient-retry WARNING preserved. * Mirror Go-side typed model in workspace-server. The wire shape is documented in ``a2a_response.py``'s module docstring with file:line pointers to the canonical emitters; a future PR can introduce ``models/a2a_response.go`` without changing wire behavior. The fixture corpus in ``test_a2a_response.py`` is designed so a one-sided edit breaks CI. * ``send_message_to_user`` and ``chat_upload_receive`` use a different endpoint (``/notify``) and aren't affected by this bug; their parsing stays unchanged. * 135 tests pass across ``test_a2a_response.py`` + ``test_a2a_client.py`` + ``test_delegation_sync_via_polling.py`` + ``test_a2a_tools_impl.py``. * ``coverage run --source=a2a_response -m pytest`` reports 100% line coverage with 0 missing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 17:21:28 -07:00
Hongming Wang	146c0e7c60	fix(a2a-client): recognize poll-mode 'queued' envelope (#2967 ) workspace-server's a2a_proxy poll-mode short-circuit returns {status: "queued", delivery_mode: "poll", method: <a2a_method>} when the peer has no URL to dispatch to (poll-mode peers, including every external molecule-mcp standalone runtime). The bare send_a2a_message parser only knew about JSON-RPC {result, error} keys, so this envelope fell through to the "unexpected response shape" error path. Two production symptoms on the reno-stars tenant traced to it: 1. File transfer logged as failed when it actually succeeded — operator-facing logs showed an A2A_ERROR but the receiving workspace did get the chunked file via the agent's fallback path. 2. delegate_task retried after the false failure → peer received duplicate delegations → conversation got confused, the second peer self-diagnosed in a notify ("⚠️ Peer 二次请求 — 我先不执行"). Add a third branch to the parser, BETWEEN the existing JSON-RPC {result, error} cases and the catch-all "unexpected" fallback. The queued envelope is delivery-acknowledged-but-pending-consumption — not an error — so it returns a clean success string the agent can render as a normal outcome. The success string includes "queued" and "poll" so an operator scanning logs sees the routing path without parsing JSON. Defensive: the new branch only fires when BOTH status="queued" AND delivery_mode="poll" are present. A partial envelope (one key missing) still falls through to the catch-all, so a future server bug that emits a malformed shape gets surfaced instead of silently swallowed. Tests: - test_poll_queued_envelope_returns_success_string — pins the canonical envelope returns a non-error string. Discriminating: verified to FAIL on old code (returned [A2A_ERROR] string), PASS on new. - test_poll_queued_envelope_with_other_method — pins the parser doesn't hardcode message/send. Discriminating: also FAILS on old code. - test_status_queued_without_poll_mode_still_falls_through — pins both keys are required (defensive against future server bugs). 12 existing tests in TestSendA2AMessage still pass — no regression. Scope: hotfix for the bare send_a2a_message path. The full SSOT typed-A2AResponse refactor (#158-#163, parents under #2967) covers the broader vocabulary alignment between Go server and Python client. This PR ends the production symptoms now without preempting that work.	2026-05-05 16:58:48 -07:00
Hongming Wang	2652ea8342	fix(mcp-doctor): heartbeat (idempotent) instead of register (UPSERT) Self-review caught after #2954 landed: check_register() POSTed to /registry/register with agent_card.name="doctor-probe". The endpoint is an UPSERT, so the doctor probe overwrites the workspace's actual agent_card metadata until the real agent's next register call. An operator running `molecule-mcp doctor` against a live workspace would see their canvas briefly display "doctor-probe" as the agent name — invisible production-disruption. Switches to POST /registry/heartbeat. heartbeat only updates last_heartbeat_at (and clears awaiting_agent if needed) — the same work a normal molecule-mcp boot does every 20s in steady state, so the doctor's extra heartbeat is indistinguishable from background traffic. Function renamed check_register → check_token_auth to match what it actually does. check_register kept as back-compat alias so any external test/import still resolves. Also unified the duplicated token-resolution paths into a single _resolve_token() returning (value, source_label). Pre-fix: check_register and _resolve_token_summary read env in parallel ladders — a future env-var addition would have to touch both. New tests: - test_check_token_auth_uses_heartbeat_endpoint: mocks urlopen, asserts the URL ends in /registry/heartbeat AND does NOT contain /registry/register. Pins the load-bearing invariant so a future refactor can't silently re-route through register. - test_resolve_token_returns_value_and_label_for_env: pins the consolidated resolver returns both pieces of info from the same source-decision. - test_resolve_token_returns_none_when_missing: missing-env happy path. Verification: - 13/13 tests pass (10 existing + 3 new) - Manual stripped-env run still renders 4 FAIL + 2 WARN with actionable hints, exit 1. Refs molecule-core#2934 item 6 (doctor side-effect fix-up). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-05 16:11:08 -07:00

1 2 3 4 5

232 Commits