molecule-core

History

Hongming Wang 4b35d25d86 fix(runtime): decouple agent-card readiness from adapter.setup() Today, if `adapter.setup()` raises (most often: an LLM credential is missing/rotated), main.py crashes before the agent-card route is mounted. start.sh restart-loops, /.well-known/agent-card.json never returns 200, and the workspace is invisible to the bench/canvas — operators see "stuck booting forever" with no clear error to act on. The agent-card is a static capability advertisement (name, version, skills, supported protocols). It doesn't need a working LLM. Coupling its mount to setup() conflates availability ("am I up?") with configuration ("can I actually answer?"). They're different concerns. This change: - Builds AgentCard from `config.skills` (static names from config.yaml) BEFORE adapter.setup(), so the route mounts independent of setup state. - Wraps setup() + create_executor in try/except. On success, mounts the real DefaultRequestHandler with rich loaded_skills metadata swapped into the card in-place. On failure, mounts a JSON-RPC handler that returns -32603 "agent not configured" with the setup() exception in error.data. - Heartbeat keeps running on misconfigured boots so the platform marks the workspace as reachable-but-misconfigured rather than crash-looping. Operators redeploy with corrected env without chasing a restart loop. - initial_prompt and idle_loop are skipped on misconfigured boots — they self-fire to /, which would land in -32603 anyway, and the marker would consume on the first useless attempt. Bench impact (RFC #388 strict <120s): codex/openclaw bench-time-outs were the agent-card-never-returns-200 symptom. With this fix those runtimes serve the card immediately on EC2 boot, so the bench measures infrastructure cold-start (claude-code class: ~50–80s) instead of credential-coupled boot. Adds workspace/not_configured_handler.py (factory + module-level so behavior is unit-testable; main.py is `# pragma: no cover`) and workspace/tests/test_not_configured_handler.py (6 tests covering status code, JSON-RPC envelope shape, id-echo, malformed-body fallback, reason surfacing, batch-body safety). All 1665 existing workspace tests pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>		2026-05-04 10:22:31 -07:00
..
adapters	fix: comprehensive a2a-sdk 1.x migration sweep across workspace/	2026-04-27 09:42:57 -07:00
builtin_tools	feat(harness): coordinator phase-boundary instrumentation for RFC #2251	2026-04-28 20:11:46 -07:00
lib	feat(workspace): pre-stop serialization for pause/resume (closes #1386 )	2026-04-21 12:40:44 +00:00
molecule_audit	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00
platform_tools	feat(mcp): cross-workspace delegation routing (multi-ws PR-2)	2026-05-04 08:32:24 -07:00
plugins_registry	feat(plugin): implement MCPServerAdaptor (issue #847 )	2026-04-24 01:42:13 +00:00
policies	feat(platform): single-source-of-truth tool registry — adapters consume, no drift	2026-04-28 17:11:36 -07:00
scripts	fix(git-token-helper): close TOCTOU window + stop swallowing chmod errors (closes #1552 )	2026-04-26 08:22:29 -07:00
skill_loader	feat(skills): per-skill runtime compatibility (#119 , hermes pattern)	2026-04-27 01:57:43 -07:00
tests	fix(runtime): decouple agent-card readiness from adapter.setup()	2026-05-04 10:22:31 -07:00
.coveragerc	test(workspace): centralize pytest-cov config + 92% floor (closes #1817 )	2026-04-26 06:21:22 -07:00
a2a_cli.py	fix(runtime): use lowercase wire role for v0.3 JSON-RPC compat layer	2026-04-27 12:40:11 -07:00
a2a_client.py	feat(mcp): cross-workspace delegation routing (multi-ws PR-2)	2026-05-04 08:32:24 -07:00
a2a_executor.py	fix(a2a): route terminal Message via TaskUpdater.complete/failed in task mode	2026-05-03 04:06:45 -07:00
a2a_mcp_server.py	feat(mcp): cross-workspace delegation routing (multi-ws PR-2)	2026-05-04 08:32:24 -07:00
a2a_tools.py	feat(mcp): cross-workspace delegation routing (multi-ws PR-2)	2026-05-04 08:32:24 -07:00
adapter_base.py	feat(workspace): wire EventLog into adapter base (#119 PR-3b)	2026-05-03 01:18:19 -07:00
agent.py	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00
agents_md.py	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00
build-all.sh	fix: update workspace script comments for workspace-template → workspace rename	2026-04-18 01:48:05 -07:00
config.py	feat(workspace): event_log module + EventLogConfig (#119 PR-2)	2026-05-03 00:17:12 -07:00
configs_dir.py	fix(runtime): auto-fallback CONFIGS_DIR for non-container hosts (closes #2458 )	2026-05-01 13:07:55 -07:00
consolidation.py	fix: apply #1124 env-var defaults + scrub F1088 credentials from INCIDENT_LOG.md (#1347 )	2026-04-21 08:11:44 +00:00
coordinator.py	feat(harness): coordinator phase-boundary instrumentation for RFC #2251	2026-04-28 20:11:46 -07:00
Dockerfile	feat(workspace): 45-min gh-token refresh daemon + credential helper cache	2026-04-22 19:52:46 -07:00
entrypoint.sh	fix(workspace): credential helper security hardening (#1797 )	2026-04-23 18:14:55 +00:00
event_log.py	feat(workspace): event_log module + EventLogConfig (#119 PR-2)	2026-05-03 00:17:12 -07:00
events.py	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00
executor_helpers.py	docs(a2a): correct misleading v1-tolerance comments	2026-05-02 02:33:00 -07:00
heartbeat.py	feat(workspace): wire observability config into heartbeat + uvicorn (#119 PR-3a)	2026-05-03 01:01:57 -07:00
inbox.py	mcp: support multi-workspace external-agent registration (PR-1)	2026-05-04 08:06:00 -07:00
initial_prompt.py	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00
internal_chat_uploads.py	fix(workspace): surface errno + path on chat-upload mkdir failure	2026-05-01 11:47:53 -07:00
internal_file_read.py	feat(chat_files): rewrite Download as HTTP-forward (RFC #2312 , PR-D)	2026-04-29 15:19:02 -07:00
main.py	fix(runtime): decouple agent-card readiness from adapter.setup()	2026-05-04 10:22:31 -07:00
mcp_cli.py	fix: bot-lint nits — drop unused imports, add reason to except	2026-05-04 08:16:12 -07:00
molecule_ai_status.py	fix(runtime): replace remaining /app/ legacy paths in agent prompts + docstrings	2026-04-27 11:22:00 -07:00
not_configured_handler.py	fix(runtime): decouple agent-card readiness from adapter.setup()	2026-05-04 10:22:31 -07:00
platform_auth.py	feat(mcp): cross-workspace delegation routing (multi-ws PR-2)	2026-05-04 08:32:24 -07:00
platform_inbound_auth.py	fix(runtime): auto-fallback CONFIGS_DIR for non-container hosts (closes #2458 )	2026-05-01 13:07:55 -07:00
plugins.py	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00
preflight.py	fix(preflight): skip required_env check in MOLECULE_SMOKE_MODE	2026-05-03 03:44:05 -07:00
prompt.py	feat(prompt): Platform Capabilities preamble at top of system prompt	2026-04-29 21:31:13 -07:00
pytest.ini	feat(preflight): replace SUPPORTED_RUNTIMES static list with adapter discovery	2026-04-27 00:44:51 -07:00
rebuild-runtime-images.sh	fix: update workspace script comments for workspace-template → workspace rename	2026-04-18 01:48:05 -07:00
requirements.txt	chore(deps)(deps): update starlette requirement in /workspace	2026-05-03 01:36:45 +00:00
runtime_wedge.py	chore(workspace): drop claude_sdk_executor — Phase 2 of #87	2026-04-27 00:52:55 -07:00
shared_runtime.py	feat(platform): single-source-of-truth tool registry — adapters consume, no drift	2026-04-28 17:11:36 -07:00
smoke_mode.py	chore(smoke): runtime_wedge follow-ups from PR #2473 review	2026-05-01 18:01:51 -07:00
transcript_auth.py	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00
watcher.py	chore: open-source restructure — rename dirs, remove internal files, scrub secrets	2026-04-18 00:24:44 -07:00