Replace the anthropic:claude-sonnet-4-6 default across config, handlers,
env example, and litellm proxy config. All tests updated to match the new
default; sonnet-4-6 alias kept in litellm_config.yml for pinned workspaces.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Three bugs caused enabled schedules to silently disappear from the fire query
(which requires next_run_at IS NOT NULL AND next_run_at <= now()):
Bug 1 - fireSchedule() and recordSkipped(): when ComputeNextRun returned an
error, nextRunPtr stayed nil and UPDATE SET next_run_at = $2 wrote NULL.
Fix: change to COALESCE($2, next_run_at) so the existing DB value is preserved
when $2 is NULL, and log the error explicitly.
Bug 2 - org importer (handlers/org.go): nextRun, _ := ComputeNextRun(...)
silently discarded the error. A bad cron expression would pass time.Time{}
(zero value) to the INSERT. Fix: surface the error, log it, and skip the
schedule INSERT via continue.
Bug 3 - no startup repair: schedules already NULL'd by the pre-fix binary
would never recover. Fix: Start() now calls repairNullNextRunAt() once on
boot, recomputing next_run_at for every enabled schedule with a NULL value.
Tests: TestFireSchedule_ComputeNextRunError, TestRecordSkipped_ComputeNextRunError,
TestRepairNullNextRunAt_RepairsRows, TestRepairNullNextRunAt_DBError_NoPanic,
TestOrgImport_ScheduleComputeError (all pass).
Fixes #722

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Defense-in-depth: workspace-scoped ValidateToken now rejects tokens
belonging to workspaces with status='removed' at the DB layer, even
when revoked_at IS NULL. Mirrors the same guard added to ValidateAnyToken
in #696. Updated all test mock patterns (workspace_test, a2a_proxy_test,
secrets_test, admin_test_token_test, middleware) to match the new JOIN query.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Two targeted fixes for the A2A false-negative (delivery succeeded but caller
receives A2A_ERROR):
Body-read failure: when Do() succeeds (target sent 2xx headers — delivery
confirmed) but io.ReadAll(resp.Body) fails, proxy now returns
{"delivery_confirmed": true} in the 502 body and logs the activity as
successful. Audit trail records true delivery, not a false failed entry.
isTransientProxyError fix: the delegation retry loop now retries only 503s
with {restarting: true} (container died, message NOT delivered). A 503 with
{busy: true} signals the agent IS processing the already-delivered message,
so retrying would cause double-delivery; narrowing the predicate closes that
race.
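A hedged sketch of the narrowed predicate (signature and field names follow the commit; the real function likely inspects an *http.Response):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// isTransientProxyError retries only a 503 whose body carries
// {"restarting": true}: the container died and the message was NOT
// delivered. A 503 with {"busy": true} means the agent is already
// processing the delivered message, so retrying would double-deliver.
func isTransientProxyError(status int, body []byte) bool {
	if status != 503 {
		return false
	}
	var payload struct {
		Restarting bool `json:"restarting"`
	}
	if err := json.Unmarshal(body, &payload); err != nil {
		return false // unparseable body: do not risk double-delivery
	}
	return payload.Restarting
}

func main() {
	fmt.Println(isTransientProxyError(503, []byte(`{"restarting": true}`))) // true
	fmt.Println(isTransientProxyError(503, []byte(`{"busy": true}`)))       // false
}
```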
All 16 packages pass: go test ./...
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
ISSUE #680 — IDOR on PATCH /workspaces/:id
- Route was on the open router with no auth middleware. Any unauthenticated
caller could rename, change role, or update any field of any workspace
without credentials (zero auth + no ownership check).
- Fix: register under wsAuth (WorkspaceAuth middleware) which (a) requires a
valid bearer token and (b) validates the token belongs to the target
workspace, providing auth + ownership in a single check.
- Remove the now-redundant in-handler field-level auth block — the middleware
is a strictly stronger gate. Dead code gone.
- Remove unused `middleware` import from workspace.go.
- Update tests: two tests that asserted the old in-handler 401 are replaced
by TestWorkspaceUpdate_SensitiveField_AuthEnforcedByMiddleware (documents
that auth is now at the router layer); cosmetic-field test renamed.
ISSUE #681 — test-token endpoint auth:
- Confirmed: GET /admin/workspaces/:id/test-token already has
middleware.AdminAuth(db.DB). No change needed — finding was from older state.
Build: `go build ./...` clean. All 15 test packages pass.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
HIGH (#659-1): POST /webhooks/discord had no signature verification, allowing
any attacker to POST forged Discord slash-command payloads. Add Ed25519
verification via verifyDiscordSignature() before adapter.ParseWebhook() is
called. The function reads r.Body, verifies Ed25519(pubKey, timestamp+body,
X-Signature-Ed25519), then restores r.Body with io.NopCloser so ParseWebhook
can still read the payload. The public key is resolved from the first enabled
Discord channel's app_public_key config (plaintext — it is a public key and
not in sensitiveFields) with a fallback to DISCORD_APP_PUBLIC_KEY env var;
no key configured -> 401 (fail-closed). discordPublicKey() is the DB helper.
MEDIUM (#659-2): discord.go SendMessage() wrapped http.Client.Do errors with
%w, propagating the *url.Error which includes the full webhook URL
(https://discord.com/api/webhooks/{id}/{token}) into logs and error responses.
Replace with a static "discord: HTTP request failed" string.
Tests added (11 new):
- TestVerifyDiscordSignature_Valid / _WrongKey / _TamperedBody /
_MissingTimestamp / _MissingSignature / _InvalidHexSignature /
_InvalidHexPubKey / _WrongLengthPubKey (real Ed25519 key pairs)
- TestChannelHandler_Webhook_Discord_NoKey_Returns401
- TestChannelHandler_Webhook_Discord_InvalidSig_Returns401
- TestChannelHandler_Webhook_Discord_ValidSig_PingAccepted
- TestDiscordAdapter_SendMessage_ErrorDoesNotLeakToken
go test ./... green.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Operators and audit agents can now detect silent cron failures across all
workspaces with a single AdminAuth-gated request — no per-workspace bearer
tokens required. This closes the proactive detection gap that left issue #85
(cron died silently 10+ hours) undetectable until users noticed missing work.
Changes:
- platform/internal/handlers/admin_schedules_health.go: new AdminSchedulesHealthHandler
- GET /admin/schedules/health joins workspace_schedules + workspaces (excluding
removed workspaces), computes status (ok|stale|never_run) and
stale_threshold_seconds (2 × cron interval via scheduler.ComputeNextRun)
- computeStaleThreshold() and classifyScheduleStatus() extracted as
package-level helpers for direct unit testing
- platform/internal/handlers/admin_schedules_health_test.go: 16 tests
- Unit tests for computeStaleThreshold (5min/hourly/daily crons, invalid expr,
invalid timezone) and classifyScheduleStatus (never_run/stale/ok/zero-threshold)
- Integration tests via sqlmock: empty result, never_run classification,
stale detection, ok status, DB error → 500, multi-workspace response,
required JSON fields coverage
- platform/internal/router/router.go: register GET /admin/schedules/health
behind middleware.AdminAuth(db.DB), mirroring the /admin/liveness gate
Closes #618
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
vLLM (and Nous Hermes portal) only accept a single system message.
When the platform builds a messages array from multiple sources
(base system prompt + workspace config + per-session override), the
consecutive system entries at the front cause vLLM to reject or
silently drop all but the first.
Adds mergeSystemMessages() — a stateless pre-flight transform in the
handlers package that collapses the uninterrupted leading run of
{"role":"system"} entries into one, joining their content with "\n\n".
Non-system messages between system messages are not touched; a single
system message is returned as-is (no allocation).
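The transform can be sketched like this (a simplified struct stands in for the real JSON message maps, which also handle non-string content):

```go
package main

import "fmt"

type message struct {
	Role    string
	Content string
}

// mergeLeadingSystem collapses the uninterrupted leading run of system
// messages into one, joining content with "\n\n". Everything after the
// first non-system entry is left untouched; zero or one leading system
// messages pass through unchanged.
func mergeLeadingSystem(msgs []message) []message {
	n := 0
	for n < len(msgs) && msgs[n].Role == "system" {
		n++
	}
	if n <= 1 {
		return msgs // nothing to merge, no allocation
	}
	joined := msgs[0].Content
	for _, m := range msgs[1:n] {
		joined += "\n\n" + m.Content
	}
	out := []message{{Role: "system", Content: joined}}
	return append(out, msgs[n:]...)
}

func main() {
	merged := mergeLeadingSystem([]message{
		{"system", "base prompt"}, {"system", "workspace config"}, {"user", "hi"},
	})
	fmt.Println(len(merged)) // 2
}
```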
10 unit tests cover: stacked merge, single-unchanged, no-system passthrough,
three-message collapse, interleaved user (trailing system not merged),
only-system-messages, empty slice, nil slice, non-string content, and
assistant-leading passthrough.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Four findings from the security audit on PR #641:
FIX 1 (MEDIUM): import_url scheme validation
- Reject non-HTTPS import URLs with 400 before forwarding to CF API.
Prevents SSRF via http://, git://, ssh://, file:// etc.
FIX 2 (MEDIUM): CF 5xx error leakage
- Add cfErrMessage() helper: returns "upstream service error" for CF 5xx
responses and non-CF errors, passes through 4xx messages.
- Applied at all four CF-error response sites (Create, Get, Fork, Token).
FIX 3 (LOW): repo name validation
- Add package-level repoNameRE = ^[a-zA-Z0-9][a-zA-Z0-9_-]{0,62}$
- Validate in Create and Fork handlers when caller supplies an explicit name.
Auto-generated names ("molecule-ws-<id>") are always safe and skip validation.
FIX 4 (LOW): response body size limit in CF client
- Wrap resp.Body with io.LimitReader(1 MB) before json.NewDecoder in do().
Prevents memory exhaustion from a runaway/malicious CF response.
Tests: 16 new tests covering all four fixes (cfErrMessage 4xx/5xx/non-API,
import_url non-HTTPS cases, invalid repo names in Create and Fork).
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Add a minimal but complete integration with the Cloudflare Artifacts API
(private beta Apr 2026, public beta May 2026) — "Git for agents" versioned
workspace-snapshot storage.
## What's included
**`platform/internal/artifacts/client.go`** — typed Go HTTP client for the
CF Artifacts REST API:
- CreateRepo, GetRepo, ForkRepo, ImportRepo, DeleteRepo
- CreateToken, RevokeToken
- CF v4 response-envelope decoding; *APIError with StatusCode + Message
**`platform/internal/handlers/artifacts.go`** — four workspace-scoped
Gin handlers (all behind WorkspaceAuth middleware):
- POST /workspaces/:id/artifacts — attach or import a CF Artifacts repo
- GET /workspaces/:id/artifacts — get linked repo info (DB + live CF)
- POST /workspaces/:id/artifacts/fork — fork the workspace's repo
- POST /workspaces/:id/artifacts/token — mint a short-lived git credential
**`platform/migrations/028_workspace_artifacts.up.sql`** — `workspace_artifacts`
table: one-to-one link between a workspace and its CF Artifacts repo.
Credentials are never stored; only the credential-stripped remote URL.
**`platform/internal/router/router.go`** — wire the four routes into the
existing wsAuth group.
## Configuration
Two env vars gate the feature (the handlers return 503 when either is absent):
- CF_ARTIFACTS_API_TOKEN — Cloudflare API token with Artifacts write perms
- CF_ARTIFACTS_NAMESPACE — Cloudflare Artifacts namespace name
## Tests
- 10 client-level tests (httptest.Server + CF v4 envelope mocks)
- 14 handler-level tests (sqlmock DB + mock CF server)
- Helper unit tests for stripCredentials, cfErrToHTTP
All 21 packages pass (go test ./...).
Closes #595
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Both conflicts were comment-only — identical logic on both sides:
- registry.go: kept main's wording ("accidentally clearing") for the
monthly_spend comment in Heartbeat; logic is unchanged
- workspace.go: kept HEAD's comment (describes PR #634's clamping
behaviour: [0, maxMonthlySpend]); logic is unchanged
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Merge gate passed (all 7 gates).
- Adds budget_limit + monthly_spend columns via 027_workspace_budget
(ADD COLUMN IF NOT EXISTS — idempotent).
- A2A budget enforcement is fail-open on DB errors.
- WorkspaceAuth on all budget routes.
- Schema migration — CEO explicit authorization in chat.
- Merging before #634, which writes to monthly_spend.
Rebase on origin/fix/issue-631-migration-gap which inserts token_usage
(025) and org_plugin_allowlist (026); bump workspace_budget from 025 to
027 so the sequential runner applies all three in the correct order.
Update workspace_budget_test.go and workspace_test.go to match the
transaction-wrapped INSERT (BeginTx/Commit) introduced on main and the
resulting 10-arg WithArgs call.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Security Auditor findings on PR #611:
Fix 1 (BLOCKING): Remove budget_limit handling from Update() entirely.
PATCH /workspaces/:id uses ValidateAnyToken — any enrolled workspace bearer
could self-clear its own spending ceiling. The dedicated AdminAuth-gated
PATCH /workspaces/:id/budget is the only authorised write path.
Fix 2 (MEDIUM): Strip budget_limit and monthly_spend from Get() response
before c.JSON(). GET /workspaces/:id is on the open router — any caller
with a valid UUID must not read billing data.
Also updates four existing tests in workspace_budget_test.go that encoded
the old (insecure) behaviour, and adds three new regression tests.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Budget limit exceeded on A2A proxy now returns HTTP 402 PaymentRequired
instead of 429 TooManyRequests, matching the issue spec and the FE amber
banner check. Updates a2a_proxy.go, workspace_budget_test.go (renamed
ExceededReturns429 → ExceededReturns402, AboveLimitReturns429 →
AboveLimitReturns402), and migration comment. All go test ./... pass.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- New BudgetHandler with GetBudget and PatchBudget methods
- GET returns budget_limit (null or int64 USD cents), monthly_spend,
and computed budget_remaining (null when no limit, can be negative
when over-budget so callers can see the magnitude of the overage)
- PATCH accepts {budget_limit: int64|null}; null clears the ceiling;
validates non-negative values; re-reads DB to echo final state
- Both handlers are wired in router.go under the WorkspaceAuth group
- 14 unit tests covering happy paths, 404, 400 validation, DB errors,
over-budget state, zero limit, and clear-limit round-trip
- All 20 packages pass go test ./... and go build ./... is clean
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Migration 025: ADD COLUMN budget_limit BIGINT DEFAULT NULL and
monthly_spend BIGINT NOT NULL DEFAULT 0 to workspaces table
- Models: BudgetLimit *int64 in CreateWorkspacePayload;
MonthlySpend int64 in HeartbeatPayload
- workspace.go: scanWorkspaceRow, workspaceListQuery, Get, Create, and
Update all handle budget_limit/monthly_spend; budget_limit is gated
as a sensitiveUpdateField
- registry.go: heartbeat conditionally writes monthly_spend only when
payload.MonthlySpend > 0 (avoids overwriting with zero)
- a2a_proxy.go: checkWorkspaceBudget() returns 429 when
monthly_spend >= budget_limit (NULL = no limit; fail-open on DB error)
- Tests: 8 new workspace_budget_test.go tests + patched existing tests
for the 20-column scanWorkspaceRow and 10-param CREATE INSERT
Field type: BIGINT (int64), units: USD cents (budget_limit=500 = $5.00/month)
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
A malicious or buggy agent could report MonthlySpend = math.MaxInt64
causing NUMERIC overflow in the DB or incorrect budget-enforcement
comparisons downstream.
Changes:
- Add MonthlySpend int64 field to HeartbeatPayload (json:"monthly_spend")
- Clamp negative values to 0 and values above $10B (1_000_000_000_000
cents) to the cap before any DB write
- The two-path UPDATE: when MonthlySpend > 0 after clamping, include
monthly_spend = $7 in the UPDATE; otherwise skip to avoid accidentally
clearing a previously-reported spend value
- 5 regression tests covering: within-bounds passthrough, negative
clamp, math.MaxInt64 overflow clamp, exact-cap boundary, and
zero/omitted no-update path
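The clamp itself, as a sketch (constant name is illustrative; bounds are the ones documented above):

```go
package main

import "fmt"

// $10B expressed in USD cents, the documented upper bound.
const maxMonthlySpend int64 = 1_000_000_000_000

// clampSpend applies the documented bounds before any DB write:
// negatives go to 0, anything above the cap is pinned to the cap.
func clampSpend(cents int64) int64 {
	if cents < 0 {
		return 0
	}
	if cents > maxMonthlySpend {
		return maxMonthlySpend
	}
	return cents
}

func main() {
	fmt.Println(clampSpend(-5), clampSpend(1<<62), clampSpend(250)) // 0 1000000000000 250
}
```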
Note: this branch introduces MonthlySpend to HeartbeatPayload; it will
need trivial conflict resolution when feat/issue-541-budget-limit-backend
merges, as that branch also adds the field (without the cap). Keep this
branch's clamping logic.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Adversarial or buggy agents can report INT64_MAX token counts via A2A
responses. Without clamping, upsertTokenUsage would pass these directly to
Postgres NUMERIC(12,6), causing a silent upsert failure that corrupts the
workspace's cost accounting.
Fix: clamp input_tokens/output_tokens to [0, 10_000_000] before any
arithmetic or DB write. 10M tokens/call is well above any real LLM API
response; clamped values still produce valid cost rows.
Adds 4 regression tests:
- TestUpsertTokenUsage_615_CapsInt64Max — INT64_MAX → maxTokensPerCall
- TestUpsertTokenUsage_615_CapsNegative — negative → 0 (no DB call)
- TestUpsertTokenUsage_615_NormalValuesUnchanged — passthrough for normal counts
- TestUpsertTokenUsage_615_ExactlyAtCap — at-cap value accepted unchanged
Closes #615
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Add an org-scoped allowlist table so org admins can restrict which plugins
workspace agents are allowed to install. An empty allowlist means
allow-all (backward-compatible with existing deployments).
• migrations/027_org_plugin_allowlist.{up,down}.sql — new table + unique
index on (org_id, plugin_name)
• handlers/org_plugin_allowlist.go — resolveOrgID, checkOrgPluginAllowlist
(fail-open on DB errors), GetAllowlist, PutAllowlist (atomic tx replace)
• handlers/org_plugin_allowlist_test.go — 23 unit tests covering all
handler paths, resolveOrgID, and all checkOrgPluginAllowlist branches
• handlers/plugins_install.go — allowlist gate between resolveAndStage and
deliverToContainer; returns 403 if plugin is blocked
• router/router.go — GET/PUT /orgs/:id/plugins/allowlist under AdminAuth
All tests pass; go build ./... clean; gosec Issues: 0
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Migration 026 adds workspace_token_usage table (uuid pk, workspace_id FK with
CASCADE, period_start TIMESTAMPTZ, input_tokens, output_tokens, call_count,
estimated_cost_usd NUMERIC(12,6), updated_at) with a UNIQUE index on
(workspace_id, period_start) for day-granularity upserts.
A2A proxy (proxyA2ARequest) now spawns a detached goroutine after each
successful call to extractAndUpsertTokenUsage, which:
1. Parses usage.input_tokens / usage.output_tokens from result.usage
(JSON-RPC wrapper) with fallback to top-level usage (direct Anthropic).
2. Calls upsertTokenUsage — INSERT ... ON CONFLICT DO UPDATE so multi-
call days accumulate correctly. Estimated cost = input×$0.000003 +
output×$0.000015 (Claude Sonnet default; adjustable in a later phase).
Token tracking never blocks the critical A2A path.
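The pricing arithmetic above, as a sketch (function name is illustrative; the stored value goes into NUMERIC(12,6)):

```go
package main

import "fmt"

// estimatedCostUSD applies the documented per-token rates: input tokens at
// $0.000003 each, output tokens at $0.000015 each (Claude Sonnet defaults
// per this commit; adjustable in a later phase).
func estimatedCostUSD(inputTokens, outputTokens int64) float64 {
	return float64(inputTokens)*0.000003 + float64(outputTokens)*0.000015
}

func main() {
	// 1,000 input + 500 output tokens: 0.003 + 0.0075 = 0.0105 USD
	fmt.Printf("%.6f\n", estimatedCostUSD(1000, 500)) // 0.010500
}
```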
New endpoint: GET /workspaces/:id/metrics (wsAuth — WorkspaceAuth bearer
bound to :id). Returns:
{"input_tokens":N,"output_tokens":N,"total_calls":N,
"estimated_cost_usd":"0.000000","period_start":"...","period_end":"..."}
404 if workspace missing. Period is current UTC day.
11 new tests (4 handler + 7 parse-unit); 19/19 packages pass.
Closes #593
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
- Add in-process SSE subscription mechanism to Broadcaster (SubscribeSSE,
deliverToSSE) so both RecordAndBroadcast *and* BroadcastOnly fan out to
SSE subscribers — critical because BroadcastOnly skips Redis pub/sub and
would be invisible to a Redis-only subscriber (AGENT_MESSAGE, A2A_RESPONSE,
TASK_UPDATED are all BroadcastOnly events).
- Add handlers/sse.go: SSEHandler.StreamEvents sets text/event-stream headers,
checks workspace existence (404 if missing), subscribes via broadcaster, and
wraps each WSMessage in an AG-UI envelope:
data: {"type":"<event>","timestamp":<unix_ms>,"data":{...}}\n\n
- Register wsAuth.GET("/workspaces/:id/events/stream") behind existing
WorkspaceAuth middleware — bearer token bound to :id.
- Add 6 tests: Content-Type, initial ping, AG-UI format, workspace filter
(cross-workspace events not leaked), 404 on missing workspace, multiple
sequential events.
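The AG-UI envelope each WSMessage is wrapped in can be sketched as (struct and helper names are assumptions; the field layout follows the commit):

```go
package main

import (
	"encoding/json"
	"fmt"
)

type envelope struct {
	Type      string      `json:"type"`
	Timestamp int64       `json:"timestamp"`
	Data      interface{} `json:"data"`
}

// sseFrame renders one server-sent event in the documented shape:
// data: {"type":"<event>","timestamp":<unix_ms>,"data":{...}}\n\n
func sseFrame(event string, unixMs int64, data interface{}) string {
	b, _ := json.Marshal(envelope{Type: event, Timestamp: unixMs, Data: data})
	return "data: " + string(b) + "\n\n"
}

func main() {
	fmt.Print(sseFrame("AGENT_MESSAGE", 1700000000000, map[string]string{"text": "hi"}))
}
```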
All 19 packages pass. Build clean.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Root cause: the github-app-auth plugin injects GH_TOKEN + GITHUB_TOKEN
into each workspace container's env at provision time (EnvMutator). Those
are GitHub App installation tokens with a fixed ~60 min TTL. The plugin
has an in-process cache that proactively refreshes 5 min before expiry —
but the workspace env is set once at container start and never updated.
Any workspace alive >60 min ends up with an expired token.
Fix (Option B — on-demand endpoint):
pkg/provisionhook:
- Add TokenProvider interface: Token(ctx) (token, expiresAt, error)
Lives in pkg/ (public) so the github-app-auth plugin can implement it.
- Add Registry.FirstTokenProvider() — discovers the first mutator that
also satisfies TokenProvider via interface assertion. Safe under
concurrent reads (existing RWMutex).
platform/internal/handlers/github_token.go:
- New GitHubTokenHandler serving GET /admin/github-installation-token
- Delegates to the registered TokenProvider (plugin cache — always fresh)
- 404 if no GitHub App configured, 500 + [github] prefix log on error
- Never logs the token itself
platform/internal/handlers/workspace.go:
- Add TokenRegistry() getter so the router can wire the handler without
coupling to WorkspaceHandler internals
platform/internal/router/router.go:
- Register GET /admin/github-installation-token under AdminAuth
workspace-template/:
- scripts/molecule-git-token-helper.sh — git credential helper; calls
the platform endpoint on every push/fetch; falls through to next
helper (operator PAT) if platform unreachable
- entrypoint.sh — configure the credential helper at startup
Why Option B over Option A (background goroutine):
- The plugin already has its own cache refresh; nothing to refresh here.
- Pushing env updates into running containers requires docker exec, which
the architecture explicitly rejects (issue #547 "Alternatives").
- Pull-based is stateless, trivially testable, zero extra goroutines.
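The interface-assertion discovery can be sketched like this (Registry and the mutator type here are simplified stand-ins, and the real Token takes a ctx):

```go
package main

import (
	"fmt"
	"time"
)

// TokenProvider mirrors the interface described above (ctx omitted here).
type TokenProvider interface {
	Token() (string, time.Time, error)
}

type EnvMutator interface{ Mutate(env map[string]string) error }

type Registry struct{ mutators []EnvMutator }

// FirstTokenProvider returns the first registered mutator that also
// satisfies TokenProvider, discovered via an interface assertion.
func (r *Registry) FirstTokenProvider() TokenProvider {
	for _, m := range r.mutators {
		if tp, ok := m.(TokenProvider); ok {
			return tp
		}
	}
	return nil
}

// ghAuth is a hypothetical plugin implementing both interfaces.
type ghAuth struct{}

func (ghAuth) Mutate(env map[string]string) error { return nil }
func (ghAuth) Token() (string, time.Time, error) {
	return "ghs_fresh", time.Now().Add(55 * time.Minute), nil
}

func main() {
	r := &Registry{mutators: []EnvMutator{ghAuth{}}}
	tok, _, _ := r.FirstTokenProvider().Token()
	fmt.Println(tok) // ghs_fresh
}
```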
Closes #547
Co-authored-by: Molecule AI DevOps Engineer <devops-engineer@agents.moleculesai.app>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
When a workspace delegated a task to itself, it would acquire
_run_lock twice on the same goroutine mutex, blocking permanently.
Add an early-return guard in `DelegationHandler.Delegate` that
returns HTTP 400 {"error": "self-delegation not permitted"} as soon
as sourceID == body.TargetID, before any DB or A2A work is done.
Adds TestDelegate_SelfDelegation_Rejected to delegation_test.go.
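A minimal sketch of the guard (names are illustrative; the real check lives at the top of DelegationHandler.Delegate):

```go
package main

import "fmt"

// rejectSelfDelegation returns the early 400 when source and target
// workspace IDs match, before any DB or A2A work; a zero status means
// delegation may proceed.
func rejectSelfDelegation(sourceID, targetID string) (status int, body string) {
	if sourceID == targetID {
		return 400, `{"error": "self-delegation not permitted"}`
	}
	return 0, ""
}

func main() {
	status, body := rejectSelfDelegation("ws-1", "ws-1")
	fmt.Println(status, body) // 400 {"error": "self-delegation not permitted"}
}
```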
Closes #548
Co-authored-by: Molecule AI Backend Engineer <backend-engineer@agents.moleculesai.app>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
`CreateWorkspacePayload` was missing a `Secrets` field, so any
`secrets: { KEY: value }` included in a POST /workspaces body was
silently dropped by ShouldBindJSON.
Changes:
- Add `Secrets map[string]string` field to `CreateWorkspacePayload`
- Wrap workspace INSERT in a DB transaction; iterate over secrets,
encrypt each value via `crypto.Encrypt`, and upsert into
`workspace_secrets` within the same tx — rollback both on any failure
- Add `mock.ExpectBegin()`/`mock.ExpectCommit()`/`mock.ExpectRollback()`
to all existing Create tests that were missing transaction expectations
- Add 3 new tests: WithSecrets_Persists, SecretPersistFails_RollsBack,
EmptySecrets_OK
Closes #545
Co-authored-by: Molecule AI Backend Engineer <backend-engineer@agents.moleculesai.app>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
No env vars to configure. The platform auto-detects the backend:
MOLECULE_ORG_ID set → SaaS tenant → control plane provisioner
MOLECULE_ORG_ID empty → self-hosted → Docker provisioner
The control plane URL defaults to https://api.moleculesai.app (override
with CP_PROVISION_URL for testing). No FLY_API_TOKEN on the tenant.
Removed: direct Fly provisioner (FlyProvisioner) — all SaaS workspace
provisioning goes through the control plane which holds the Fly token
and manages billing, quotas, and cleanup.
Two backends: CPProvisioner (SaaS) and Docker Provisioner (self-hosted).
Closes #494
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
PATCH /workspaces/:id field-level auth for parent_id/tier/runtime
required a bearer token, blocking canvas nesting (drag-to-nest).
Added IsSameOriginCanvas check so the tenant canvas can update
sensitive fields without a bearer.
Exported IsSameOriginCanvas from middleware package so workspace.go
can call it for the field-level auth path.
DELETE /workspaces/:id is behind AdminAuth which already has the
same-origin check — if delete still fails, it's a different issue.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
When CONTAINER_BACKEND=flyio, workspaces are provisioned as Fly Machines
instead of local Docker containers. This enables workspace deployment
on SaaS tenants where no Docker daemon is available.
New files:
- provisioner/fly_provisioner.go: FlyProvisioner with Start/Stop/
IsRunning/Restart/Close via Fly Machines API (api.machines.dev/v1)
- FlyRuntimeImages maps runtimes to GHCR image tags
Changes:
- main.go: select Docker vs Fly based on CONTAINER_BACKEND env var
- workspace.go: SetFlyProvisioner() setter, Create checks flyProv first
- workspace_provision.go: provisionWorkspaceFly() loads secrets, calls
FlyProvisioner.Start, issues auth token for the new machine
Env vars for Fly backend:
- CONTAINER_BACKEND=flyio (activates Fly provisioner)
- FLY_API_TOKEN (Fly deploy token)
- FLY_WORKSPACE_APP (Fly app name for workspace machines)
- FLY_REGION (default: ord)
Closes #494
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1. Settings panel: wire TokensTab into "API Tokens" tab (was imported
but not rendered). Rename "API Keys" → "Secrets", add "API Tokens"
tab. Fix docs link → doc.moleculesai.app/docs/tokens.
2. Referer match hardening: require exact host match or trailing slash
to prevent evil.com subdomain bypass. Cache CANVAS_PROXY_URL at
init time instead of per-request os.Getenv.
3. Extract shared deriveWsBaseUrl() to lib/ws-url.ts — eliminates
duplicate 12-line derivation in socket.ts and TerminalTab.tsx.
4. Token list pagination: add ?limit= and ?offset= params (default
50, max 200) to GET /workspaces/:id/tokens.
507/507 canvas tests pass, Go build + vet clean.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two adjacent fixes that surfaced trying to bring the molecule-dev org
template back up against the new standalone workspace-template-* repos.
1) handlers/org.go — expand ${VAR} in workspace_dir before validation.
The molecule-dev pm/workspace.yaml (and any operator's per-host
binding) ships `workspace_dir: ${WORKSPACE_DIR}` so each operator
can pick the host path PM bind-mounts. Without expansion the literal
"${WORKSPACE_DIR}" string reaches validateWorkspaceDir and fails with
"must be an absolute path", aborting the whole org import.
Other fields (channel config, prompts) already go through expandWithEnv;
workspace_dir was the last hold-out.
2) provisioner/provisioner.go — inject PYTHONPATH=/app for every
workspace container. Standalone template Dockerfiles COPY adapter.py
to /app and set ENV ADAPTER_MODULE=adapter, but molecule-runtime is
a pip console_script entry point so cwd isn't on sys.path
automatically. Setting PYTHONPATH here fixes every adapter image at
once instead of needing 8 PRs against template repos. Operator
override still wins (workspace EnvVars are appended after, so Docker
takes the later duplicate).
Note: this unblocks the import path but does NOT make claude-code /
hermes / etc. boot. The runtime itself has a separate top-level
`from adapters import` that breaks against modular templates —
tracked at workspace-runtime#1.
Tests: TestBuildContainerEnv_InjectsPYTHONPATH +
TestBuildContainerEnv_WorkspaceEnvVarsCanOverridePYTHONPATH lock the
default + operator-override invariants. expandWithEnv is already covered
by TestExpandWithEnv_* — the workspace_dir use site is a one-line call
to that primitive.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Add `provisionhook.EnvMutator` extension point so out-of-tree plugins
(e.g. github-app-auth, vault-secrets) can inject or override env vars
right before container Start, without forking core or piling more
provider-specific code into the handlers package.
WorkspaceHandler gains an optional `envMutators *provisionhook.Registry`
wired in via SetEnvMutators during boot. The hook fires after built-in
secret loads + per-agent git identity, so plugins can both read what's
already there and override anything they own (GIT_AUTHOR_*, GITHUB_TOKEN).
A nil registry is a no-op via Registry.Run's nil-receiver branch — keeps
the hot path a single nil compare and means existing flows stay green
even with zero plugins registered.
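The nil-receiver branch can be sketched as (types simplified; the real registry also guards registration with an RWMutex):

```go
package main

import "fmt"

type EnvMutator func(env map[string]string) error

type Registry struct{ mutators []EnvMutator }

// Run is a no-op on a nil receiver, so handlers can call it
// unconditionally: with zero plugins registered the hot path is a
// single nil compare.
func (r *Registry) Run(env map[string]string) error {
	if r == nil {
		return nil
	}
	for _, m := range r.mutators {
		if err := m(env); err != nil {
			return err // first error aborts provisioning
		}
	}
	return nil
}

func main() {
	var r *Registry // nil: no plugins registered
	fmt.Println(r.Run(map[string]string{"GIT_AUTHOR_NAME": "agent"})) // <nil>
}
```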
Mutator failure aborts provisioning and marks the workspace failed with
the wrapped error in last_sample_error. Failing fast surfaces the cause
to the operator instead of letting an agent boot into opaque "git push
401" loops it can never recover from on its own.
Tests cover ordered execution, chained env visibility, first-error abort,
nil-receiver no-op, nil-mutator drop, registration order, and concurrent
register-vs-run safety (-race clean).
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Closes #460, #461.
**#460 — YAML injection via unquoted skill/prompt filenames**
`generateDefaultConfig` extracted skill directory names and prompt file
names from user-supplied `body.Files` keys and wrote them directly into
YAML list items without quoting:
cfg.WriteString(" - " + s + "\n")
`validateRelPath` only blocks path traversal (`../`); it does NOT block
YAML control characters including newlines. On Linux, filenames can
contain newlines, so an attacker with any live workspace bearer token
could submit:
{"files": {"skills/legit\nruntime: malicious/SKILL.md": "# skill"}}
The generated config.yaml would then contain `runtime: malicious` as a
top-level YAML key, overriding the runtime for workspaces provisioned
from the template.
Fix: extract `yamlEscape` as a reusable local from the same
`strings.NewReplacer` already used for the `name` field (#221) and apply
it to both the `skills:` and `prompt_files:` list items, wrapping each
in double-quotes.
**#461 — Docker error details in ReplaceFiles 500 responses**
`ReplaceFiles` returned `fmt.Sprintf("failed to write files: %v", err)`
in two 500 paths, where `err` comes from Docker API calls and may include
internal container names, volume names, and daemon error messages.
Fix: log the full error server-side and return a static opaque string to
the caller.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Container rebuild or volume wipe caused workspaces to lose /configs/.auth_token.
On re-registration the platform returned no auth_token (HasAnyLiveToken==true →
no re-issue), leaving the workspace unable to authenticate any subsequent API call.
Fix: provisionWorkspaceOpts now calls issueAndInjectToken before Start(). This
revokes any existing live tokens (plaintext is irrecoverable from the stored hash,
so rotation is the only safe path) and issues a fresh token that is written into
cfg.ConfigFiles[".auth_token"]. WriteFilesToContainer delivers it to /configs
immediately after ContainerStart, racing safely ahead of the Python adapter's
1-2s startup time.
Failure modes are soft: revoke or issue errors skip injection with a warning;
provisioning continues and the workspace recovers on the next restart.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>