molecule-core

Author	SHA1	Message	Date
rabbitblood	762b38fa30	fix(org-import): limit concurrent Docker provisioning to 3 (#1084 ) The org import fired all workspace provisioning goroutines concurrently, overwhelming Docker when creating 39+ containers. Containers timed out, leaving workspaces stuck in 'provisioning' with no schedules or hooks. Fix: - Add provisionConcurrency=3 semaphore limiting concurrent Docker ops - Increase workspaceCreatePacingMs from 50ms to 2000ms between siblings - Pass semaphore through createWorkspaceTree recursion With 39 workspaces at 3 concurrent + 2s pacing, import takes ~30s instead of timing out. Each workspace gets its full template: schedules, hooks, settings, hierarchy. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-20 10:08:17 -07:00
Hongming Wang	2730a20194	Merge pull request #1067 from Molecule-AI/fix/tenant-workspace-auth fix(workspace-server): send X-Molecule-Admin-Token on CP calls	2026-04-20 08:39:49 -07:00
molecule-ai[bot]	bb53ff86b0	Merge pull request #1069 from Molecule-AI/fix/github-token-refresh-1068 fix: GitHub token refresh — WorkspaceAuth path for credential helper (#1068)	2026-04-20 08:37:46 -07:00
Hongming Wang	6c4d1ae4db	test(workspace-server): cover Stop/IsRunning/Close + auth-header + transport errors Closes review gap: pre-PR coverage on CPProvisioner was 37%. After this commit every exported method is exercised: - NewCPProvisioner 100% - authHeaders 100% - Start 91.7% (remainder: json.Marshal error path, unreachable with fixed-type request struct) - Stop 100% (new — header + path + error) - IsRunning 100% (new — 4-state matrix + auth) - Close 100% (new — contract no-op) New cases assert both auth headers (shared secret + admin_token) land on every outbound request, transport failures surface clear errors on Start/Stop, and IsRunning doesn't misreport on transport failure. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 08:37:39 -07:00
rabbitblood	b1bb5f838a	fix: GitHub token refresh — add WorkspaceAuth path for credential helper (#1068 ) PR #729 tightened AdminAuth to require ADMIN_TOKEN, breaking the workspace credential helper which called /admin/github-installation-token with a workspace bearer token. Tokens expired after 60 min with no refresh. Fix: Add /workspaces/:id/github-installation-token under WorkspaceAuth so any authenticated workspace can refresh its GitHub token. Keep the admin path as backward-compatible alias. Update molecule-git-token-helper.sh to use the workspace-scoped path when WORKSPACE_ID is set. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-20 08:30:02 -07:00
Hongming Wang	d3386ad620	fix(workspace-server): send X-Molecule-Admin-Token on CP calls controlplane #118 + #130 made /cp/workspaces/* require a per-tenant admin_token header in addition to the platform-wide shared secret. Without it, every workspace provision / deprovision / status call now 401s. ADMIN_TOKEN is already injected into the tenant container by the controlplane's Secrets Manager bootstrap, so this is purely a header-plumbing change — no new config required on the tenant side. ## Change - CPProvisioner carries adminToken alongside sharedSecret - New authHeaders method sets BOTH auth headers on every outbound request (old authHeader deleted — single call site was misleading once the semantics changed) - Empty values on either header are no-ops so self-hosted / dev deployments without a real CP still work ## Tests Renamed + expanded cp_provisioner_test cases: - TestAuthHeaders_NoopWhenBothEmpty — self-hosted path - TestAuthHeaders_SetsBothWhenBothProvided — prod happy path - TestAuthHeaders_OnlyAdminTokenWhenSecretEmpty — transition window Full workspace-server suite green. ## Rollout Next tenant provision will ship an image with this commit merged. Existing tenants (none in prod right now — hongming was the only one and was purged earlier today) will auto-update via the 5-min image-pull cron. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-20 08:17:50 -07:00
rabbitblood	16a245f96a	Merge branch 'staging' of https://github.com/Molecule-AI/molecule-core into staging	2026-04-20 01:15:39 -07:00
rabbitblood	4683e78ced	chore: gitignore org-templates/ and plugins/ entirely These directories are cloned from their standalone repos (molecule-ai-org-template-, molecule-ai-plugin-) and should never be committed to molecule-core directly. Removed the !/org-templates/molecule-dev/ exception that allowed PR #1056 to land template files in the wrong repo. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-20 01:10:16 -07:00
Hongming Wang	2ac2cea689	Merge pull request #1055 from Molecule-AI/feat/initial-memory-seeding-1050 feat: seed initial memories from org template config (#1050)	2026-04-20 01:03:00 -07:00
rabbitblood	ff7ac87b97	feat: seed initial memories from org template and create payload (#1050 ) Add MemorySeed model and initial_memories support at three levels: - POST /workspaces payload: seed memories on workspace creation - org.yaml workspace config: per-workspace initial_memories with defaults fallback - org.yaml global_memories: org-wide GLOBAL scope memories seeded on the first root workspace during import Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-20 00:35:49 -07:00
Hongming Wang	e345aa832a	Merge pull request #1033 from Molecule-AI/bugfixes/platform-handler-fixes fix: platform handler bug fixes (a2a proxy, secrets, terminal, webhooks)	2026-04-19 22:24:39 -07:00
Hongming Wang	05e2132d92	Merge pull request #1031 from Molecule-AI/fix/remove-baked-oauth-token-1028 fix: remove hardcoded CLAUDE_CODE_OAUTH_TOKEN from provisioner (#1028)	2026-04-19 22:24:36 -07:00
Hongming Wang	f124e2f404	Merge pull request #1030 from Molecule-AI/fix/1027-disable-schedules-on-workspace-delete fix: disable schedules on workspace delete (#1027)	2026-04-19 22:24:33 -07:00
Molecule AI Platform Engineer	32f23d26b0	fix: multiple platform handler bug fixes - secrets.go: Log RowsAffected errors instead of silently discarding them - a2a_proxy.go: Add 60s safety timeout to a2aClient HTTP client - terminal.go: Fix defer ordering - always close WebSocket conn on error, only defer resp.Close() after successful exec attach - webhooks.go: Add shortSHA() helper to safely handle empty HeadSHA Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-20 05:01:01 +00:00
rabbitblood	30fc869c13	test: add cascade schedule disable tests for #1027 - TestWorkspaceDelete_DisablesSchedules — leaf workspace delete disables its schedules - TestWorkspaceDelete_CascadeDisablesDescendantSchedules — parent+child+grandchild cascade - TestWorkspaceDelete_ScheduleDisableOnlyTargetsDeletedWorkspace — negative test Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-19 22:00:50 -07:00
rabbitblood	639b4dbb9f	fix: stop hardcoding CLAUDE_CODE_OAUTH_TOKEN in required_env (#1028 ) The provisioner was unconditionally writing CLAUDE_CODE_OAUTH_TOKEN into config.yaml's required_env for all claude-code workspaces. When the baked token expired, preflight rejected every workspace — even those with a valid token injected via the secrets API at runtime. Changes: - workspace_provision.go: remove hardcoded required_env for claude-code and codex runtimes; tokens are injected at container start via secrets - workspace_provision_test.go: flip assertion to reject hardcoded token Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-19 21:56:21 -07:00
rabbitblood	a139687071	fix: disable schedules when workspace is deleted (#1027 ) When a workspace is deleted (status set to 'removed'), its schedules remained enabled, causing the scheduler to keep firing cron jobs for non-existent containers. Add a cascade disable query alongside the existing token revocation and canvas layout cleanup. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-19 21:53:30 -07:00
Hongming Wang	19cae2986c	Merge pull request #1023 from Molecule-AI/feat/productivity-boost-event-crons-autopush feat: event-driven crons + auto-push hook for agent productivity	2026-04-19 20:34:06 -07:00
rabbitblood	46c20731e6	feat: event-driven cron triggers + auto-push hook for agent productivity Three changes to boost agent throughput: 1. Event-driven cron triggers (webhooks.go): GitHub issues/opened events fire all "pick-up-work" schedules immediately. PR review/submitted events fire "PR review" and "security review" schedules. Uses next_run_at=now() so the scheduler picks them up on next tick. 2. Auto-push hook (executor_helpers.py): After every task completion, agents automatically push unpushed commits and open a PR targeting staging. Guards: only on non-protected branches with unpushed work. Uses /usr/local/bin/git and /usr/local/bin/gh wrappers with baked-in GH_TOKEN. Never crashes the agent — all errors logged and continued. 3. Integration (claude_sdk_executor.py): auto_push_hook() called in the _execute_locked finally block after commit_memory. Closes productivity gap where agents wrote code but never pushed, and where work crons only fired on timers instead of reacting to events. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-19 20:26:35 -07:00
Hongming Wang	85f594d108	Merge pull request #1007 from Molecule-AI/fix/scheduler-defer-busy-969 fix(scheduler): defer cron fires when workspace busy instead of skipping (#969)	2026-04-19 20:21:16 -07:00
Hongming Wang	ccaa0a6b8a	Merge pull request #1012 from Molecule-AI/ci/codeql-workflow-covers-main ci(codeql): scan main + staging via workflow (UI can't multi-branch)	2026-04-19 14:37:41 -07:00
Hongming Wang	07ec90a23c	ci(codeql): cover main + staging via workflow GitHub's UI-configured "Code quality" scan only fires on the default branch (staging), which leaves every staging→main promotion PR unscanned. The "On push and pull requests to" field in the UI has no dropdown; multi-branch scanning on private repos without GHAS isn't available there. Workflow file gives us the control we can't get in the UI: triggers on push + pull_request for both branches. Runs on the same self-hosted mac mini via [self-hosted, macos, arm64]. upload: never — GHAS isn't enabled on this repo so the SARIF upload API 403s. Keep results locally, filter to error+warning severity, fail the PR check on findings, publish SARIF as a workflow artifact. Flipping upload: never → always after GHAS is enabled (if ever) is a one-line change. Picks up the review-flagged improvements from the earlier closed PR: - jq install step (brew, no assumption it's present) - severity filter (error+warning only, drops noisy note-level) - set -euo pipefail - SARIF glob (file name doesn't match matrix language id) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 14:34:04 -07:00
Hongming Wang	0bea001ef3	Merge pull request #1008 from Molecule-AI/fix/ci-canary-verify-self-hosted fix(ci): move canary-verify to self-hosted runner	2026-04-19 11:41:11 -07:00
Hongming Wang	53c55097f8	fix(ci): move canary-verify to self-hosted runner GitHub-hosted ubuntu-latest runs on this repo hit "recent account payments have failed or your spending limit needs to be increased" — same root cause as the publish + CodeQL + molecule-app workflow moves earlier this quarter. canary-verify was the last one still on ubuntu-latest. Switches both jobs to [self-hosted, macos, arm64]. crane install switched from Linux tarball to brew (matches promote-latest.yml's install pattern + avoids /usr/local/bin write perms on the shared mac mini). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 11:26:41 -07:00
rabbitblood	a3d30c1ece	fix(scheduler): defer cron fires when workspace busy instead of skipping (#969 ) Previously, the scheduler skipped cron fires entirely when a workspace had active_tasks > 0 (#115). This caused permanent cron misses for workspaces kept perpetually busy by the 5-min Orchestrator pulse — work crons (pick-up-work, PR review) were skipped every fire because the agent was always processing a delegation. Measured impact on Dev Lead: 17 context-deadline-exceeded timeouts in 2 hours, ~30% of inter-agent messages silently dropped. Fix: when workspace is busy, poll every 10s for up to 2 minutes waiting for idle. If idle within the window, fire normally. If still busy after 2 min, fall back to the original skip behavior. This is a minimal, safe change: - No new goroutines or channels - Same fire path once idle - Bounded wait (2 min max, won't block the scheduler pool) - Falls back to skip if workspace never becomes idle Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-19 08:38:14 -07:00
Hongming Wang	ec14cb2975	Merge pull request #1006 from Molecule-AI/feat/tos-gate-eu-notice feat(canvas): ToS gate modal + us-east-2 data residency notice	2026-04-19 07:54:15 -07:00
Hongming Wang	dd5654b803	feat(canvas): ToS gate modal + us-east-2 data residency notice Wraps /orgs in a TermsGate that polls /cp/auth/terms-status on mount and overlays a blocking modal when the current terms version hasn't been accepted yet. "I agree" POSTs /cp/auth/accept-terms and dismisses the modal; the backend records IP + UA as GDPR Art. 7 proof-of-consent. Also adds a short data residency notice under the page header: workspaces run in AWS us-east-2 (Ohio, US). An EU region selector is a future lift once the infra is provisioned there. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 07:44:47 -07:00
Hongming Wang	adf43c50d8	Merge pull request #1005 from Molecule-AI/feat/credits-phase-5-ui feat(canvas): Phase 5 — credit balance pill + low-balance banner	2026-04-19 07:32:44 -07:00
Hongming Wang	18894bebe8	feat(canvas): Phase 5 — credit balance pill + low-balance banner Adds the UI surface for the credit system to /orgs: - CreditsPill next to each org row. Tone shifts from zinc → amber at 10% of plan to red at zero. - LowCreditsBanner appears under the pill for running orgs when the balance crosses thresholds: overage_used > 0 → "overage active", balance <= 0 → "out of credits, upgrade", trial tail → "trial almost out". - Pure helpers extracted to lib/credits.ts so formatCredits, pillTone, and bannerKind are unit-tested without jsdom. Backend List query now returns credits_balance / plan_monthly_credits / overage_used_credits / overage_cap_credits so no second round-trip is needed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 07:27:29 -07:00
Hongming Wang	24e8c5affd	Merge pull request #1004 from Molecule-AI/staging promote: staging → main — brew cleanup fix	2026-04-19 05:56:18 -07:00
Hongming Wang	eeb19f2584	Merge pull request #1003 from Molecule-AI/ci/promote-latest-self-hosted ci(promote-latest): suppress brew cleanup perm-denied	2026-04-19 05:56:01 -07:00
Hongming Wang	7619e44802	ci(promote-latest): suppress brew cleanup that hits perm-denied on shared runner	2026-04-19 05:55:45 -07:00
Hongming Wang	57596a5b7a	Merge pull request #1002 from Molecule-AI/staging promote: staging → main — self-hosted promote-latest	2026-04-19 05:54:22 -07:00
Hongming Wang	37cc0a004c	Merge pull request #1001 from Molecule-AI/ci/promote-latest-self-hosted ci(promote-latest): run on self-hosted mac mini	2026-04-19 05:53:54 -07:00
Hongming Wang	fb2c126ed1	ci(promote-latest): run on self-hosted mac mini (GH-hosted quota blocked)	2026-04-19 05:53:39 -07:00
Hongming Wang	5de0110dd1	Merge pull request #1000 from Molecule-AI/staging promote: staging → main — promote-latest workflow + codeql self-hosted	2026-04-19 05:52:06 -07:00
Hongming Wang	756e57b788	Merge pull request #999 from Molecule-AI/ci/promote-latest-workflow ci(promote-latest): workflow_dispatch retag :staging-<sha> → :latest	2026-04-19 05:43:45 -07:00
Hongming Wang	5a67c6be4a	ci(promote-latest): workflow_dispatch to retag :staging-<sha> → :latest Escape hatch for the initial rollout window (canary fleet not yet provisioned, so canary-verify.yml's automatic promotion doesn't fire) AND for manual rollback scenarios. Uses the default GITHUB_TOKEN which carries write:packages on repo- owned GHCR images, so no new secrets are needed. crane handles the remote retag without pulling or pushing layers. Validates the src tag exists before retagging + verifies the :latest digest post-retag so a typo can't silently promote the wrong image. Trigger from Actions → promote-latest → Run workflow → enter the short sha (e.g. "4c1d56e"). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 05:42:48 -07:00
Hongming Wang	e9be46ab0b	Merge pull request #997 from Molecule-AI/staging promote: staging → main — unblock publish workflow (private-repo plugin clone)	2026-04-19 05:34:39 -07:00
Hongming Wang	9ecadaf573	Merge pull request #996 from Molecule-AI/fix/publish-clone-plugin-sibling fix(ci): clone sibling plugin repo so publish-workspace-server-image builds	2026-04-19 05:32:01 -07:00
Hongming Wang	ac85ee2a0d	fix(ci): clone sibling plugin repo so publish-workspace-server-image builds Publish has been failing since the 2026-04-18 open-source restructure (#964's merge) because workspace-server/Dockerfile still COPYs ./molecule-ai-plugin-github-app-auth/ but the restructure moved that code out to its own repo. Every main merge since has produced a "failed to compute cache key: /molecule-ai-plugin-github-app-auth: not found" error — prod images haven't moved. Fix: add an actions/checkout step that fetches the plugin repo into the build context before docker build runs. Private-repo safe: uses PLUGIN_REPO_PAT secret (fine-grained PAT with Contents:Read on Molecule-AI/molecule-ai-plugin-github-app-auth). Falls back to the default GITHUB_TOKEN if the plugin repo is public. Ops: set repo secret PLUGIN_REPO_PAT before the next main merge, or publish will fail with a 404 on the checkout step. Also gitignores the cloned dir so local dev builds don't accidentally commit it. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 05:19:31 -07:00
Hongming Wang	62b409423d	Merge pull request #995 from Molecule-AI/staging promote: staging → main — #994 post-checkout UX	2026-04-19 04:35:34 -07:00
Hongming Wang	ede6597cc0	Merge pull request #994 from Molecule-AI/feat/canvas-post-checkout-redirect feat(canvas): post-checkout UX — Stripe success lands on /orgs with live banner	2026-04-19 04:32:02 -07:00
Hongming Wang	3021785391	Merge pull request #993 from Molecule-AI/staging promote: staging → main — canary infra + /orgs + env refresh + perf	2026-04-19 04:26:13 -07:00
Hongming Wang	26b89400ed	test(canvas): bump billing test for /orgs success_url	2026-04-19 04:26:01 -07:00
Hongming Wang	d77378294b	feat(canvas): post-checkout UX — Stripe success lands on /orgs with banner Two small polish items that together close the signup-to-running-tenant flow for real users: 1. Stripe success_url now points at /orgs?checkout=success instead of the current page (was pricing). The old behavior left people staring at plan cards with no indication payment went through — the new behavior drops them right onto their org list where they can watch the status flip. 2. /orgs shows a green "Payment confirmed, workspace spinning up" banner when it sees ?checkout=success, then clears the query param via replaceState so a reload doesn't show it again. 3. /orgs now polls every 5s while any org is awaiting_payment or provisioning. Users see the Stripe webhook's effect live — no manual refresh needed — and once every org settles the polling stops so idle tabs don't hammer /cp/orgs. Paired with PR #992 (the /orgs page itself) this makes the end-to-end flow on BILLING_REQUIRED=true deployments feel right: /pricing → Stripe → /orgs?checkout=success → banner → live poll → "Open" button when org.status transitions to running. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 04:18:32 -07:00
Hongming Wang	08e37f3c87	Merge pull request #992 from Molecule-AI/feat/canvas-orgs-landing feat(canvas): /orgs landing page for post-signup users	2026-04-19 04:15:50 -07:00
Hongming Wang	b29ffb9546	feat(canvas): /orgs landing page for post-signup users CP's Callback handler redirects every new WorkOS session to APP_URL/orgs, but canvas had no such route — new users hit the canvas Home component, which tries to call /workspaces on a tenant that doesn't exist yet, and saw a confusing error. This PR plugs that gap with a dedicated landing page that: - Bounces anonymous visitors back to /cp/auth/login - Zero-org users see a slug-picker (POST /cp/orgs, refresh) - For each existing org, shows status + CTA: * awaiting_payment → amber "Complete payment" → /pricing?org=… * running → emerald "Open" → https://<slug>.moleculesai.app * failed → "Contact support" → mailto * provisioning → read-only "provisioning…" - Surfaces errors inline with a Retry button Deliberately server-light: one GET /cp/orgs, no WebSocket, no canvas store hydration. Goal is to move the user from signup to either Stripe Checkout or their tenant URL with one click each. Closes the last UX gap between the BILLING_REQUIRED gate landing on the CP and real users being able to complete a signup today. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-19 04:13:54 -07:00
Hongming Wang	393ecc74e3	Merge pull request #991 from Molecule-AI/perf/scheduler-returning-clause perf(scheduler): collapse empty-run bump to single RETURNING query	2026-04-19 03:48:42 -07:00
Hongming Wang	69d5af6636	Merge pull request #990 from Molecule-AI/fix/cp-provisioner-tests test(ws-server): CPProvisioner coverage — auth, env fallback, error paths	2026-04-19 03:48:40 -07:00

1 2 3 4 5 ...

1067 Commits