molecule-core

Author	SHA1	Message	Date
Molecule AI SDK-Dev	0506e0cabc	Merge main into staging - resolving 1,388 commit divergence for PR #1573 Main→staging sync: bring staging up to date with main. All conflicts resolved to main's version (newer state).	2026-04-22 13:54:53 +00:00
rabbitblood	6e6de392d9	chore: remove org-templates/molecule-dev from git tracking This directory belongs in the dedicated repo Molecule-AI/molecule-ai-org-template-molecule-dev. It should be cloned locally for platform mounting, never committed to molecule-core. The .gitignore already blocks it. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-20 11:47:13 -07:00
rabbitblood	8da2275c14	feat(template): restructure molecule-dev org template to 39-agent hierarchy Comprehensive rewrite of the Molecule AI dev team org template: - Rename agents to {team}-{role} convention (e.g., core-be, cp-lead, app-qa) - Add 5 new team leads: Core Platform Lead, Controlplane Lead, App & Docs Lead, Infra Lead, SDK Lead - Add new roles: Release Manager, Integration Tester, Technical Writer, Infra-SRE, Infra-Runtime-BE, SDK-Dev, Plugin-Dev - Delete triage-operator and triage-operator-2 (leads own triage now) - Set default model to MiniMax-M2.7, tier 3, idle_interval_seconds 900 - Update org.yaml category_routing to new agent names - Add orchestrator-pulse schedules for all leads (/5 cron) - Add pick-up-work schedules for engineers (/15 cron) - Add qa-review schedules for QA agents (/15 cron) - Add security-scan schedules for security agents (/30 cron) - Add release-cycle and e2e-test schedules for Release Manager and Integration Tester - Update marketing agents with web search MCP and media generation capabilities - All schedule prompts reference Molecule-AI/internal for PLAN.md and known-issues.md - Un-ignore org-templates/molecule-dev/ in .gitignore for version tracking Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-20 00:43:15 -07:00
rabbitblood	15ac834239	feat(template): expand dev team to 30+ roles with multi-repo coverage - org.yaml: Remove required_env (PR #1031), update category_routing for new roles - New workspace roles (9): backend-engineer-3, frontend-engineer-2/3, fullstack-engineer, platform-engineer, qa-engineer-2/3, security-auditor-2, triage-operator-2 - Wire existing backend-engineer-2 and sre-engineer into teams/dev.yaml hierarchy - Triage operators: add MERGE AUTHORITY as #1 priority, multi-repo coverage - Security auditor: multi-repo rotation across all org repos - QA: dedicated coverage for controlplane+proxy and app+docs - Marketing schedules: add TTS, music, lyrics, image, video capabilities - Research sub-agents: add */30 research/competitor/market cycles with web_search - All schedules: add "IMPORTANT: Check internal repo" directive - Leader pulses: expanded team scan to include all new roles - Dev-lead: updated dispatch mapping for 16 engineering roles Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-20 00:29:56 -07:00
Hongming Wang	479a027e4b	chore: open-source restructure — rename dirs, remove internal files, scrub secrets Renames: - platform/ → workspace-server/ (Go module path stays as "platform" for external dep compat — will update after plugin module republish) - workspace-template/ → workspace/ Removed (moved to separate repos or deleted): - PLAN.md — internal roadmap (move to private project board) - HANDOFF.md, AGENTS.md — one-time internal session docs - .claude/ — gitignored entirely (local agent config) - infra/cloudflare-worker/ → Molecule-AI/molecule-tenant-proxy - org-templates/molecule-dev/ → standalone template repo - .mcp-eval/ → molecule-mcp-server repo - test-results/ — ephemeral, gitignored Security scrubbing: - Cloudflare account/zone/KV IDs → placeholders - Real EC2 IPs → <EC2_IP> in all docs - CF token prefix, Neon project ID, Fly app names → redacted - Langfuse dev credentials → parameterized - Personal runner username/machine name → generic Community files: - CONTRIBUTING.md — build, test, branch conventions - CODE_OF_CONDUCT.md — Contributor Covenant 2.1 All Dockerfiles, CI workflows, docker-compose, railway.toml, render.yaml, README, CLAUDE.md updated for new directory names. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-18 00:24:44 -07:00
Hongming Wang	d8026347e5	chore: open-source restructure — rename dirs, remove internal files, scrub secrets Renames: - platform/ → workspace-server/ (Go module path stays as "platform" for external dep compat — will update after plugin module republish) - workspace-template/ → workspace/ Removed (moved to separate repos or deleted): - PLAN.md — internal roadmap (move to private project board) - HANDOFF.md, AGENTS.md — one-time internal session docs - .claude/ — gitignored entirely (local agent config) - infra/cloudflare-worker/ → Molecule-AI/molecule-tenant-proxy - org-templates/molecule-dev/ → standalone template repo - .mcp-eval/ → molecule-mcp-server repo - test-results/ — ephemeral, gitignored Security scrubbing: - Cloudflare account/zone/KV IDs → placeholders - Real EC2 IPs → <EC2_IP> in all docs - CF token prefix, Neon project ID, Fly app names → redacted - Langfuse dev credentials → parameterized - Personal runner username/machine name → generic Community files: - CONTRIBUTING.md — build, test, branch conventions - CODE_OF_CONDUCT.md — Contributor Covenant 2.1 All Dockerfiles, CI workflows, docker-compose, railway.toml, render.yaml, README, CLAUDE.md updated for new directory names. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-18 00:24:44 -07:00
molecule-ai[bot]	9542348ebf	fix(opencode): add full MCP path to opencode.json URL Security Auditor FINDING-1: bare ${MOLECULE_MCP_URL} missing the router path. Fix adds /workspaces/${WORKSPACE_ID}/mcp so opencode reaches MCPHandler. Unblocks PR#842 merge.	2026-04-17 22:06:05 +00:00
molecule-ai[bot]	bf80f15619	fix(opencode): add full MCP path to opencode.json URL Security Auditor FINDING-1: bare ${MOLECULE_MCP_URL} missing the router path. Fix adds /workspaces/${WORKSPACE_ID}/mcp so opencode reaches MCPHandler. Unblocks PR#842 merge.	2026-04-17 22:06:05 +00:00
molecule-ai[bot]	abcc31f5b1	feat(opencode): add org-template opencode.json with header-based MCP auth (closes #813 )	2026-04-17 19:26:10 +00:00
molecule-ai[bot]	745a256b53	feat(opencode): add org-template opencode.json with header-based MCP auth (closes #813 )	2026-04-17 19:26:10 +00:00
molecule-ai[bot]	97978a911a	docs: reference AGENTS.md auto-generation in system prompt template (fixes #781 ) Add org-templates/molecule-dev/system-prompt.md as a canonical org-level shared-context template for all molecule-dev org agents. The Communication section explains that /workspace/AGENTS.md is auto-generated at startup from config.yaml (via agents_md.py / PR #763), describes the AAIF format it follows, explains the GET /workspace/AGENTS.md peer-discovery contract, and tells agents to keep their config.yaml name/role/description accurate as the sole source of truth. Also restructure the /org-templates/ gitignore rule from a hard directory-ignore to a content-glob pattern so this specific reference template can be tracked while all other cloned standalone-repo content remains ignored. Co-authored-by: Molecule AI Documentation Specialist <documentation-specialist@agents.moleculesai.app> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 17:16:50 +00:00
molecule-ai[bot]	c50c1ec70c	docs: reference AGENTS.md auto-generation in system prompt template (fixes #781 ) Add org-templates/molecule-dev/system-prompt.md as a canonical org-level shared-context template for all molecule-dev org agents. The Communication section explains that /workspace/AGENTS.md is auto-generated at startup from config.yaml (via agents_md.py / PR #763), describes the AAIF format it follows, explains the GET /workspace/AGENTS.md peer-discovery contract, and tells agents to keep their config.yaml name/role/description accurate as the sole source of truth. Also restructure the /org-templates/ gitignore rule from a hard directory-ignore to a content-glob pattern so this specific reference template can be tracked while all other cloned standalone-repo content remains ignored. Co-authored-by: Molecule AI Documentation Specialist <documentation-specialist@agents.moleculesai.app> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-17 17:16:50 +00:00
Hongming Wang	8e304e69e8	chore: remove extracted directories, add manifest-driven Docker builds Remove plugins/, workspace-configs-templates/, org-templates/ dirs (now in standalone repos). Add manifest.json listing all 33 repos and scripts/clone-manifest.sh to clone them. Both Dockerfiles now use the manifest script instead of 33 hardcoded git-clone lines. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 04:13:29 -07:00
Hongming Wang	d424bd947f	chore: remove extracted directories, add manifest-driven Docker builds Remove plugins/, workspace-configs-templates/, org-templates/ dirs (now in standalone repos). Add manifest.json listing all 33 repos and scripts/clone-manifest.sh to clone them. Both Dockerfiles now use the manifest script instead of 33 hardcoded git-clone lines. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 04:13:29 -07:00
rabbitblood	40a69d6f87	feat(org-templates): Phase 4 — atomize each role to <role>/workspace.yaml Part 4 of 4 — terminal step of the org.yaml scalability refactor. Each role in the molecule-dev template now owns its own workspace.yaml file, colocated with the existing system-prompt.md / initial-prompt.md / idle-prompt.md / schedules/.md. Team files shrink to a leader's own definition plus a list of !include refs. ## Platform change `resolveYAMLIncludes` now uses a TWO-ROOT model: - Path resolution is relative to the INCLUDING file's directory (natural sibling + cousin refs, C-include / Sass @import convention). - Security bound is the ORIGINAL org root (`rootDir`), preserved across all recursion depths. Sibling-dir refs like `../my-role/workspace.yaml` from a team file are now allowed (they stay inside the org template); refs that escape the root still error. Regression coverage: new `TestResolveYAMLIncludes_SiblingDirAccess` reproduces the Phase 4 pattern (team file at `teams/x.yaml` referencing `../<role>/workspace.yaml`) — fails without the fix, passes with. ## Template change Atomized 15 child workspaces across 3 team files: - `teams/research.yaml`: 58 → 30 lines; 3 children now !include refs - `teams/dev.yaml`: 222 → 38 lines; 6 children now !include refs - `teams/marketing.yaml`: 143 → 28 lines; 6 children now !include refs Each role now has `<role>/workspace.yaml` colocated with its prompts. Example `frontend-engineer/` directory: frontend-engineer/ ├── workspace.yaml (24 lines — name/role/tier/canvas/plugins/...) ├── system-prompt.md (from earlier phases) ├── initial-prompt.md ├── idle-prompt.md └── (no schedules for this role — but if added, schedules/<slug>.md) ## File-size progression across all 4 phases \| State \| org.yaml \| total `.yaml` in tree \| \|---\|---:\|---:\| \| Before (main) \| 1801 lines / 108 KB \| 1801 / 108 KB (one file) \| \| After Phase 1 (#389) \| 1687 \| 1687 / 101 KB \| \| After Phase 2 (#390) \| 676 \| 676 / 35 KB \| \| After Phase 3 (#393) \| 114 \| 683 (1 + 6 teams) / 33 KB \| \| After this PR* \| 114 \| ~698 (1 + 6 + 15 workspace) / 35 KB \| Aggregate size is flat — the decrease came from prompt externalization in Phases 1/2; Phases 3/4 reorganize structure without adding content. The win is readability and ownership: - Every individual file fits on 1-2 screens. - Adding a new role is now: create `<role>/` dir, add `workspace.yaml` + `system-prompt.md` + prompts, add ONE `!include` line to the team file. No touching of aggregated mega-YAML. - Team files can be reviewed + merged independently. ## Tests All 10 `TestResolveYAMLIncludes_*` tests pass, including the real-template integration test (`TestResolveYAMLIncludes_RealMoleculeDev`) which now walks org.yaml → teams/pm.yaml → teams/research.yaml → ../market-analyst/ workspace.yaml and validates the full 21-role tree unmarshals cleanly. Plus all existing `TestResolvePromptRef` + `TestOrgYAML` + `TestInitialPrompt` suites stay green. ## Ops followup After merging all 4 phases and deploying, the `POST /org/import` endpoint should produce a workspace tree byte-identical to the pre-refactor state. Verify with: diff <(curl POST /org/import before) <(curl POST /org/import after) or by spot-checking: - `/configs/config.yaml` bodies across all 21 workspaces - `workspace_schedules.prompt` row values The externalization is lossless — YAML literal to file and back recovers the same string modulo trailing-whitespace normalization. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 03:09:56 -07:00
rabbitblood	665f7f6313	feat(org-templates): Phase 4 — atomize each role to <role>/workspace.yaml Part 4 of 4 — terminal step of the org.yaml scalability refactor. Each role in the molecule-dev template now owns its own workspace.yaml file, colocated with the existing system-prompt.md / initial-prompt.md / idle-prompt.md / schedules/.md. Team files shrink to a leader's own definition plus a list of !include refs. ## Platform change `resolveYAMLIncludes` now uses a TWO-ROOT model: - Path resolution is relative to the INCLUDING file's directory (natural sibling + cousin refs, C-include / Sass @import convention). - Security bound is the ORIGINAL org root (`rootDir`), preserved across all recursion depths. Sibling-dir refs like `../my-role/workspace.yaml` from a team file are now allowed (they stay inside the org template); refs that escape the root still error. Regression coverage: new `TestResolveYAMLIncludes_SiblingDirAccess` reproduces the Phase 4 pattern (team file at `teams/x.yaml` referencing `../<role>/workspace.yaml`) — fails without the fix, passes with. ## Template change Atomized 15 child workspaces across 3 team files: - `teams/research.yaml`: 58 → 30 lines; 3 children now !include refs - `teams/dev.yaml`: 222 → 38 lines; 6 children now !include refs - `teams/marketing.yaml`: 143 → 28 lines; 6 children now !include refs Each role now has `<role>/workspace.yaml` colocated with its prompts. Example `frontend-engineer/` directory: frontend-engineer/ ├── workspace.yaml (24 lines — name/role/tier/canvas/plugins/...) ├── system-prompt.md (from earlier phases) ├── initial-prompt.md ├── idle-prompt.md └── (no schedules for this role — but if added, schedules/<slug>.md) ## File-size progression across all 4 phases \| State \| org.yaml \| total `.yaml` in tree \| \|---\|---:\|---:\| \| Before (main) \| 1801 lines / 108 KB \| 1801 / 108 KB (one file) \| \| After Phase 1 (#389) \| 1687 \| 1687 / 101 KB \| \| After Phase 2 (#390) \| 676 \| 676 / 35 KB \| \| After Phase 3 (#393) \| 114 \| 683 (1 + 6 teams) / 33 KB \| \| After this PR* \| 114 \| ~698 (1 + 6 + 15 workspace) / 35 KB \| Aggregate size is flat — the decrease came from prompt externalization in Phases 1/2; Phases 3/4 reorganize structure without adding content. The win is readability and ownership: - Every individual file fits on 1-2 screens. - Adding a new role is now: create `<role>/` dir, add `workspace.yaml` + `system-prompt.md` + prompts, add ONE `!include` line to the team file. No touching of aggregated mega-YAML. - Team files can be reviewed + merged independently. ## Tests All 10 `TestResolveYAMLIncludes_*` tests pass, including the real-template integration test (`TestResolveYAMLIncludes_RealMoleculeDev`) which now walks org.yaml → teams/pm.yaml → teams/research.yaml → ../market-analyst/ workspace.yaml and validates the full 21-role tree unmarshals cleanly. Plus all existing `TestResolvePromptRef` + `TestOrgYAML` + `TestInitialPrompt` suites stay green. ## Ops followup After merging all 4 phases and deploying, the `POST /org/import` endpoint should produce a workspace tree byte-identical to the pre-refactor state. Verify with: diff <(curl POST /org/import before) <(curl POST /org/import after) or by spot-checking: - `/configs/config.yaml` bodies across all 21 workspaces - `workspace_schedules.prompt` row values The externalization is lossless — YAML literal to file and back recovers the same string modulo trailing-whitespace normalization. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 03:09:56 -07:00
rabbitblood	112c28d885	feat(org-templates): Phase 3 — !include directive + split org.yaml into team files Part 3 of 4 in the scalability refactor. Adds YAML `!include` support to the org importer and splits molecule-dev/org.yaml (676 lines post- Phase 2) into 6 team / role files; top-level org.yaml drops to 114 lines of pure scaffolding. ## Platform changes New `platform/internal/handlers/org_include.go`: - `resolveYAMLIncludes(data, baseDir)` — pre-processes a YAML document, expanding any scalar tagged `!include <path>` with the parsed content of the referenced file. - Path resolution via `resolveInsideRoot` so a crafted `!include ../../etc/passwd` can't escape the org template directory (same defense the existing `files_dir` copy uses). - Nested includes supported: each included file carries its own search root (its directory), so `teams/pm.yaml` with `!include research.yaml` resolves to `teams/research.yaml` — matching the convention of C-include / Sass @import / most package systems. - Cycle detection via visited-set keyed on absolute path; belt-and- braces `maxIncludeDepth = 16` cap in case symlinks or path normalization defeats the set. - Inline-template mode (POST /org/import with raw JSON body, no `dir`) errors cleanly when a file ref is used — can't resolve without a base. Wired into both `ListTemplates` (so /org/templates shows an accurate workspace count after the split) and `Import` (expansion happens before unmarshal into OrgTemplate). ## Template changes molecule-dev/org.yaml now contains only: - name + description - defaults (runtime, plugins, category_routing, initial_prompt text) - `workspaces: [!include teams/pm.yaml, !include teams/marketing.yaml]` New files: - `teams/pm.yaml` — PM top-level, children are !include refs - `teams/research.yaml` — Research Lead + Market Analyst + Technical Researcher + Competitive Intelligence (inline children) - `teams/dev.yaml` — Dev Lead + FE/BE/DevOps/Security/QA/UIUX (inline) - `teams/marketing.yaml` — Marketing Lead + DevRel/PMM/Content/ Community/SEO/Social (inline) - `teams/documentation-specialist.yaml` — leaf - `teams/triage-operator.yaml` — leaf ## File-size impact \| State \| org.yaml lines \| total config size \| \|---\|---:\|---:\| \| Before (main) \| 1801 \| 108 KB \| \| After Phase 1 (#389) \| 1687 \| 101 KB \| \| After Phase 2 (#390) \| 676 \| 35 KB \| \| After this PR \| 114 \| 4 KB (org.yaml only) \| With the 6 team files (total ~570 lines of structural yaml), every file is now under 230 lines and individually readable without scrolling past a single team's boundaries. ## Tests `platform/internal/handlers/org_include_test.go` — 9 cases: - Flat include (single file, single workspace) - Nested include (file → file → file) - Traversal rejection (`../secret.yaml`, `../../secret.yaml`) - Cycle detection (a↔b) - Empty path error - Missing file error - Inline-template error (baseDir empty) - No-op when YAML has no includes (safety: we always run the preprocessor) - Integration: load the real `org-templates/molecule-dev/org.yaml`, resolve includes, unmarshal into OrgTemplate, verify PM + Marketing Lead are top-level and PM has ≥4 children after expansion. All 9 pass + existing `TestResolvePromptRef` + `TestOrgYAML` suites stay green. ## Ownership implication Each team file can now be owned + reviewed independently. When the marketing team adds a 7th role, the diff is in `teams/marketing.yaml` alone — no merge conflicts against PM or research changes in the same review window. Same for the eventual engineer team, security team, etc. ## What's next - Phase 4 (queued): per-workspace atomization. Each role gets `<role>/workspace.yaml`; team files shrink to a list of !include refs. Terminal step in the scalability arc — at that point adding a new role is one new file under `org-templates/molecule-dev/<role>/` plus one line in the team's manifest. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 07:49:56 +00:00
rabbitblood	24a882ccc9	feat(org-templates): Phase 3 — !include directive + split org.yaml into team files Part 3 of 4 in the scalability refactor. Adds YAML `!include` support to the org importer and splits molecule-dev/org.yaml (676 lines post- Phase 2) into 6 team / role files; top-level org.yaml drops to 114 lines of pure scaffolding. ## Platform changes New `platform/internal/handlers/org_include.go`: - `resolveYAMLIncludes(data, baseDir)` — pre-processes a YAML document, expanding any scalar tagged `!include <path>` with the parsed content of the referenced file. - Path resolution via `resolveInsideRoot` so a crafted `!include ../../etc/passwd` can't escape the org template directory (same defense the existing `files_dir` copy uses). - Nested includes supported: each included file carries its own search root (its directory), so `teams/pm.yaml` with `!include research.yaml` resolves to `teams/research.yaml` — matching the convention of C-include / Sass @import / most package systems. - Cycle detection via visited-set keyed on absolute path; belt-and- braces `maxIncludeDepth = 16` cap in case symlinks or path normalization defeats the set. - Inline-template mode (POST /org/import with raw JSON body, no `dir`) errors cleanly when a file ref is used — can't resolve without a base. Wired into both `ListTemplates` (so /org/templates shows an accurate workspace count after the split) and `Import` (expansion happens before unmarshal into OrgTemplate). ## Template changes molecule-dev/org.yaml now contains only: - name + description - defaults (runtime, plugins, category_routing, initial_prompt text) - `workspaces: [!include teams/pm.yaml, !include teams/marketing.yaml]` New files: - `teams/pm.yaml` — PM top-level, children are !include refs - `teams/research.yaml` — Research Lead + Market Analyst + Technical Researcher + Competitive Intelligence (inline children) - `teams/dev.yaml` — Dev Lead + FE/BE/DevOps/Security/QA/UIUX (inline) - `teams/marketing.yaml` — Marketing Lead + DevRel/PMM/Content/ Community/SEO/Social (inline) - `teams/documentation-specialist.yaml` — leaf - `teams/triage-operator.yaml` — leaf ## File-size impact \| State \| org.yaml lines \| total config size \| \|---\|---:\|---:\| \| Before (main) \| 1801 \| 108 KB \| \| After Phase 1 (#389) \| 1687 \| 101 KB \| \| After Phase 2 (#390) \| 676 \| 35 KB \| \| After this PR \| 114 \| 4 KB (org.yaml only) \| With the 6 team files (total ~570 lines of structural yaml), every file is now under 230 lines and individually readable without scrolling past a single team's boundaries. ## Tests `platform/internal/handlers/org_include_test.go` — 9 cases: - Flat include (single file, single workspace) - Nested include (file → file → file) - Traversal rejection (`../secret.yaml`, `../../secret.yaml`) - Cycle detection (a↔b) - Empty path error - Missing file error - Inline-template error (baseDir empty) - No-op when YAML has no includes (safety: we always run the preprocessor) - Integration: load the real `org-templates/molecule-dev/org.yaml`, resolve includes, unmarshal into OrgTemplate, verify PM + Marketing Lead are top-level and PM has ≥4 children after expansion. All 9 pass + existing `TestResolvePromptRef` + `TestOrgYAML` suites stay green. ## Ownership implication Each team file can now be owned + reviewed independently. When the marketing team adds a 7th role, the diff is in `teams/marketing.yaml` alone — no merge conflicts against PM or research changes in the same review window. Same for the eventual engineer team, security team, etc. ## What's next - Phase 4 (queued): per-workspace atomization. Each role gets `<role>/workspace.yaml`; team files shrink to a list of !include refs. Terminal step in the scalability arc — at that point adding a new role is one new file under `org-templates/molecule-dev/<role>/` plus one line in the team's manifest. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 07:49:56 +00:00
Hongming Wang	159197ed4a	feat(org-templates): Phase 2 — bulk migrate 20 roles to file-ref prompts (#395 ) Part 2 of 4 in the org.yaml scalability refactor. Follows PR #389 which added platform support; this PR completes the migration for every role in the `molecule-dev` template. ## Scope All 20 remaining roles moved from inline YAML literals to sibling .md files under their existing `files_dir`: - PM, Research Lead, Dev Lead, Marketing Lead (4 leaders) - Market Analyst, Technical Researcher, Competitive Intelligence (research) - Frontend/Backend/DevOps Engineer, Security Auditor, QA Engineer, UIUX Designer, Triage Operator (dev team) - DevRel, PMM, Content Marketer, Community Manager, SEO Growth Analyst, Social Media Brand (marketing team) Per workspace, externalized (where present): - `initial_prompt: \|...` → `initial-prompt.md` + `initial_prompt_file:` - `idle_prompt: \|...` → `idle-prompt.md` + `idle_prompt_file:` - `schedules[].prompt: \|...` → `schedules/<slug>.md` + `prompt_file:` Totals: 17 initial-prompt files, 12 idle-prompt files, 18 schedule files (47 new files). ## File-size impact \| Before (main) \| After Phase 1 \| After Phase 2 \| Reduction \| \|---\|---\|---\|---\| \| 1801 lines \| 1687 lines \| 676 lines \| -62.5%* \| \| 108 KB \| 101 KB \| 35 KB \| -67% \| org.yaml is now pure structural scaffolding (name / role / tier / model / canvas / plugins / channels / children / category_routing / schedules metadata). Readable end-to-end on one screen per team. ## How the migration was driven A Python round-trip script (using `ruamel.yaml` to preserve comments + formatting) walked the workspace tree recursively, wrote prompts to files keyed by `files_dir`, and replaced inline keys with `_file:` refs. Zero manual YAML hand-editing beyond the Phase 1 Documentation Specialist proof. Script is one-shot; not committed. Slug convention for schedule files: lowercase the schedule name, replace non-alphanumeric with `-`, collapse, cap 60 chars. Examples: - "Orchestrator pulse" → `orchestrator-pulse.md` - "Hourly template fitness audit" → `hourly-template-fitness-audit.md` - "Code quality audit (every 12h)" → `code-quality-audit-every-12h.md` ## Backwards compatibility Fully compatible — Phase 1's resolver prefers inline when both are set, so a future one-off experiment can still drop inline YAML. The migration doesn't remove inline support, just stops using it. ## Verification - [x] `python -c "yaml.safe_load(...)"` on edited org.yaml — parses clean - [x] Walk-and-inspect script: every workspace has exactly the expected `_file:` refs, zero `INLINE_` markers remain - [x] All 47 extracted .md files non-empty + trimmed - [x] `go test -run 'TestResolvePromptRef\|TestOrgYAML\|TestInitialPrompt'` passes (from Phase 1 platform work) - [ ] Post-merge: live `POST /org/import` against a fresh workspace, diff the resulting `/configs/config.yaml` + `workspace_schedules` rows against the pre-migration values (should be identical bodies) ## What's next - Phase 3 (queued):* YAML `!include` directive for org.yaml; split the remaining 676 lines into `teams/{research,dev,marketing,ops}.yaml`. - Phase 4 (queued): per-workspace atomization; each role owns its own `workspace.yaml` manifest. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 00:47:32 -07:00
Hongming Wang	f075c49af1	feat(org-templates): Phase 2 — bulk migrate 20 roles to file-ref prompts (#395 ) Part 2 of 4 in the org.yaml scalability refactor. Follows PR #389 which added platform support; this PR completes the migration for every role in the `molecule-dev` template. ## Scope All 20 remaining roles moved from inline YAML literals to sibling .md files under their existing `files_dir`: - PM, Research Lead, Dev Lead, Marketing Lead (4 leaders) - Market Analyst, Technical Researcher, Competitive Intelligence (research) - Frontend/Backend/DevOps Engineer, Security Auditor, QA Engineer, UIUX Designer, Triage Operator (dev team) - DevRel, PMM, Content Marketer, Community Manager, SEO Growth Analyst, Social Media Brand (marketing team) Per workspace, externalized (where present): - `initial_prompt: \|...` → `initial-prompt.md` + `initial_prompt_file:` - `idle_prompt: \|...` → `idle-prompt.md` + `idle_prompt_file:` - `schedules[].prompt: \|...` → `schedules/<slug>.md` + `prompt_file:` Totals: 17 initial-prompt files, 12 idle-prompt files, 18 schedule files (47 new files). ## File-size impact \| Before (main) \| After Phase 1 \| After Phase 2 \| Reduction \| \|---\|---\|---\|---\| \| 1801 lines \| 1687 lines \| 676 lines \| -62.5%* \| \| 108 KB \| 101 KB \| 35 KB \| -67% \| org.yaml is now pure structural scaffolding (name / role / tier / model / canvas / plugins / channels / children / category_routing / schedules metadata). Readable end-to-end on one screen per team. ## How the migration was driven A Python round-trip script (using `ruamel.yaml` to preserve comments + formatting) walked the workspace tree recursively, wrote prompts to files keyed by `files_dir`, and replaced inline keys with `_file:` refs. Zero manual YAML hand-editing beyond the Phase 1 Documentation Specialist proof. Script is one-shot; not committed. Slug convention for schedule files: lowercase the schedule name, replace non-alphanumeric with `-`, collapse, cap 60 chars. Examples: - "Orchestrator pulse" → `orchestrator-pulse.md` - "Hourly template fitness audit" → `hourly-template-fitness-audit.md` - "Code quality audit (every 12h)" → `code-quality-audit-every-12h.md` ## Backwards compatibility Fully compatible — Phase 1's resolver prefers inline when both are set, so a future one-off experiment can still drop inline YAML. The migration doesn't remove inline support, just stops using it. ## Verification - [x] `python -c "yaml.safe_load(...)"` on edited org.yaml — parses clean - [x] Walk-and-inspect script: every workspace has exactly the expected `_file:` refs, zero `INLINE_` markers remain - [x] All 47 extracted .md files non-empty + trimmed - [x] `go test -run 'TestResolvePromptRef\|TestOrgYAML\|TestInitialPrompt'` passes (from Phase 1 platform work) - [ ] Post-merge: live `POST /org/import` against a fresh workspace, diff the resulting `/configs/config.yaml` + `workspace_schedules` rows against the pre-migration values (should be identical bodies) ## What's next - Phase 3 (queued):* YAML `!include` directive for org.yaml; split the remaining 676 lines into `teams/{research,dev,marketing,ops}.yaml`. - Phase 4 (queued): per-workspace atomization; each role owns its own `workspace.yaml` manifest. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 00:47:32 -07:00
Hongming Wang	ce0e793673	feat(org-templates): Phase 1 — externalize prompt bodies to sibling files (#389 ) Part 1 of 4 in the scalability refactor. Each role can now keep its initial_prompt / idle_prompt / schedule prompts as sibling .md files under files_dir/; inline YAML literals still work for backwards-compat. ## What changes Platform (org.go importer): - `OrgWorkspace` gains `InitialPromptFile`, `IdlePrompt`, `IdlePromptFile`, `IdleIntervalSeconds`. The idle_* fields were previously dropped by the org importer entirely — struct didn't declare them — which is why engineer idle_prompts never propagated from org.yaml to live /configs (I've been manually docker-cp'ing them in every maintenance cron). - `OrgSchedule` gains `PromptFile`. Hourly/weekly cron prompts are the largest bodies in org.yaml (1-5 KB each) and get resolved at import time just like initial_prompt. - `OrgDefaults` gains the same idle_* + _file fields for org-wide fallback. - New `resolvePromptRef(inline, fileRef, orgBaseDir, filesDir)` helper — the single chokepoint for inline-vs-file resolution. Inline wins when both are set. File refs route through `resolveInsideRoot` so a crafted ref can't escape the org template directory (same traversal defense as files_dir). - `createWorkspaceTree` now injects idle_prompt + idle_interval_seconds into the workspace's config.yaml (previously missing — that's the second half of the idle-prompt propagation bug). Tests:* - `org_prompt_ref_test.go` — 10 cases: inline-wins, file-read-when-empty, both-empty, defaults-level resolution, inline-template mode errors, traversal rejection (via file ref AND via files_dir), missing-file errors, and YAML-unmarshal parsing for each new field. Proof migration: - Documentation Specialist (biggest role at 6.9 KB of prompts) moves from inline YAML to `documentation-specialist/{initial-prompt.md, schedules/daily-docs-sync.md, schedules/weekly-terminology-audit.md}`. - org.yaml drops 1801 → 1687 lines (-6.3%) from just this one role. ## Why this matters org.yaml is 108 KB of which 67 KB (62%) is prompt text. At the current 12-role template size that's already unreadable; the marketing + triage- operator additions pushed it to 1801 lines. The 4-phase refactor aims: - Phase 1 (this PR): platform support + 1 role proof. - Phase 2: migrate remaining ~20 roles to file refs. Target: org.yaml at ~600 lines of pure structural scaffolding. - Phase 3: YAML `!include` preprocessor — split org.yaml into teams/{research,dev,marketing,ops}.yaml shards. - Phase 4: per-workspace atomization — each role gets its own workspace.yaml manifest; org.yaml composes them. ## Backwards compatibility - Inline `initial_prompt: \|` / `prompt: \|` / `idle_prompt: \|` all still work. - Missing `prompt_file` refs log + skip the schedule (not fatal) — fail loud so bugs surface during deployment rather than silent-drop. - Inline-template mode (POST /org/import with raw JSON body, no `dir`) errors cleanly when a file ref is used — can't resolve files without a base dir, surface that rather than guessing. ## Test plan - [x] `go build ./...` clean - [x] `go test -run 'TestResolvePromptRef\|TestOrgYAML' ./internal/handlers/` — 10 tests pass - [x] `python -c "yaml.safe_load(...)"` on the edited org.yaml — parses - [ ] Post-merge: deploy platform rebuild, run `POST /org/import` against a fresh workspace, verify Documentation Specialist's /configs/config.yaml contains the initial_prompt body and workspace_schedules rows contain the cron prompts (phantom-success check: grep the actual content, not just the row count). Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 00:32:09 -07:00
Hongming Wang	50990c79f7	feat(org-templates): Phase 1 — externalize prompt bodies to sibling files (#389 ) Part 1 of 4 in the scalability refactor. Each role can now keep its initial_prompt / idle_prompt / schedule prompts as sibling .md files under files_dir/; inline YAML literals still work for backwards-compat. ## What changes Platform (org.go importer): - `OrgWorkspace` gains `InitialPromptFile`, `IdlePrompt`, `IdlePromptFile`, `IdleIntervalSeconds`. The idle_* fields were previously dropped by the org importer entirely — struct didn't declare them — which is why engineer idle_prompts never propagated from org.yaml to live /configs (I've been manually docker-cp'ing them in every maintenance cron). - `OrgSchedule` gains `PromptFile`. Hourly/weekly cron prompts are the largest bodies in org.yaml (1-5 KB each) and get resolved at import time just like initial_prompt. - `OrgDefaults` gains the same idle_* + _file fields for org-wide fallback. - New `resolvePromptRef(inline, fileRef, orgBaseDir, filesDir)` helper — the single chokepoint for inline-vs-file resolution. Inline wins when both are set. File refs route through `resolveInsideRoot` so a crafted ref can't escape the org template directory (same traversal defense as files_dir). - `createWorkspaceTree` now injects idle_prompt + idle_interval_seconds into the workspace's config.yaml (previously missing — that's the second half of the idle-prompt propagation bug). Tests:* - `org_prompt_ref_test.go` — 10 cases: inline-wins, file-read-when-empty, both-empty, defaults-level resolution, inline-template mode errors, traversal rejection (via file ref AND via files_dir), missing-file errors, and YAML-unmarshal parsing for each new field. Proof migration: - Documentation Specialist (biggest role at 6.9 KB of prompts) moves from inline YAML to `documentation-specialist/{initial-prompt.md, schedules/daily-docs-sync.md, schedules/weekly-terminology-audit.md}`. - org.yaml drops 1801 → 1687 lines (-6.3%) from just this one role. ## Why this matters org.yaml is 108 KB of which 67 KB (62%) is prompt text. At the current 12-role template size that's already unreadable; the marketing + triage- operator additions pushed it to 1801 lines. The 4-phase refactor aims: - Phase 1 (this PR): platform support + 1 role proof. - Phase 2: migrate remaining ~20 roles to file refs. Target: org.yaml at ~600 lines of pure structural scaffolding. - Phase 3: YAML `!include` preprocessor — split org.yaml into teams/{research,dev,marketing,ops}.yaml shards. - Phase 4: per-workspace atomization — each role gets its own workspace.yaml manifest; org.yaml composes them. ## Backwards compatibility - Inline `initial_prompt: \|` / `prompt: \|` / `idle_prompt: \|` all still work. - Missing `prompt_file` refs log + skip the schedule (not fatal) — fail loud so bugs surface during deployment rather than silent-drop. - Inline-template mode (POST /org/import with raw JSON body, no `dir`) errors cleanly when a file ref is used — can't resolve files without a base dir, surface that rather than guessing. ## Test plan - [x] `go build ./...` clean - [x] `go test -run 'TestResolvePromptRef\|TestOrgYAML' ./internal/handlers/` — 10 tests pass - [x] `python -c "yaml.safe_load(...)"` on the edited org.yaml — parses - [ ] Post-merge: deploy platform rebuild, run `POST /org/import` against a fresh workspace, verify Documentation Specialist's /configs/config.yaml contains the initial_prompt body and workspace_schedules rows contain the cron prompts (phantom-success check: grep the actual content, not just the row count). Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-16 00:32:09 -07:00
Hongming Wang	0bcebff908	config(org): add Telegram to Dev Lead and Research Lead (#385 ) * feat(adapters): add gemini-cli runtime adapter (closes #332) Adds a `gemini-cli` workspace runtime backed by Google's Gemini CLI (@google/gemini-cli, ~101k ★, Apache 2.0). Mirrors the claude-code adapter pattern: Docker image installs the CLI, CLIAgentExecutor drives the subprocess, A2A MCP tools wire via ~/.gemini/settings.json. Changes: - workspace-template/adapters/gemini_cli/ — new adapter (Dockerfile, adapter.py, __init__.py, requirements.txt); setup() seeds GEMINI.md from system-prompt.md and injects A2A MCP server into settings.json - workspace-template/cli_executor.py — adds gemini-cli to RUNTIME_PRESETS (--yolo flag, -p prompt, --model, GEMINI_API_KEY env auth); adds mcp_via_settings preset flag to skip --mcp-config injection for runtimes that own their own settings file - workspace-configs-templates/gemini-cli/ — default config.yaml + system-prompt.md template - tests/test_adapters.py — adds gemini-cli to expected adapter set - CLAUDE.md — documents new runtime row in the image table Requires: GEMINI_API_KEY global secret. Build: bash workspace-template/build-all.sh gemini-cli Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(provisioner): add gemini-cli to RuntimeImages map Without this entry, POST /workspaces with runtime:gemini-cli falls back to workspace-template:langgraph (wrong image, missing gemini dep) instead of workspace-template:gemini-cli. Every runtime MUST have an entry here. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * config(org): add Telegram to Dev Lead and Research Lead (closes #383) Completes leadership-tier Telegram coverage: PM ✓ DevOps ✓ Security ✓ → Dev Lead ✓ Research Lead ✓ Both roles produce high-value async output (architecture decisions, eco-watch summaries) that was invisible until the user polled the canvas. Same bot_token/chat_id secrets as the other three roles — no new credentials required. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: DevOps Engineer <devops@molecule.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 00:00:10 -07:00
Hongming Wang	eaecdb5c46	config(org): add Telegram to Dev Lead and Research Lead (#385 ) * feat(adapters): add gemini-cli runtime adapter (closes #332) Adds a `gemini-cli` workspace runtime backed by Google's Gemini CLI (@google/gemini-cli, ~101k ★, Apache 2.0). Mirrors the claude-code adapter pattern: Docker image installs the CLI, CLIAgentExecutor drives the subprocess, A2A MCP tools wire via ~/.gemini/settings.json. Changes: - workspace-template/adapters/gemini_cli/ — new adapter (Dockerfile, adapter.py, __init__.py, requirements.txt); setup() seeds GEMINI.md from system-prompt.md and injects A2A MCP server into settings.json - workspace-template/cli_executor.py — adds gemini-cli to RUNTIME_PRESETS (--yolo flag, -p prompt, --model, GEMINI_API_KEY env auth); adds mcp_via_settings preset flag to skip --mcp-config injection for runtimes that own their own settings file - workspace-configs-templates/gemini-cli/ — default config.yaml + system-prompt.md template - tests/test_adapters.py — adds gemini-cli to expected adapter set - CLAUDE.md — documents new runtime row in the image table Requires: GEMINI_API_KEY global secret. Build: bash workspace-template/build-all.sh gemini-cli Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(provisioner): add gemini-cli to RuntimeImages map Without this entry, POST /workspaces with runtime:gemini-cli falls back to workspace-template:langgraph (wrong image, missing gemini dep) instead of workspace-template:gemini-cli. Every runtime MUST have an entry here. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * config(org): add Telegram to Dev Lead and Research Lead (closes #383) Completes leadership-tier Telegram coverage: PM ✓ DevOps ✓ Security ✓ → Dev Lead ✓ Research Lead ✓ Both roles produce high-value async output (architecture decisions, eco-watch summaries) that was invisible until the user polled the canvas. Same bot_token/chat_id secrets as the other three roles — no new credentials required. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: DevOps Engineer <devops@molecule.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-16 00:00:10 -07:00
Hongming Wang	df5821a251	chore(handoff): triage-operator role + agent handoff package Wraps up a ~100-tick autonomous triage session by converting the prior operator's institutional knowledge into standing, checked-in artifacts so the next team picking up the hourly PR + issue cycle can drop in without re-discovering everything from scratch. ## New role: Triage Operator Peer to Dev Lead, Research Lead, Documentation Specialist under PM. Owns the 7-gate PR verification + issue-pickup cycle across both molecule-monorepo and molecule-controlplane. NOT an engineer — never writes logic, never makes design calls. Mechanical fixes on other people's branches + verified-merge only. Runs on cron `17 * * * *`. On first boot reads four handoff files + the last 20 lines of cron-learnings.jsonl, waits for the scheduled tick (no first-boot triage — known stale-state footgun). ## Files org-templates/molecule-dev/triage-operator/ - system-prompt.md (48 lines) — role prompt loaded at boot. Standing rules, verification discipline, escalation paths. - philosophy.md (135 lines) — 10 principles each tied to a real incident. Rule 2 ("tool succeeded ≠ work done") references the WorkOS refresh-token + missing-migration saga. Rule 3 (authority verification) references PR #370 CEO directive hold. - playbook.md (234 lines) — step-by-step tick flow (Step 0 guards → 1 list → 2 seven-gate → 3 docs sync → 4 issue pickup → 5 report). Expected 5–30 min wall-clock. When-not-to-triage. - handoff-notes.md (146 lines) — point-in-time state for the NEXT operator arriving fresh. 15 PRs merged this session, in-flight items, design-call backlog with recommendations per issue. - SKILL.md (152 lines) — installable skill spec. Invocation, inputs, outputs, required composed skills, edge cases, output format. .claude/AGENT_HANDOFF.md (206 lines) — top-level handoff for any Claude Code agent working this repo (not just the triage operator). The 10 principles (one-liners), communication style the user expects, currently-live state, open items, what NOT to do, break- glass escalation conditions. Points at triage-operator/philosophy.md for full incident context. ## Wiring org.yaml gains a Triage Operator workspace block under PM with: - tier: 3, model: opus - 8 plugins (careful-bash, session-context, cron-learnings, code-review, cross-vendor-review, llm-judge, update-docs, hitl) - Hourly cron at `:17` with the full Step 0–5 flow inline as prompt - canvas position (1150, 250) — peer to Documentation Specialist ## Why this ships now The 30-min manual triage cron was cancelled per CEO direction. The role moves to another team. Without this handoff package they'd be rediscovering the same incident-classes I shipped fixes for (#318 fail-open, #327 cross-tenant decrypt, #351 tokenless grace, WorkOS refresh-token saga, missing migration runner). The philosophy file gives them the scar tissue in ~10 min of reading; the playbook gives them the steps; the SKILL gives them an invocable entry point. No code changes outside org.yaml. Existing TestPlugins_UnionWithDefaults still passes (verified in platform test run). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 23:41:01 -07:00
Hongming Wang	2e43bb7271	chore(handoff): triage-operator role + agent handoff package Wraps up a ~100-tick autonomous triage session by converting the prior operator's institutional knowledge into standing, checked-in artifacts so the next team picking up the hourly PR + issue cycle can drop in without re-discovering everything from scratch. ## New role: Triage Operator Peer to Dev Lead, Research Lead, Documentation Specialist under PM. Owns the 7-gate PR verification + issue-pickup cycle across both molecule-monorepo and molecule-controlplane. NOT an engineer — never writes logic, never makes design calls. Mechanical fixes on other people's branches + verified-merge only. Runs on cron `17 * * * *`. On first boot reads four handoff files + the last 20 lines of cron-learnings.jsonl, waits for the scheduled tick (no first-boot triage — known stale-state footgun). ## Files org-templates/molecule-dev/triage-operator/ - system-prompt.md (48 lines) — role prompt loaded at boot. Standing rules, verification discipline, escalation paths. - philosophy.md (135 lines) — 10 principles each tied to a real incident. Rule 2 ("tool succeeded ≠ work done") references the WorkOS refresh-token + missing-migration saga. Rule 3 (authority verification) references PR #370 CEO directive hold. - playbook.md (234 lines) — step-by-step tick flow (Step 0 guards → 1 list → 2 seven-gate → 3 docs sync → 4 issue pickup → 5 report). Expected 5–30 min wall-clock. When-not-to-triage. - handoff-notes.md (146 lines) — point-in-time state for the NEXT operator arriving fresh. 15 PRs merged this session, in-flight items, design-call backlog with recommendations per issue. - SKILL.md (152 lines) — installable skill spec. Invocation, inputs, outputs, required composed skills, edge cases, output format. .claude/AGENT_HANDOFF.md (206 lines) — top-level handoff for any Claude Code agent working this repo (not just the triage operator). The 10 principles (one-liners), communication style the user expects, currently-live state, open items, what NOT to do, break- glass escalation conditions. Points at triage-operator/philosophy.md for full incident context. ## Wiring org.yaml gains a Triage Operator workspace block under PM with: - tier: 3, model: opus - 8 plugins (careful-bash, session-context, cron-learnings, code-review, cross-vendor-review, llm-judge, update-docs, hitl) - Hourly cron at `:17` with the full Step 0–5 flow inline as prompt - canvas position (1150, 250) — peer to Documentation Specialist ## Why this ships now The 30-min manual triage cron was cancelled per CEO direction. The role moves to another team. Without this handoff package they'd be rediscovering the same incident-classes I shipped fixes for (#318 fail-open, #327 cross-tenant decrypt, #351 tokenless grace, WorkOS refresh-token saga, missing migration runner). The philosophy file gives them the scar tissue in ~10 min of reading; the playbook gives them the steps; the SKILL gives them an invocable entry point. No code changes outside org.yaml. Existing TestPlugins_UnionWithDefaults still passes (verified in platform test run). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 23:41:01 -07:00
Hongming Wang	b2e1631640	feat(org-templates): add 7-role marketing team sub-tree (#373 ) Add Marketing Lead + 6 reports as a peer sub-tree of PM under the CEO: DevRel Engineer, Product Marketing Manager, Content Marketer, Community Manager, SEO Growth Analyst, Social Media / Brand. - Marketing Lead: tier-3 Opus CMO-equivalent with a 5-min orchestrator pulse (minutes 4/9/14/... offset from Dev Lead's 2/7/12/...) that dispatches cross-role work, reviews drafts, and routes cross-team asks back to PM. - DevRel + PMM: tier-3 Opus (technical writing + positioning judgment). Each has an idle_prompt for proactive issue-claim plus an hourly evolution cron (DevRel = sample-coverage audit, PMM = competitor diff against docs/ecosystem-watch.md). - Content / Community / SEO / Social: tier-2 Sonnet with idle_prompts for backlog-pull (matches the #205 idle-loop pattern proven on Technical Researcher + Market Analyst + Competitive Intelligence). Each has an hourly cron tuned to its surface. - category_routing gets 6 new keys (content, positioning, community, growth, social, devrel) so audit_summary messages fan out correctly. - Canvas positions lay out the marketing cluster to the right of PM/Dev Lead (x=1000-1300, y=50/250/400) so the graph stays readable. Each role also gets a system-prompt.md under its files_dir with responsibilities, team interfaces, conventions, and self-review gates (molecule-skill-llm-judge or molecule-hitl depending on risk). Per CEO directive 2026-04-16 ("comprehensive marketing team"). This is PR 1 of 2 — follow-up will add cross-tree A2A conventions and wire DevRel ↔ Backend Engineer / PMM ↔ Competitive Intelligence delegations. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 23:20:04 -07:00
Hongming Wang	592fe6d7f7	feat(org-templates): add 7-role marketing team sub-tree (#373 ) Add Marketing Lead + 6 reports as a peer sub-tree of PM under the CEO: DevRel Engineer, Product Marketing Manager, Content Marketer, Community Manager, SEO Growth Analyst, Social Media / Brand. - Marketing Lead: tier-3 Opus CMO-equivalent with a 5-min orchestrator pulse (minutes 4/9/14/... offset from Dev Lead's 2/7/12/...) that dispatches cross-role work, reviews drafts, and routes cross-team asks back to PM. - DevRel + PMM: tier-3 Opus (technical writing + positioning judgment). Each has an idle_prompt for proactive issue-claim plus an hourly evolution cron (DevRel = sample-coverage audit, PMM = competitor diff against docs/ecosystem-watch.md). - Content / Community / SEO / Social: tier-2 Sonnet with idle_prompts for backlog-pull (matches the #205 idle-loop pattern proven on Technical Researcher + Market Analyst + Competitive Intelligence). Each has an hourly cron tuned to its surface. - category_routing gets 6 new keys (content, positioning, community, growth, social, devrel) so audit_summary messages fan out correctly. - Canvas positions lay out the marketing cluster to the right of PM/Dev Lead (x=1000-1300, y=50/250/400) so the graph stays readable. Each role also gets a system-prompt.md under its files_dir with responsibilities, team interfaces, conventions, and self-review gates (molecule-skill-llm-judge or molecule-hitl depending on risk). Per CEO directive 2026-04-16 ("comprehensive marketing team"). This is PR 1 of 2 — follow-up will add cross-tree A2A conventions and wire DevRel ↔ Backend Engineer / PMM ↔ Competitive Intelligence delegations. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 23:20:04 -07:00
rabbitblood	90d68ca039	feat(template): engineers pick up issues proactively (CEO 2026-04-16 directive) CEO directive verbatim: "devs should pick up issues and declare that its assigned to them, PM and leaders regularly check in. dont just rely on outside reviewer". Adds `idle_prompt` + `idle_interval_seconds: 600` to Frontend Engineer, Backend Engineer, and DevOps Engineer. Each engineer now polls open GH issues matching its specialty, claims unassigned ones via `gh issue edit --add-assignee @me`, leaves a public comment declaring the pickup, and commits memory to prevent double-pickup on the next tick. Previously engineers were reactive-only per the #159 orchestrator/worker split. The CEO is correcting that: devs should be a true self-organizing unit, not a work-queue that only advances when an outside reviewer dispatches. ## Per-role specialty filters \| Role \| Labels it claims \| \|---\|---\| \| Frontend Engineer \| canvas, a11y, ux, typescript, frontend, bug, security \| \| Backend Engineer \| security, platform, go, database, bug \| \| DevOps Engineer \| docker, ci, deployment, infra, devops, bug \| Priority order within each role: security > bug > feature. ## Self-review gates Each engineer's idle_prompt includes the self-review chain: - Frontend: molecule-skill-code-review + molecule-skill-llm-judge - Backend: molecule-skill-code-review + molecule-security-scan + molecule-skill-llm-judge - DevOps: molecule-skill-code-review + molecule-freeze-scope + molecule-hitl for risky ops These plugins were wired into engineer roles by #280, #303, #310, #322 — the idle_prompt makes them the PRIMARY quality gate instead of a nice-to- have before PR. Matches the "team self-regulates, don't rely on outside reviewer" spirit. ## Hard rules (same shape as researcher idle_prompts from #216/#321) - Max 1 claim per tick (1 `gh issue edit --add-assignee` call) - Never take someone else's assigned issue - Under 90 seconds wall-clock for the claim + plan step - Don't double-pick: check `task-assigned:<role>` memory first - No busy-work fabrication: write "<role>-idle HH:MM — no work" if nothing matches ## What this does NOT change - Leaders' orchestrator pulses still dispatch (#159) — this is the TAIL pickup, not the primary dispatch path. Dev Lead still prioritizes via its own pulse. - PR merging still goes through reviewer per `feedback_never_merge_prs.md`. This directive is about the QUALITY GATE (team self-review, peer review via Dev Lead's pulse) not about bypassing merge approval. - Destructive/irreversible ops still need explicit human ack via molecule-hitl's @requires_approval decorator. ## Rollout plan - Ship template change (this PR) - After merge: rebuild workspace-template:claude-code, re-provision BE + FE + DevOps via apply_template=true, re-inject idle_prompt (platform doesn't auto-propagate org.yaml to live configs — tracked separately) - Measure: 24h of activity_logs. Should see `a2a_receive` events every 10 min per engineer, response bodies mentioning claim decisions or idle-clean states, and `gh issue edit` events showing up as assignees. ## Related - `feedback_devs_pick_up_issues_leaders_check_in.md` — memory saved last cycle - #159 orchestrator/worker split (leaders dispatch) - #216 / #321 researcher idle_prompts (same pattern applied to researchers) - `project_north_star_24_7.md` — team self-regulation is the north-star	2026-04-15 22:49:10 -07:00
rabbitblood	4f9ef2dd0e	feat(template): engineers pick up issues proactively (CEO 2026-04-16 directive) CEO directive verbatim: "devs should pick up issues and declare that its assigned to them, PM and leaders regularly check in. dont just rely on outside reviewer". Adds `idle_prompt` + `idle_interval_seconds: 600` to Frontend Engineer, Backend Engineer, and DevOps Engineer. Each engineer now polls open GH issues matching its specialty, claims unassigned ones via `gh issue edit --add-assignee @me`, leaves a public comment declaring the pickup, and commits memory to prevent double-pickup on the next tick. Previously engineers were reactive-only per the #159 orchestrator/worker split. The CEO is correcting that: devs should be a true self-organizing unit, not a work-queue that only advances when an outside reviewer dispatches. ## Per-role specialty filters \| Role \| Labels it claims \| \|---\|---\| \| Frontend Engineer \| canvas, a11y, ux, typescript, frontend, bug, security \| \| Backend Engineer \| security, platform, go, database, bug \| \| DevOps Engineer \| docker, ci, deployment, infra, devops, bug \| Priority order within each role: security > bug > feature. ## Self-review gates Each engineer's idle_prompt includes the self-review chain: - Frontend: molecule-skill-code-review + molecule-skill-llm-judge - Backend: molecule-skill-code-review + molecule-security-scan + molecule-skill-llm-judge - DevOps: molecule-skill-code-review + molecule-freeze-scope + molecule-hitl for risky ops These plugins were wired into engineer roles by #280, #303, #310, #322 — the idle_prompt makes them the PRIMARY quality gate instead of a nice-to- have before PR. Matches the "team self-regulates, don't rely on outside reviewer" spirit. ## Hard rules (same shape as researcher idle_prompts from #216/#321) - Max 1 claim per tick (1 `gh issue edit --add-assignee` call) - Never take someone else's assigned issue - Under 90 seconds wall-clock for the claim + plan step - Don't double-pick: check `task-assigned:<role>` memory first - No busy-work fabrication: write "<role>-idle HH:MM — no work" if nothing matches ## What this does NOT change - Leaders' orchestrator pulses still dispatch (#159) — this is the TAIL pickup, not the primary dispatch path. Dev Lead still prioritizes via its own pulse. - PR merging still goes through reviewer per `feedback_never_merge_prs.md`. This directive is about the QUALITY GATE (team self-review, peer review via Dev Lead's pulse) not about bypassing merge approval. - Destructive/irreversible ops still need explicit human ack via molecule-hitl's @requires_approval decorator. ## Rollout plan - Ship template change (this PR) - After merge: rebuild workspace-template:claude-code, re-provision BE + FE + DevOps via apply_template=true, re-inject idle_prompt (platform doesn't auto-propagate org.yaml to live configs — tracked separately) - Measure: 24h of activity_logs. Should see `a2a_receive` events every 10 min per engineer, response bodies mentioning claim decisions or idle-clean states, and `gh issue edit` events showing up as assignees. ## Related - `feedback_devs_pick_up_issues_leaders_check_in.md` — memory saved last cycle - #159 orchestrator/worker split (leaders dispatch) - #216 / #321 researcher idle_prompts (same pattern applied to researchers) - `project_north_star_24_7.md` — team self-regulation is the north-star	2026-04-15 22:49:10 -07:00
Hongming Wang	6b153ca3cb	chore(auditor): close #319 + #337 prompt drift on Security Auditor (#342 ) Two recent platform-level security changes (#319 channel_config encryption, #337 constant-time webhook_secret compare) were not reflected in the Security Auditor's system prompt or the schedule cron prompt. That meant the auditor wouldn't proactively look for the next instance of either class — a new credential field added to channel_config without being added to sensitiveFields, or a new secret comparison using raw `!=`, would slip through until a human happened to notice. Updated two files: 1. org-templates/molecule-dev/security-auditor/system-prompt.md Added two bullets to "What You Check": - Secret comparisons must use subtle.ConstantTimeCompare / crypto.timingSafeEqual (cites #337 as the repo's recent instance) - Secret storage at rest: any new channel_config credential field must be added to sensitiveFields and exercised in both the Encrypt (write) and Decrypt (read) boundary helpers, and the ec1: prefix must never leak into API responses (cites #319) 2. org-templates/molecule-dev/org.yaml Same two checks added to the Security Auditor's 12-hour cron prompt's "MANUAL REVIEW of every changed file" section. Wording is concrete enough to paste into a grep: "flag any `!=` / `==` / bytes.Equal against a user-supplied value that gates auth". Pure config / prompt — no code changes, no tests to write. YAML parse verified, TestPlugins_UnionWithDefaults still passes. Closes #342 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 21:24:34 -07:00
Hongming Wang	2da48dda13	chore(auditor): close #319 + #337 prompt drift on Security Auditor (#342 ) Two recent platform-level security changes (#319 channel_config encryption, #337 constant-time webhook_secret compare) were not reflected in the Security Auditor's system prompt or the schedule cron prompt. That meant the auditor wouldn't proactively look for the next instance of either class — a new credential field added to channel_config without being added to sensitiveFields, or a new secret comparison using raw `!=`, would slip through until a human happened to notice. Updated two files: 1. org-templates/molecule-dev/security-auditor/system-prompt.md Added two bullets to "What You Check": - Secret comparisons must use subtle.ConstantTimeCompare / crypto.timingSafeEqual (cites #337 as the repo's recent instance) - Secret storage at rest: any new channel_config credential field must be added to sensitiveFields and exercised in both the Encrypt (write) and Decrypt (read) boundary helpers, and the ec1: prefix must never leak into API responses (cites #319) 2. org-templates/molecule-dev/org.yaml Same two checks added to the Security Auditor's 12-hour cron prompt's "MANUAL REVIEW of every changed file" section. Wording is concrete enough to paste into a grep: "flag any `!=` / `==` / bytes.Equal against a user-supplied value that gates auth". Pure config / prompt — no code changes, no tests to write. YAML parse verified, TestPlugins_UnionWithDefaults still passes. Closes #342 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 21:24:34 -07:00
Hongming Wang	d3a7e4c8f9	chore(org): wire molecule-compliance + molecule-audit + molecule-freeze-scope (closes #322 ) Config-only YAML. CI green on all 6 checks (E2E cancel = run-supersession pattern). Adds missing plugin wiring: Security Auditor→compliance+audit, Backend→compliance, QA→compliance, DevOps→freeze-scope. Closes #322.	2026-04-15 21:13:26 -07:00
Hongming Wang	2eec33a279	chore(org): wire molecule-compliance + molecule-audit + molecule-freeze-scope (closes #322 ) Config-only YAML. CI green on all 6 checks (E2E cancel = run-supersession pattern). Adds missing plugin wiring: Security Auditor→compliance+audit, Backend→compliance, QA→compliance, DevOps→freeze-scope. Closes #322.	2026-04-15 21:13:26 -07:00
Hongming Wang	4d7b1f56de	chore(template): widen idle-loop to Market Analyst + Competitive Intelligence (wave 2) Expands autonomous orchestration reach to Market Analyst and Competitive Intelligence roles.	2026-04-15 20:29:41 -07:00
Hongming Wang	02cd80c5f6	chore(template): widen idle-loop to Market Analyst + Competitive Intelligence (wave 2) Expands autonomous orchestration reach to Market Analyst and Competitive Intelligence roles.	2026-04-15 20:29:41 -07:00
Hongming Wang	3252af6ea6	fix(template): Telegram channel for Security Auditor + DevOps Engineer (#246 #247 ) Closes #246 Closes #247 Critical security findings and CI build-break alerts are now pushed via Telegram instead of waiting for someone to manually check memory/logs.	2026-04-15 19:57:34 -07:00
Hongming Wang	c71bd04cf1	fix(template): Telegram channel for Security Auditor + DevOps Engineer (#246 #247 ) Closes #246 Closes #247 Critical security findings and CI build-break alerts are now pushed via Telegram instead of waiting for someone to manually check memory/logs.	2026-04-15 19:57:34 -07:00
Hongming Wang	ac8daf2f70	feat(template): add molecule-skill-llm-judge to Backend + Frontend Engineer (#310 ) Backend Engineer and Frontend Engineer were missing molecule-skill-llm-judge while Dev Lead, QA Engineer, and Security Auditor already have it. llm-judge lets engineers self-gate their PR against the issue body before requesting review, catching 'shipped the wrong thing' before Dev Lead sees it. No new plugins needed — already installed org-wide. Closes #310	2026-04-16 02:48:08 +00:00
Hongming Wang	af06c1e702	feat(template): add molecule-skill-llm-judge to Backend + Frontend Engineer (#310 ) Backend Engineer and Frontend Engineer were missing molecule-skill-llm-judge while Dev Lead, QA Engineer, and Security Auditor already have it. llm-judge lets engineers self-gate their PR against the issue body before requesting review, catching 'shipped the wrong thing' before Dev Lead sees it. No new plugins needed — already installed org-wide. Closes #310	2026-04-16 02:48:08 +00:00
Hongming Wang	ece45bbf45	fix(template): UIUX Designer cron from 15min to hourly (#306 ) Closes #306. The cron expression was "5,20,35,50 * * * " (every 15 min = 96 ticks/day) despite the schedule being named "Hourly UI/UX audit". Each tick launches Chromium, takes 8 screenshots, runs them through Claude vision, and delegates to PM — 768 vision calls/day from one workspace with no meaningful delta between ticks (canvas UI only changes on deploys). Changed to "5 * * *" (hourly, at :05 past the hour). 6x reduction in cost + noise. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 19:22:19 -07:00
Hongming Wang	dd10c0d1a2	fix(template): UIUX Designer cron from 15min to hourly (#306 ) Closes #306. The cron expression was "5,20,35,50 * * * " (every 15 min = 96 ticks/day) despite the schedule being named "Hourly UI/UX audit". Each tick launches Chromium, takes 8 screenshots, runs them through Claude vision, and delegates to PM — 768 vision calls/day from one workspace with no meaningful delta between ticks (canvas UI only changes on deploys). Changed to "5 * * *" (hourly, at :05 past the hour). 6x reduction in cost + noise. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 19:22:19 -07:00
Hongming Wang	d9065bcc4d	feat(template): add molecule-security-scan to Backend Engineer (#303 ) Closes #303. Surfaces CVE/secret scanning at dev time instead of waiting for the Security Auditor's 12h cron. Backend Engineer's plugin list: [molecule-hitl, molecule-skill-code-review, molecule-security-scan]. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 19:21:11 -07:00
Hongming Wang	3fefad4534	feat(template): add molecule-security-scan to Backend Engineer (#303 ) Closes #303. Surfaces CVE/secret scanning at dev time instead of waiting for the Security Auditor's 12h cron. Backend Engineer's plugin list: [molecule-hitl, molecule-skill-code-review, molecule-security-scan]. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 19:21:11 -07:00
Hongming Wang	ea12ff9761	feat(template): add molecule-skill-code-review to Frontend/Backend/DevOps Engineer (#280 ) Closes #280. Self-review rubric now runs on the same workspaces that raise PRs, not just on the reviewers. Dev Lead uses the same 16-criteria rubric in review, so catching issues pre-PR cuts the review loop. - Frontend Engineer: new plugins: [molecule-skill-code-review] - Backend Engineer: plugins extended from [molecule-hitl] to [molecule-hitl, molecule-skill-code-review] - DevOps Engineer: plugins extended from [molecule-hitl] to [molecule-hitl, molecule-skill-code-review] The issue didn't explicitly call out DevOps Engineer but the reasoning applies — DevOps Engineer writes Dockerfiles + CI workflows + infra scripts that Dev Lead reviews with the same rubric. Including here for consistency. Verified all 5 reviewer/engineer roles' plugin lists via walk-script: Dev Lead: [code-review, llm-judge] Frontend Eng: [code-review] ← NEW Backend Eng: [hitl, code-review] ← NEW DevOps Eng: [hitl, code-review] ← NEW Security Aud: [code-review, cross-vendor, llm-judge, security-scan, hitl] Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 16:55:24 -07:00
Hongming Wang	ed25fa11da	feat(template): add molecule-skill-code-review to Frontend/Backend/DevOps Engineer (#280 ) Closes #280. Self-review rubric now runs on the same workspaces that raise PRs, not just on the reviewers. Dev Lead uses the same 16-criteria rubric in review, so catching issues pre-PR cuts the review loop. - Frontend Engineer: new plugins: [molecule-skill-code-review] - Backend Engineer: plugins extended from [molecule-hitl] to [molecule-hitl, molecule-skill-code-review] - DevOps Engineer: plugins extended from [molecule-hitl] to [molecule-hitl, molecule-skill-code-review] The issue didn't explicitly call out DevOps Engineer but the reasoning applies — DevOps Engineer writes Dockerfiles + CI workflows + infra scripts that Dev Lead reviews with the same rubric. Including here for consistency. Verified all 5 reviewer/engineer roles' plugin lists via walk-script: Dev Lead: [code-review, llm-judge] Frontend Eng: [code-review] ← NEW Backend Eng: [hitl, code-review] ← NEW DevOps Eng: [hitl, code-review] ← NEW Security Aud: [code-review, cross-vendor, llm-judge, security-scan, hitl] Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 16:55:24 -07:00
Hongming Wang	bb366c13ba	feat(template): wire molecule-hitl + molecule-security-scan into roles (#266 , #275 ) Closes #266 and #275. Per-role install matrix matching the per-tick #266 triage comment. ## Added plugins \| Role \| Plugin \| Rationale \| \|---\|---\|---\| \| Backend Engineer \| molecule-hitl \| Scope includes destructive DB migrations + runtime config changes — @requires_approval stops unattended agents from shipping prod schema mutations. \| \| DevOps Engineer \| molecule-hitl \| Scope covers fly deploys + registry pushes + CI pipeline mutations — @requires_approval before destructive infra ops. \| \| Security Auditor \| molecule-hitl \| Gates public issue filing for critical findings; prevents false-positive spam of the tracker. \| \| Security Auditor \| molecule-security-scan \| Primary consumer of gosec/bandit/CVE scanning via builtin_tools/security_scan.py. Security Auditor system prompt already expects to run these tools; this wires them. \| ## Per-PR #71 semantics Each workspace's `plugins:` UNIONs with `defaults.plugins` — these additions don't drop any existing plugin. Security Auditor's list went from 3 → 5; Backend + DevOps Engineer now have a role-specific list layered on top of defaults. ## NOT adding (yet) Dev Lead / Research Lead / Technical Researcher / QA Engineer / UIUX Designer / PM / Documentation Specialist — none have destructive ops scope in the role description. If you want belt-and-suspenders HITL coverage I can extend this PR; leaving narrow for now. ## Test plan - [x] YAML parses cleanly (python3 -c 'import yaml; yaml.safe_load(...)') - [x] Three edited roles' plugins lists verified by walk-script - [ ] Next org re-import activates the plugins on each workspace container - [ ] Agents invoke request_approval / security_scan from their system prompts after re-import Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 16:21:58 -07:00
Hongming Wang	2dbb608723	feat(template): wire molecule-hitl + molecule-security-scan into roles (#266 , #275 ) Closes #266 and #275. Per-role install matrix matching the per-tick #266 triage comment. ## Added plugins \| Role \| Plugin \| Rationale \| \|---\|---\|---\| \| Backend Engineer \| molecule-hitl \| Scope includes destructive DB migrations + runtime config changes — @requires_approval stops unattended agents from shipping prod schema mutations. \| \| DevOps Engineer \| molecule-hitl \| Scope covers fly deploys + registry pushes + CI pipeline mutations — @requires_approval before destructive infra ops. \| \| Security Auditor \| molecule-hitl \| Gates public issue filing for critical findings; prevents false-positive spam of the tracker. \| \| Security Auditor \| molecule-security-scan \| Primary consumer of gosec/bandit/CVE scanning via builtin_tools/security_scan.py. Security Auditor system prompt already expects to run these tools; this wires them. \| ## Per-PR #71 semantics Each workspace's `plugins:` UNIONs with `defaults.plugins` — these additions don't drop any existing plugin. Security Auditor's list went from 3 → 5; Backend + DevOps Engineer now have a role-specific list layered on top of defaults. ## NOT adding (yet) Dev Lead / Research Lead / Technical Researcher / QA Engineer / UIUX Designer / PM / Documentation Specialist — none have destructive ops scope in the role description. If you want belt-and-suspenders HITL coverage I can extend this PR; leaving narrow for now. ## Test plan - [x] YAML parses cleanly (python3 -c 'import yaml; yaml.safe_load(...)') - [x] Three edited roles' plugins lists verified by walk-script - [ ] Next org re-import activates the plugins on each workspace container - [ ] Agents invoke request_approval / security_scan from their system prompts after re-import Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 16:21:58 -07:00
Hongming Wang	2362eb3a9e	chore(template): add YAML injection to Security Auditor check list (#248 ) Closes #248. Three instances of the same YAML-injection bug class (#221 name/role, #233 template path, #241 runtime/model) shipped in this repo over the last weeks. The common root cause is the Security Auditor's system prompt didn't list YAML injection as an explicit check class, so audits missed the pattern every time. Adds: - "YAML injection" to the 'Think like an attacker' list in How You Work - An explicit entry in What You Check with the three prior instances cited so future auditors see the pattern and the fix shape (double-quoted scalars or a proper YAML encoder) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 13:18:52 -07:00
Hongming Wang	e1ff890150	chore(template): add YAML injection to Security Auditor check list (#248 ) Closes #248. Three instances of the same YAML-injection bug class (#221 name/role, #233 template path, #241 runtime/model) shipped in this repo over the last weeks. The common root cause is the Security Auditor's system prompt didn't list YAML injection as an explicit check class, so audits missed the pattern every time. Adds: - "YAML injection" to the 'Think like an attacker' list in How You Work - An explicit entry in What You Check with the three prior instances cited so future auditors see the pattern and the fix shape (double-quoted scalars or a proper YAML encoder) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 13:18:52 -07:00

1 2 3

105 Commits