fix(synth-e2e): branch MODEL_SLUG by runtime so langgraph gets colon-form

The original script hardcoded `MODEL_SLUG="openai/gpt-4o"` (slash) and claimed "non-hermes runtimes ignore the prefix" — wrong for langgraph, which delegates model resolution to langchain's `init_chat_model`. That function requires `<provider>:<model>` (colon) and treats slash-form as OpenRouter routing, falling through without auth even when OPENAI_API_KEY is set. Surfaced 2026-05-03 after the a2a-sdk v1 contract bugs (PR #2558+#2563+#2567) cleared the masking layers — synth-E2E firing 2026-05-03T12:14 returned a properly-shaped task with state=failed + "Could not resolve authentication method" inside the agent body. continuous-synth-e2e.yml defaults E2E_RUNTIME=langgraph for the cron, so every firing hit this. Hermes still gets the slash-form it needs; claude-code uses the entry-id pattern. Adds E2E_MODEL_SLUG override for operator-dispatched runs that want to pin a specific slug. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-03 05:17:55 -07:00 · 2026-05-03 05:17:55 -07:00 · cbc69f5e7e
commit cbc69f5e7e
parent c71f641b12
1 changed files with 36 additions and 9 deletions
--- a/tests/e2e/test_staging_full_saas.sh
+++ b/tests/e2e/test_staging_full_saas.sh
@ -352,15 +352,42 @@ print(json.dumps({
 ")
 fi

-# Model slug MUST be provider-prefixed for hermes — the template's
-# derive-provider.sh parses the slug prefix (`openai/…`, `anthropic/…`,
-# `minimax/…`) to set HERMES_INFERENCE_PROVIDER at install time. A bare
-# "gpt-4o" has no prefix → provider falls back to hermes auto-detect →
-# picks Anthropic default → tries Anthropic API with the OpenAI key →
-# 401 on A2A. Same trap that trapped prod users in PR #1714. We pin
-# "openai/gpt-4o" here because the E2E's secret is always the OpenAI
-# key; non-hermes runtimes ignore the prefix.
-MODEL_SLUG="openai/gpt-4o"
+# Model slug format depends on the runtime — different model resolvers
+# parse it differently:
+#
+#   hermes      → "openai/gpt-4o"  (slash-form: derive-provider.sh splits
+#                                    on the prefix to set
+#                                    HERMES_INFERENCE_PROVIDER. Bare
+#                                    "gpt-4o" falls through to Anthropic
+#                                    default + 401, see PR #1714.)
+#
+#   langgraph   → "openai:gpt-4o"  (colon-form: langchain init_chat_model
+#                                    requires "<provider>:<model>".
+#                                    Slash-form was misinterpreted as
+#                                    OpenRouter routing → fell through
+#                                    without auth, surfaced 2026-05-03
+#                                    after the a2a-sdk v1 contract bugs
+#                                    PR #2558+#2563+#2567 cleared the
+#                                    masking layers.)
+#
+#   claude-code → "sonnet"         (entry-id form: claude-code template's
+#                                    config.yaml uses bare model names,
+#                                    auth comes via CLAUDE_CODE_OAUTH_TOKEN
+#                                    or ANTHROPIC_API_KEY rather than the
+#                                    slug.)
+#
+# When E2E_MODEL_SLUG is set, it overrides this dispatch — useful when an
+# operator dispatches the workflow to test a specific slug.
+if [ -n "${E2E_MODEL_SLUG:-}" ]; then
+  MODEL_SLUG="$E2E_MODEL_SLUG"
+else
+  case "$RUNTIME" in
+    hermes)      MODEL_SLUG="openai/gpt-4o" ;;
+    langgraph)   MODEL_SLUG="openai:gpt-4o" ;;
+    claude-code) MODEL_SLUG="sonnet" ;;
+    *)           MODEL_SLUG="openai/gpt-4o" ;;  # safest fallback (matches hermes)
+  esac
+fi

 log "5/11 Provisioning parent workspace (runtime=$RUNTIME)..."
 PARENT_RESP=$(tenant_call POST /workspaces \