fix(e2e): use hermes-registered model in canvas staging setup (#2225) #2230

Closed
core-be wants to merge 1 commit from fix/2225-e2e-canvas-stale-hermes-model into main
Member

Summary

  • Closes #2225.
  • Fixes the E2E Staging Canvas (Playwright) main-red failure introduced when the provider-registry SSOT (internal#718 P4, commit eacb8183) flipped the (runtime, model) gate from WARN to a hard-reject 422.
  • canvas/e2e/staging-setup.ts was still requesting runtime: hermes, model: gpt-4o, which is no longer in the hermes native set.

Change

  • model: gpt-4o → moonshot/kimi-k2.6 (listed in providers.yaml:778 under the hermes native set).
  • Updates the inline comment that references the old model pair.

Why this model

  • moonshot/kimi-k2.6 is a platform-managed model already used by other hermes fixtures after the P4 reconcile (see the workspace_test.go updates in eacb8183).
  • The workspace still boots into the pre-start credential-abort shape (no CP proxy env on staging), which the harness already tolerates as RENDERABLE per #2199; the tab-UI test does not exercise the agent.

Test plan

  • [ ] Re-run E2E Staging Canvas on this PR (the paths filter will trigger because canvas/e2e changed).
  • [ ] Verify workspace create returns 201, not 422.
  • [ ] Verify the Canvas tabs spec proceeds to the tab-rendering assertions.
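The fixture change and the first two test-plan checks can be sketched roughly as follows. This is only an illustration: the real contents of canvas/e2e/staging-setup.ts are not shown on this page, so the `WorkspaceCreateRequest` shape and the helper names `buildStagingWorkspaceRequest` and `assertWorkspaceCreated` are hypothetical. Only the runtime, the model names, and the 201/422 status codes come from the description above.

```typescript
// Hypothetical request shape; the real staging-setup.ts payload may carry more fields.
interface WorkspaceCreateRequest {
  name: string;
  runtime: string;
  model: string;
}

// Values taken from the PR description. The fixture previously used "gpt-4o",
// which is no longer in the hermes native set.
const HERMES_RUNTIME = "hermes";
const HERMES_FIXTURE_MODEL = "moonshot/kimi-k2.6";

// Builds the workspace-create payload with a hermes-registered model.
function buildStagingWorkspaceRequest(name: string): WorkspaceCreateRequest {
  return { name, runtime: HERMES_RUNTIME, model: HERMES_FIXTURE_MODEL };
}

// Mirrors the test plan: creation must return 201; a 422 means the
// (runtime, model) pair was rejected as unregistered.
function assertWorkspaceCreated(status: number): void {
  if (status === 422) {
    throw new Error("workspace create returned 422: (runtime, model) pair is not registered");
  }
  if (status !== 201) {
    throw new Error(`unexpected workspace create status: ${status}`);
  }
}
```

Keeping the runtime and model as named constants would make a later registry change a one-line fixture update, though whether the real setup file is organised this way is an assumption.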
core-be added 1 commit 2026-06-04 08:40:33 +00:00
fix(e2e): use hermes-registered model in canvas staging setup (#2225)
sop-tier-check / tier-check (pull_request_target) Successful in 5s
ci-arm64-advisory / fast-checks (pull_request) Waiting to run
CI / Python Lint & Test (pull_request) Successful in 6s
Block internal-flavored paths / Block forbidden paths (pull_request) Successful in 8s
CI / Detect changes (pull_request) Successful in 8s
Lint forbidden tenant-env keys / Scan for repo-host token write into tenant workspace surface (pull_request) Successful in 5s
E2E API Smoke Test / detect-changes (pull_request) Successful in 8s
Lint forbidden tenant-env keys / Scan workspace_secrets writers for forbidden env keys (pull_request) Successful in 6s
E2E Chat / detect-changes (pull_request) Successful in 9s
Harness Replays / detect-changes (pull_request) Successful in 9s
Handlers Postgres Integration / detect-changes (pull_request) Successful in 11s
CI / Platform (Go) (pull_request) Successful in 2s
sop-checklist / review-refire (pull_request_target) Has been skipped
E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 2s
E2E Chat / E2E Chat (pull_request) Successful in 3s
E2E Staging Canvas (Playwright) / detect-changes (pull_request) Successful in 14s
Harness Replays / Harness Replays (pull_request) Successful in 2s
qa-review / approved (pull_request_target) Failing after 7s
Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 3s
CI / Shellcheck (E2E scripts) (pull_request) Successful in 7s
sop-checklist / all-items-acked (pull_request) acked: 0/7 — missing: comprehensive-testing, local-postgres-e2e, staging-smoke, +4 — body-unfilled: comprehensive-testing, local-postgres-e2
sop-checklist / na-declarations (pull_request) N/A: (none)
security-review / approved (pull_request_target) Failing after 7s
sop-checklist / all-items-acked (pull_request_target) Successful in 6s
E2E Staging Canvas (Playwright) / Canvas tabs E2E (pull_request) Successful in 2s
Lint shellcheck (arm64 pilot) / shellcheck-arm64 (pilot) (pull_request) Successful in 14s
gate-check-v3 / gate-check (pull_request_target) Failing after 16s
Secret scan / Scan diff for credential-shaped strings (pull_request) Successful in 21s
lint-required-no-paths / lint-required-no-paths (pull_request) Successful in 1m16s
audit-force-merge / audit (pull_request_target) Has been skipped
CI / Canvas (Next.js) (pull_request) Successful in 6m16s
CI / Canvas Deploy Reminder (pull_request) Has been skipped
CI / all-required (pull_request) Successful in 2s
2576b56142
The provider-registry SSOT (internal#718 P4) hard-rejects unregistered
(runtime, model) pairs with 422 UNREGISTERED_MODEL_FOR_RUNTIME.
canvas/e2e/staging-setup.ts was still creating a workspace with
runtime=hermes + model=gpt-4o, which is no longer in the hermes native
set after the P4 flip (eacb8183). This caused every E2E Staging Canvas
run that passed the paths filter to fail at workspace creation.

Update the fixture to model=moonshot/kimi-k2.6, a model listed in the
hermes native provider set (providers.yaml:778). The workspace still
boots into the pre-start credential-abort shape (platform_managed without
CP proxy env on staging) which the harness already tolerates as
RENDERABLE per #2199 — the tab-UI test does not exercise the agent.
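
For readers unfamiliar with the gate, the WARN-to-hard-reject flip described above might look something like the sketch below. It is not the platform's implementation: the `GateMode` type, the `checkRuntimeModel` function, and the in-memory `NATIVE_MODELS` table are hypothetical stand-ins for the registry backed by providers.yaml. Only the runtime name, the two model names, the 422 status, and the UNREGISTERED_MODEL_FOR_RUNTIME code come from this commit message.

```typescript
// Hypothetical gate modes: "warn" models the pre-P4 behaviour, "reject" the post-P4 one.
type GateMode = "warn" | "reject";

interface GateResult {
  status: number;
  code?: string;
  warning?: string;
}

// Hypothetical registry contents: only the pair named in this commit is listed.
const NATIVE_MODELS: Record<string, ReadonlySet<string>> = {
  hermes: new Set(["moonshot/kimi-k2.6"]),
};

// Returns 201 for registered pairs. For unregistered pairs, "warn" still
// accepts the request but attaches a warning; "reject" returns 422.
function checkRuntimeModel(runtime: string, model: string, mode: GateMode): GateResult {
  const registered = NATIVE_MODELS[runtime]?.has(model) ?? false;
  if (registered) {
    return { status: 201 };
  }
  if (mode === "warn") {
    return { status: 201, warning: `unregistered model ${model} for runtime ${runtime}` };
  }
  return { status: 422, code: "UNREGISTERED_MODEL_FOR_RUNTIME" };
}
```

Under this sketch, `checkRuntimeModel("hermes", "gpt-4o", "reject")` yields the 422 that broke the fixture, while the same call in "warn" mode would have let the request through.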

Closes #2225.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Author
Member

Superseded by commit 6e682444 (PR #2229) which merged the same model-fix plus skip-if-token-absent onto main. Closing to avoid duplicate open PRs.

core-be closed this pull request 2026-06-04 08:54:40 +00:00
Some checks are pending

Required checks:
  • E2E API Smoke Test / E2E API Smoke Test (pull_request) Successful in 2s
  • Handlers Postgres Integration / Handlers Postgres Integration (pull_request) Successful in 3s
  • CI / all-required (pull_request) Successful in 2s
  • E2E Peer Visibility (literal MCP list_peers) / E2E Peer Visibility (pull_request)

Pull request closed


Reference: molecule-ai/molecule-core#2230