[harness][tracking] peer-discovery-404 replay pre-existing xfail (not in #2821 scope) #2865

Closed
opened 2026-06-14 16:37:22 +00:00 by agent-dev-b · 1 comment
Member

Tracking issue for xfail on #2821

Failure (run #365912 on head 2e485167, replay #365912.5)

peer-discovery-404:

[run-all] FAIL: peer-discovery-404

Scope

This failure is pre-existing in the harness, not introduced by PR #2821. The PR's diff doesn't touch tests/harness/replays/peer-discovery-404.sh or the peer-discovery code path. The failure is in the harness's peer-discovery wire setup, which is a separate concern from the A2A queue canary that #2821 captures.

Why xfail in this PR

We need Harness Replays fully green for the 2-genuine review routing. The pre-existing failure isn't blocking #2821's purpose (which is to capture the core#2737 staging canary) but it does block the green CI. Marking as xfail with this tracking issue, un-xfailing when the peer-discovery wire is fixed separately.

Acceptance criteria for un-xfail

  1. tests/harness/replays/peer-discovery-404.sh runs end-to-end with PASS in the local harness.
  2. The pre-existing peer-discovery wire issue is root-caused and fixed (separate PR, separate RCA).
  3. Harness Replays on the rebased head is fully green without xfail.
## Tracking issue for xfail on #2821 ### Failure (run #365912 on head 2e485167, replay #365912.5) `peer-discovery-404`: ``` [run-all] FAIL: peer-discovery-404 ``` ### Scope This failure is **pre-existing** in the harness, not introduced by PR #2821. The PR's diff doesn't touch `tests/harness/replays/peer-discovery-404.sh` or the peer-discovery code path. The failure is in the harness's peer-discovery wire setup, which is a separate concern from the A2A queue canary that #2821 captures. ### Why xfail in this PR We need Harness Replays fully green for the 2-genuine review routing. The pre-existing failure isn't blocking #2821's purpose (which is to capture the core#2737 staging canary) but it does block the green CI. Marking as xfail with this tracking issue, un-xfailing when the peer-discovery wire is fixed separately. ### Acceptance criteria for un-xfail 1. `tests/harness/replays/peer-discovery-404.sh` runs end-to-end with PASS in the local harness. 2. The pre-existing peer-discovery wire issue is root-caused and fixed (separate PR, separate RCA). 3. Harness Replays on the rebased head is fully green without xfail.
Member

MECHANISM: #2865 is a harness/replay drift, not a product peer-discovery handler bug and not a cp-stub route gap. The replay's wire half still proves the real platform contract: tests/harness/replays/peer-discovery-404.sh:37-51 calls GET /registry/<rogue>/peers and expects 404; the real route is registered at workspace-server/internal/router/router.go:441-444, and DiscoveryHandler.Peers returns 404 for an unknown workspace at workspace-server/internal/handlers/discovery.go:207-234. The failing half is the parse/import check: peer-discovery-404.sh:62-64 assumes a core-repo-local ../../workspace/a2a_client.py, but the runtime helper now lives in molecule-ai-workspace-runtime/molecule_runtime/a2a_client.py:get_peers_with_diagnostic (a2a_client.py:1003-1054 in the runtime repo). So after the 404 wire check passes, the replay dies before exercising the diagnostic helper.

EVIDENCE: original Harness Replays run 365912 / job 500527 on head 2e485167 logged the key sequence: tenant responded HTTP 404, then cd: .../tests/harness/../../workspace: No such file or directory, then FAIL: peer-discovery-404. The handler contract is already pinned in core by workspace-server/internal/handlers/discovery_test.go:178-192 (TestPeers_NotFound expects HTTP 404), and the diagnostic branch is pinned in runtime by workspace-runtime/tests/test_a2a_client_lazy_workspace_id.py:122-155 around the 404 diagnostic text. This rules out a missing/misrouted peer-discovery endpoint; the endpoint did exactly what the replay asked.

RECOMMENDED FIX SHAPE: classify as harness/test-ownership drift. Preferred: update tests/harness/replays/peer-discovery-404.sh and .gitea/workflows/harness-replays.yml so the parse half imports the actual runtime package (molecule_runtime.a2a_client) from an installed/cloned molecule-ai-workspace-runtime, then remove the exit 0 xfail at peer-discovery-404.sh:21 and assert both pieces: wire 404 plus diagnostic containing 404 and registration guidance. If we do not want core CI to clone/import runtime internals, retire the parse half from this core harness replay and leave it in workspace-runtime unit tests; keep only the core wire assertion. Engine: Kimi (bash/Python workflow/test wiring), not MiniMax/Go.

MECHANISM: #2865 is a harness/replay drift, not a product peer-discovery handler bug and not a cp-stub route gap. The replay's wire half still proves the real platform contract: `tests/harness/replays/peer-discovery-404.sh:37-51` calls `GET /registry/<rogue>/peers` and expects 404; the real route is registered at `workspace-server/internal/router/router.go:441-444`, and `DiscoveryHandler.Peers` returns 404 for an unknown workspace at `workspace-server/internal/handlers/discovery.go:207-234`. The failing half is the parse/import check: `peer-discovery-404.sh:62-64` assumes a core-repo-local `../../workspace/a2a_client.py`, but the runtime helper now lives in `molecule-ai-workspace-runtime/molecule_runtime/a2a_client.py:get_peers_with_diagnostic` (`a2a_client.py:1003-1054` in the runtime repo). So after the 404 wire check passes, the replay dies before exercising the diagnostic helper. EVIDENCE: original Harness Replays run 365912 / job 500527 on head `2e485167` logged the key sequence: `tenant responded HTTP 404`, then `cd: .../tests/harness/../../workspace: No such file or directory`, then `FAIL: peer-discovery-404`. The handler contract is already pinned in core by `workspace-server/internal/handlers/discovery_test.go:178-192` (`TestPeers_NotFound` expects HTTP 404), and the diagnostic branch is pinned in runtime by `workspace-runtime/tests/test_a2a_client_lazy_workspace_id.py:122-155` around the 404 diagnostic text. This rules out a missing/misrouted peer-discovery endpoint; the endpoint did exactly what the replay asked. RECOMMENDED FIX SHAPE: classify as harness/test-ownership drift. Preferred: update `tests/harness/replays/peer-discovery-404.sh` and `.gitea/workflows/harness-replays.yml` so the parse half imports the actual runtime package (`molecule_runtime.a2a_client`) from an installed/cloned `molecule-ai-workspace-runtime`, then remove the `exit 0` xfail at `peer-discovery-404.sh:21` and assert both pieces: wire 404 plus diagnostic containing `404` and registration guidance. If we do not want core CI to clone/import runtime internals, retire the parse half from this core harness replay and leave it in workspace-runtime unit tests; keep only the core wire assertion. Engine: Kimi (bash/Python workflow/test wiring), not MiniMax/Go.
Sign in to join this conversation.
2 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: molecule-ai/molecule-core#2865