fix(e2e): increase liveness probe max_tokens from 4 to 32 (#2204) #2209
Reference in New Issue
Block a user
Delete Branch "fix/2204-liveness-probe-max-tokens"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
What
Increase the staging full-SaaS E2E liveness-probe
max_tokensfrom4to32.Why
Reasoning models (MiniMax M2/M2.7, Moonshot K2.6) can consume the entire 4-token budget on their internal reasoning/thinking preamble, leaving zero tokens for the actual
"ok"reply. This causes the liveness probe to return empty content and the canary to fail, even though the provider and workspace are healthy.How
One-line change in
test_staging_full_saas.shstep 8d.Test plan
Risk
Negligible. The liveness probe is a smoke test; 32 tokens is still tiny and will not materially increase cost or latency.
Rollback
Revert this commit.
Related
molecule-ai-workspace-template-claude-code(thinking-block extraction in executor)Reviewer notes