hermes-agent

History

Trev 63d06dd93d fix(agent): downgrade xhigh→max on Anthropic pre-4.7 adaptive models Regression from #11161 (Claude Opus 4.7 migration, commit `0517ac3e`). The Opus 4.7 migration changed `ADAPTIVE_EFFORT_MAP["xhigh"]` from "max" (the pre-migration alias) to "xhigh" to preserve the new 4.7 effort level as distinct from max. This is correct for 4.7, but Opus/Sonnet 4.6 only expose 4 levels (low/medium/high/max) — sending "xhigh" there now 400s: BadRequestError [HTTP 400]: This model does not support effort level 'xhigh'. Supported levels: high, low, max, medium. Users who set reasoning_effort=xhigh as their default (xhigh is the recommended default for coding/agentic on 4.7 per the Anthropic migration guide) now 400 every request the moment they switch back to a 4.6 model via `/model` or config. Verified live against the Anthropic API on `anthropic==0.94.0`. Fix: make the mapping model-aware. Add `_supports_xhigh_effort()` predicate (matches 4-7/4.7 substrings, mirroring the existing `_supports_adaptive_thinking` / `_forbids_sampling_params` pattern). On pre-4.7 adaptive models, downgrade xhigh→max (the strongest effort those models accept, restoring pre-migration behavior). On 4.7+, keep xhigh as a distinct level. Per Anthropic's migration guide, xhigh is 4.7-only: https://platform.claude.com/docs/en/about-claude/models/migration-guide > Opus 4.7 effort levels: max, xhigh (new), high, medium, low. > Opus 4.6 effort levels: max, high, medium, low. SDK typing confirms: `anthropic.types.OutputConfigParam.effort: Literal[ "low", "medium", "high", "max"]` (v0.94.0 not yet updated for xhigh). ## Test plan Verified live on macOS 15.5 / anthropic==0.94.0: claude-opus-4-6 + effort=xhigh → output_config.effort=max → 200 OK claude-opus-4-7 + effort=xhigh → output_config.effort=xhigh → 200 OK claude-opus-4-6 + effort=max → output_config.effort=max → 200 OK claude-opus-4-7 + effort=max → output_config.effort=max → 200 OK `tests/agent/test_anthropic_adapter.py` — 120 pass (replaced 1 bugged test that asserted the broken behavior, added 1 for 4.7 preservation). Full adapter suite: 120 passed in 1.05s. Broader suite (agent + run_agent + cli/gateway reasoning): 2140 passed (2 pre-existing failures on clean upstream/main, unrelated). ## Platforms Tested on macOS 15.5. No platform-specific code paths touched.		2026-04-16 12:00:56 -07:00
..
__init__.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
anthropic_adapter.py	fix(agent): downgrade xhigh→max on Anthropic pre-4.7 adaptive models	2026-04-16 12:00:56 -07:00
auxiliary_client.py	fix(agent): complete Claude Opus 4.7 API migration	2026-04-16 10:48:20 -07:00
bedrock_adapter.py	feat: native AWS Bedrock provider via Converse API	2026-04-15 16:17:17 -07:00
context_compressor.py	fix(context_compressor): always keep last user message in tail to prevent active-task loss	2026-04-16 07:45:31 -07:00
context_engine.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
context_references.py	fix(agent): preserve quoted @file references with spaces	2026-04-10 13:05:01 -07:00
copilot_acp_client.py	fix: bridge tool-calls in copilot-acp adapter	2026-04-06 01:47:57 -07:00
credential_pool.py	fix(copilot): preserve base URL and gpt-5-mini routing	2026-04-15 15:04:14 -07:00
display.py	fix: remove context pressure warnings entirely (#11039 )	2026-04-16 06:44:23 -07:00
error_classifier.py	feat: native AWS Bedrock provider via Converse API	2026-04-15 16:17:17 -07:00
insights.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
manual_compression_feedback.py	fix(gateway): make manual compression feedback truthful	2026-04-10 21:16:53 -07:00
memory_manager.py	feat(honcho): context injection overhaul, 5-tool surface, cost safety, session isolation (#10619 )	2026-04-15 19:12:19 -07:00
memory_provider.py	refactor(memory): drop on_session_reset — commit-only is enough	2026-04-15 11:28:45 -07:00
model_metadata.py	fix(agent): complete Claude Opus 4.7 API migration	2026-04-16 10:48:20 -07:00
models_dev.py	feat: add Ollama Cloud as built-in provider	2026-04-16 02:22:09 -07:00
nous_rate_guard.py	fix: Nous Portal rate limit guard — prevent retry amplification (#10568 )	2026-04-15 16:31:48 -07:00
prompt_builder.py	fix(prompt): list all supported Telegram markdown formatting	2026-04-15 17:54:13 -07:00
prompt_caching.py	fix(prompt-caching): skip top-level cache_control on role:tool for OpenRouter	2026-03-21 16:54:43 -07:00
rate_limit_tracker.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
redact.py	fix(security): add JWT token and Discord mention redaction (#10547 )	2026-04-15 16:08:52 -07:00
retry_utils.py	feat(agent): add jittered retry backoff	2026-04-08 00:41:36 -07:00
skill_commands.py	fix: use absolute skill_dir for external skills (#10313 ) (#10587 )	2026-04-15 17:22:55 -07:00
skill_utils.py	feat(plugins): namespaced skill registration for plugin skill bundles	2026-04-14 10:42:58 -07:00
smart_model_routing.py	fix: UTF-8 config encoding, pairing hint, credential_pool key, header normalization (#7174 )	2026-04-10 05:33:48 -07:00
subdirectory_hints.py	fix(agent): catch PermissionError in subdirectory hint discovery	2026-04-09 03:10:30 -07:00
title_generator.py	fix: title_generator no longer logs as 'compression' task	2026-04-12 04:17:18 -07:00
trajectory.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
usage_pricing.py	feat: native AWS Bedrock provider via Converse API	2026-04-15 16:17:17 -07:00