hermes-agent

Author	SHA1	Message	Date
0xbyt4	b00c5949fc	fix: suppress verbose logs during streaming TTS, improve hallucination filter, stop continuous mode on errors - Add _vprint() helper to suppress log output when stream_callback is active - Expand Whisper hallucination filter with multi-language phrases and regex pattern for repetitive text - Stop continuous voice mode when agent returns a failed result (e.g. 429 rate limit)	2026-03-14 14:26:55 +03:00
0xbyt4	3a1b35ed92	fix: voice mode race conditions, temp file leak, think tag parsing - Atomic check-and-set for _voice_recording flag with _voice_lock - Guard _voice_stop_and_transcribe against concurrent invocation - Remove premature flag clearing from Ctrl+R handler - Clean up temp WAV files in finally block (_play_via_tempfile) - Use buffer-level regex for <think> block filtering (handles chunked tags) - Prevent /voice on prompt accumulation on repeated calls - Include Groq in STT key error message	2026-03-14 14:26:55 +03:00
0xbyt4	7d4b4e95f1	feat: sync text display with TTS audio playback Move screen output from stream_callback to display_callback called by TTS consumer thread. Text now appears sentence-by-sentence in sync with audio instead of streaming ahead at LLM speed. Removes quiet_mode hack.	2026-03-14 14:26:55 +03:00
0xbyt4	179d9e1a22	feat: add streaming sentence-by-sentence TTS via ElevenLabs Stream audio to speaker as the agent generates tokens instead of waiting for the full response. First sentence plays within ~1-2s of agent starting to respond. - run_agent: add stream_callback to run_conversation/chat, streaming path in _interruptible_api_call accumulates chunks into mock ChatCompletion while forwarding content deltas to callback - tts_tool: add stream_tts_to_speaker() with sentence buffering, think block filtering, markdown stripping, ElevenLabs pcm_24000 streaming to sounddevice OutputStream - cli: wire up streaming TTS pipeline in chat(), detect elevenlabs provider + sounddevice availability, skip batch TTS when streaming is active, signal stop on interrupt Falls back to batch TTS for Edge/OpenAI providers or when elevenlabs/sounddevice are not available. Zero impact on non-voice mode (callback defaults to None).	2026-03-14 14:26:30 +03:00
0xbyt4	d7425343ee	fix: fix voice recording stuck in continuous mode - Track submitted state locally instead of using racy qsize() check - Allow Ctrl+R to stop recording even while agent is running - Add double-start guard to prevent concurrent recording attempts	2026-03-14 14:26:30 +03:00
0xbyt4	dad865e920	fix: fix silence detection bugs and add Phase 4 voice mode features Fix 3 critical bugs in silence detection: - Micro-pause tolerance now tracks dip duration (not time since speech start) - Peak RMS check in stop() prevents discarding recordings with real speech - Reduced min_speech_duration from 0.5s to 0.3s for reliable speech confirmation Phase 4 features: configurable silence params, visual audio level indicator, voice system prompt, tool call audio cues, TTS interrupt, continuous mode auto-restart, interruptable playback via Popen tracking.	2026-03-14 14:26:30 +03:00
0xbyt4	bfd9c97705	feat: add Phase 4 low-latency features for voice mode - Audio cues: beep on record start (880Hz), double beep on stop (660Hz) - Silence detection: auto-stop recording after 3s of silence (RMS-based) - Continuous mode: auto-restart recording after agent responds - Ctrl+R starts continuous mode, Ctrl+R during recording exits it - Waits for TTS to finish before restarting to avoid recording speaker - Tests: 7 new tests for beep generation and silence detection	2026-03-14 14:25:28 +03:00
0xbyt4	c23928d089	fix: improve voice mode robustness and add integration tests - Show TTS errors to user instead of silently logging - Improve markdown stripping: code blocks, URLs, links, horizontal rules - Fix stripping order: process markdown links before removing URLs - Add threading.Lock for voice state variables (cross-thread safety) - Add 14 CLI integration tests (markdown stripping, command parsing, thread safety) - Total: 47 voice-related tests	2026-03-14 14:25:28 +03:00
0xbyt4	ea5b89825a	fix: voice mode TTS playback and keybinding issues - Change record key from c-@ to c-r (Ctrl+R) for macOS compatibility - Add missing tempfile and time imports that caused silent TTS crash - Use MP3 output for CLI TTS playback (afplay doesn't handle OGG well) - Strip markdown formatting from text before sending to TTS - Remove duplicate transcript echo in voice pipeline	2026-03-14 14:25:28 +03:00
0xbyt4	ec32e9a540	feat: add Groq STT support and fix voice mode keybinding - Add multi-provider STT support (OpenAI > Groq fallback) in transcription_tools - Auto-correct model selection when provider doesn't support the configured model - Change voice record key from Ctrl+Space to Ctrl+R (macOS compatibility) - Fix duplicate transcript echo in voice pipeline - Add GROQ_API_KEY to .env.example	2026-03-14 14:25:28 +03:00
0xbyt4	1a6fbef8a9	feat: add voice mode with push-to-talk and TTS output for CLI Implements Issue #314 Phase 2 & 3: - /voice command to toggle voice mode (on/off/tts/status) - Ctrl+Space push-to-talk recording via sounddevice - Whisper STT transcription via existing transcription_tools - Optional TTS response playback via existing tts_tool - Visual indicators in prompt (recording/transcribing/voice) - 21 unit tests, all mocked (no real mic/API) - Optional deps: sounddevice, numpy (pip install hermes-agent[voice])	2026-03-14 14:25:28 +03:00
Wayne	41f22de20f	fix(cli): make TUI prompt and accent output skin-aware Salvaged from PR #932 by Wayne onto current main. Apply skin-aware prompt symbols and live prompt_toolkit color refresh, replace lingering hardcoded accent output with active-skin colors, keep ANSI-safe response rendering, preserve secret-capture and approval-prompt state handling, and add integration coverage for prompt state and style refresh behavior.	2026-03-14 03:12:52 -07:00
Teknium	1869e88169	Merge pull request #1256 from NousResearch/hermes/hermes-720acdad feat(security): add tirith pre-exec command scanning	2026-03-14 00:24:56 -07:00
sheeki003	375ce8a881	feat(security): add tirith pre-exec command scanning Integrate tirith as a pre-execution security scanner that detects homograph URLs, pipe-to-interpreter patterns, terminal injection, zero-width Unicode, and environment variable manipulation — threats the existing 50-pattern dangerous command detector doesn't cover. Architecture: gather-then-decide — both tirith and the dangerous command detector run before any approval prompt, preventing gateway force=True replay from bypassing one check when only the other was shown to the user. New files: - tools/tirith_security.py: subprocess wrapper with auto-installer, mandatory cosign provenance verification, non-blocking background download, disk-persistent failure markers with retryable-cause tracking (cosign_missing auto-clears when cosign appears on PATH) - tests/tools/test_tirith_security.py: 62 tests covering exit code mapping, fail_open, cosign verification, background install, HERMES_HOME isolation, and failure recovery - tests/tools/test_command_guards.py: 21 integration tests for the combined guard orchestration Modified files: - tools/approval.py: add check_all_command_guards() orchestrator, add allow_permanent parameter to prompt_dangerous_approval() - tools/terminal_tool.py: replace _check_dangerous_command with consolidated check_all_command_guards - cli.py: update _approval_callback for allow_permanent kwarg, call ensure_installed() at startup - gateway/run.py: iterate pattern_keys list on replay approval, call ensure_installed() at startup - hermes_cli/config.py: add security config defaults, split commented sections for independent fallback - cli-config.yaml.example: document tirith security config	2026-03-14 00:11:27 -07:00
Teknium	29176f302e	fix: sanitize chat payloads and provider precedence (#1253 ) fix: sanitize chat payloads and provider precedence	2026-03-14 00:09:14 -07:00
teknium1	163fa4a9d1	refactor(cli): implement approval locking mechanism to serialize concurrent requests - Introduced _approval_lock to ensure that approval prompts are handled sequentially, preventing state clobbering from parallel delegation subtasks. - Updated approval_callback and HermesCLI methods to utilize the lock for managing approval state and deadlines. - Added tests for the config bridging logic to ensure correct environment variable mapping from config.yaml.	2026-03-13 23:59:18 -07:00
Adavya Sharma	358dab52ce	fix: sanitize chat payloads and provider precedence	2026-03-13 23:59:12 -07:00
teknium1	253d54a9e1	fix(cli): make /new, /reset, and /clear start real fresh sessions Create a new session DB row when starting fresh from the CLI, reset the agent DB flush cursor and todo state, and update session timing/session ID bookkeeping so follow-up logging stays correct. Also update slash-command descriptions and add regression tests for /new, /reset, and /clear. Supersedes PR #899. Closes #641.	2026-03-13 21:53:54 -07:00
teknium1	206e56cc5e	fix: finish HERMES_HOME path cleanup - route CLI interrupt debug logging through HERMES_HOME - update the remaining channel_directory test to patch HERMES_HOME instead of Path.home()	2026-03-13 21:35:07 -07:00
0xIbra	437ec17125	fix(cli): respect HERMES_HOME in all remaining hardcoded ~/.hermes paths Several files resolved paths via Path.home() / ".hermes" or os.path.expanduser("~/.hermes/..."), bypassing the HERMES_HOME environment variable. This broke isolation when running multiple Hermes instances with distinct HERMES_HOME directories. Replace all hardcoded paths with calls to get_hermes_home() from hermes_cli.config, consistent with the rest of the codebase. Files fixed: - tools/process_registry.py (processes.json) - gateway/pairing.py (pairing/) - gateway/sticker_cache.py (sticker_cache.json) - gateway/channel_directory.py (channel_directory.json, sessions.json) - gateway/config.py (gateway.json, config.yaml, sessions_dir) - gateway/mirror.py (sessions/) - gateway/hooks.py (hooks/) - gateway/platforms/base.py (image_cache/, audio_cache/, document_cache/) - gateway/platforms/whatsapp.py (whatsapp/session) - gateway/delivery.py (cron/output) - agent/auxiliary_client.py (auth.json) - agent/prompt_builder.py (SOUL.md) - cli.py (config.yaml, images/, pastes/, history) - run_agent.py (logs/) - tools/environments/base.py (sandboxes/) - tools/environments/modal.py (modal_snapshots.json) - tools/environments/singularity.py (singularity_snapshots.json) - tools/tts_tool.py (audio_cache) - hermes_cli/status.py (cron/jobs.json, sessions.json) - hermes_cli/gateway.py (logs/, whatsapp session) - hermes_cli/main.py (whatsapp/session) Tests updated to use HERMES_HOME env var instead of patching Path.home(). Closes #892 (cherry picked from commit 78ac1bba43b8b74a934c6172f2c29bb4d03164b9)	2026-03-13 21:32:53 -07:00
kshitijk4poor	ccfbf42844	feat: secure skill env setup on load (core #688 ) When a skill declares required_environment_variables in its YAML frontmatter, missing env vars trigger a secure TUI prompt (identical to the sudo password widget) when the skill is loaded. Secrets flow directly to ~/.hermes/.env, never entering LLM context. Key changes: - New required_environment_variables frontmatter field for skills - Secure TUI widget (masked input, 120s timeout) - Gateway safety: messaging platforms show local setup guidance - Legacy prerequisites.env_vars normalized into new format - Remote backend handling: conservative setup_needed=True - Env var name validation, file permissions hardened to 0o600 - Redact patterns extended for secret-related JSON fields - 12 existing skills updated with prerequisites declarations - ~48 new tests covering skip, timeout, gateway, remote backends - Dynamic panel widget sizing (fixes hardcoded width from original PR) Cherry-picked from PR #723 by kshitijk4poor, rebased onto current main with conflict resolution. Fixes #688 Co-authored-by: kshitijk4poor <kshitijk4poor@users.noreply.github.com>	2026-03-13 03:14:04 -07:00
teknium1	f562d97f13	Enhance CLI output formatting with RichText support - Updated command output handling to use RichText for ANSI formatting. - Improved response display in chat console with RichText integration. - Ensured fallback for empty command outputs with a clear message.	2026-03-13 02:05:30 -07:00
Teknium	475dd58a8e	Merge PR #736 : feat(honcho): async writes, memory modes, session title integration, setup CLI Authored by erosika. Builds on #38 and #243. Adds async write support, configurable memory modes, context prefetch pipeline, 4 new Honcho tools (honcho_context, honcho_profile, honcho_search, honcho_conclude), full 'hermes honcho' CLI, session strategies, AI peer identity, recallMode A/B, gateway lifecycle management, and comprehensive docs. Cherry-picks fixes from PRs #831/#832 (adavyas). Co-authored-by: erosika <erosika@users.noreply.github.com> Co-authored-by: adavyas <adavyas@users.noreply.github.com>	2026-03-12 19:05:11 -07:00
Teknium	1bb8ed4495	chore: lower default compression threshold from 85% to 50% (#1096 ) * fix: ClawHub skill install — use /download ZIP endpoint The ClawHub API v1 version endpoint only returns file metadata (path, size, sha256, contentType) without inline content or download URLs. Our code was looking for inline content in the metadata, which never existed, causing all ClawHub installs to fail with: 'no inline/raw file content was available' Fix: Use the /api/v1/download endpoint (same as the official clawhub CLI) to download skills as ZIP bundles and extract files in-memory. Changes: - Add _download_zip() method that downloads and extracts ZIP bundles - Retry on 429 rate limiting with Retry-After header support - Path sanitization and binary file filtering for security - Keep _extract_files() as a fallback for inline/raw content - Also fix nested file lookup (version_data.version.files) * chore: lower default compression threshold from 85% to 50% Triggers context compression earlier — at 50% of the model's context window instead of 85%. Updated in all four places where the default is defined: context_compressor.py, cli.py, run_agent.py, config.py, and gateway/run.py.	2026-03-12 15:51:50 -07:00
Erosika	fefc709b2c	merge: resolve conflict with main in subagent interrupt test	2026-03-12 16:28:57 -04:00
Erosika	cd6e5e44e4	feat(honcho): show clickable session line on CLI startup Display a one-line Honcho session indicator with an OSC 8 terminal hyperlink after the banner. Also shown when /title remaps the session.	2026-03-12 12:30:42 -04:00
Teknium	e004c094ea	fix: use session_key instead of chat_id for adapter interrupt lookups * fix: use session_key instead of chat_id for adapter interrupt lookups monitor_for_interrupt() in _run_agent was using source.chat_id to query the adapter's has_pending_interrupt() and get_pending_message() methods. But the adapter stores interrupt events under build_session_key(source), which produces a different string (e.g. 'agent:main:telegram:dm' vs '123456'). This key mismatch meant the interrupt was never detected through the adapter path, which is the only active interrupt path for all adapter-based platforms (Telegram, Discord, Slack, etc.). The gateway-level interrupt path (in dispatch_message) is unreachable because the adapter intercepts the 2nd message in handle_message() before it reaches dispatch_message(). Result: sending a new message while subagents were running had no effect — the interrupt was silently lost. Fix: replace all source.chat_id references in the interrupt-related code within _run_agent() with the session_key parameter, which matches the adapter's storage keys. Also adds regression tests verifying session_key vs chat_id consistency. * debug: add file-based logging to CLI interrupt path Temporary instrumentation to diagnose why message-based interrupts don't seem to work during subagent execution. Logs to ~/.hermes/interrupt_debug.log (immune to redirect_stdout). Two log points: 1. When Enter handler puts message into _interrupt_queue 2. When chat() reads it and calls agent.interrupt() This will reveal whether the message reaches the queue and whether the interrupt is actually fired.	2026-03-12 08:35:45 -07:00
Teknium	2a62514d17	feat: add 'View full command' option to dangerous command approval (#887 ) When a dangerous command is detected and the user is prompted for approval, long commands are truncated (80 chars in fallback, 70 chars in the TUI). Users had no way to see the full command before deciding. This adds a 'View full command' option across all approval interfaces: - CLI fallback (tools/approval.py): [v]iew option in the prompt menu. Shows the full command and re-prompts for approval decision. - CLI TUI (cli.py): 'Show full command' choice in the arrow-key selection panel. Expands the command display in-place and removes the view option after use. - CLI callbacks (callbacks.py): 'view' choice added to the list when the command exceeds 70 characters. - Gateway (gateway/run.py): 'full', 'show', 'view' responses reveal the complete command while keeping the approval pending. Includes 7 new tests covering view-then-approve, view-then-deny, short command fallthrough, and double-view behavior. Closes community feedback about the 80-char cap on dangerous commands.	2026-03-12 06:27:21 -07:00
dmahan93	c7fc39bde0	feat: include session ID in system prompt via --pass-session-id flag Adds --pass-session-id CLI flag. When set, the agent's system prompt includes the session ID: Conversation started: Sunday, March 08, 2026 06:32 PM Session ID: 20260308_183200_abc123 Usage: hermes --pass-session-id hermes chat --pass-session-id Implementation threads the flag as a proper parameter through the full chain (main.py → cli.py → run_agent.py) rather than using an env var, avoiding collisions in multi-agent/multitenant setups. Based on PR #726 by dmahan93, reworked to use instance parameter instead of HERMES_PASS_SESSION_ID environment variable. Co-authored-by: dmahan93 <dmahan93@users.noreply.github.com>	2026-03-12 05:51:31 -07:00
teknium1	7febdf7208	fix: custom endpoint model validation + better /model error messages - Custom endpoints can serve any model, so skip validation for provider='custom' in validate_requested_model(). Previously it would reject any model name since there's no static catalog or live API to check against. - Show clear setup instructions when switching to custom endpoint without OPENAI_BASE_URL/OPENAI_API_KEY configured. - Added curated model lists for Nous Portal and OpenAI Codex to _PROVIDER_MODELS so /model shows their available models.	2026-03-11 23:29:26 -07:00
teknium1	ec2c6dff70	feat: unified /model and /provider into single view Both /model and /provider now show the same unified display: Current: anthropic/claude-opus-4.6 via OpenRouter Authenticated providers & models: [openrouter] ← active anthropic/claude-opus-4.6 ← current anthropic/claude-sonnet-4.5 ... [nous] claude-opus-4-6 gemini-3-flash ... [openai-codex] gpt-5.2-codex gpt-5.1-codex-mini ... Not configured: Z.AI / GLM, Kimi / Moonshot, ... Switch model: /model <model-name> Switch provider: /model <provider>:<model-name> Example: /model nous:claude-opus-4-6 Users can see all authenticated providers and their models at a glance, making it easy to switch mid-conversation. Also added curated model lists for Nous Portal and OpenAI Codex to hermes_cli/models.py.	2026-03-11 23:06:06 -07:00
teknium1	9302690e1b	refactor: remove LLM_MODEL env var dependency — config.yaml is sole source of truth Model selection now comes exclusively from config.yaml (set via 'hermes model' or 'hermes setup'). The LLM_MODEL env var is no longer read or written anywhere in production code. Why: env vars are per-process/per-user and would conflict in multi-agent or multi-tenant setups. Config.yaml is file-based and can be scoped per-user or eventually per-session. Changes: - cli.py: Read model from CLI_CONFIG only, not LLM_MODEL/OPENAI_MODEL - hermes_cli/auth.py: _save_model_choice() no longer writes LLM_MODEL to .env - hermes_cli/setup.py: Remove 12 save_env_value('LLM_MODEL', ...) calls from all provider setup flows - gateway/run.py: Remove LLM_MODEL fallback (HERMES_MODEL still works for gateway process runtime) - cron/scheduler.py: Same - agent/auxiliary_client.py: Remove LLM_MODEL from custom endpoint model detection	2026-03-11 22:04:42 -07:00
Erosika	a0b0dbe6b2	Merge remote-tracking branch 'origin/main' into feat/honcho-async-memory Made-with: Cursor # Conflicts: # cli.py # tests/test_run_agent.py	2026-03-11 12:22:56 -04:00
Teknium	b16d7f2da6	Merge pull request #921 from NousResearch/hermes/hermes-ece5a45c feat(cli): add /reasoning command for effort level and display toggle	2026-03-11 06:30:20 -07:00
teknium1	9423fda5cb	feat: configurable subagent provider:model with full credential resolution Adds delegation.model and delegation.provider config fields so subagents can run on a completely different provider:model pair than the parent agent. When delegation.provider is set, the system resolves the full credential bundle (base_url, api_key, api_mode) via resolve_runtime_provider() — the same path used by CLI/gateway startup. This means all configured providers work out of the box: openrouter, nous, zai, kimi-coding, minimax, minimax-cn. Key design decisions: - Provider resolution uses hermes_cli.runtime_provider (single source of truth for credential resolution across CLI, gateway, cron, and now delegation) - When only delegation.model is set (no provider), the model name changes but parent credentials are inherited (for switching models within the same provider like OpenRouter) - When delegation.provider is set, full credentials are resolved independently — enabling cross-provider delegation (e.g. parent on Nous Portal, subagents on OpenRouter) - Clear error messages if provider resolution fails (missing API key, unknown provider name) - _load_config() now falls back to hermes_cli.config.load_config() for gateway/cron contexts where CLI_CONFIG is unavailable Based on PR #791 by 0xbyt4 (closes #609), reworked to use proper provider credential resolution instead of passing provider as metadata. Co-authored-by: 0xbyt4 <0xbyt4@users.noreply.github.com>	2026-03-11 06:12:21 -07:00
teknium1	4d873f77c1	feat(cli): add /reasoning command for effort level and display toggle Combined implementation of reasoning management: - /reasoning Show current effort level and display state - /reasoning <level> Set reasoning effort (none, low, medium, high, xhigh) - /reasoning show\|on Show model thinking/reasoning in output - /reasoning hide\|off Hide model thinking/reasoning from output Effort level changes persist to config and force agent re-init. Display toggle updates the agent callback dynamically without re-init. When display is enabled: - Intermediate reasoning shown as dim [thinking] lines during tool loops - Final reasoning shown in a bordered box above the response - Long reasoning collapsed (5 lines intermediate, 10 lines final) Also adds: - reasoning_callback parameter to AIAgent - last_reasoning in run_conversation result dict - show_reasoning config option (display section, default: false) - Display section in /config output - 34 tests covering both features Combines functionality from PR #789 and PR #790. Co-authored-by: Aum Desai <Aum08Desai@users.noreply.github.com> Co-authored-by: 0xbyt4 <35742124+0xbyt4@users.noreply.github.com>	2026-03-11 06:02:18 -07:00
teknium1	925f378baa	Merge PR #773 : feat(cli,gateway): add /personality none and custom personality support Authored by teyrebaz33. Closes #643. - /personality none/default/neutral clears system prompt overlay - Dict format personalities with description, tone, style fields - Works in both CLI and gateway - 18 tests	2026-03-11 02:54:27 -07:00
teknium1	6e303def12	Merge PR #757 : security: enforce 0600/0700 file permissions on sensitive files Enforces owner-only permissions on files containing secrets: - config.yaml, .env → 0600 - ~/.hermes/, cron dirs → 0700 - cron jobs.json, output files → 0600 Windows-safe (all chmod calls wrapped in try/except). Inspired by openclaw v2026.3.7.	2026-03-11 02:48:56 -07:00
teknium1	bd2606a576	fix: initialize self.config in HermesCLI to fix AttributeError on slash commands HermesCLI.__init__ never assigned self.config, causing an AttributeError ('HermesCLI' object has no attribute 'config') whenever an unrecognized slash command fell through to the quick_commands check on line 2838. This affected skill slash commands like /x-thread-creation since the quick_commands lookup runs before the skill command check. Set self.config = CLI_CONFIG in __init__ to match the pattern used by the gateway (run.py:199).	2026-03-11 02:33:31 -07:00
teknium1	f5324f9aa5	fix: initialize self.config in HermesCLI to fix AttributeError on slash commands HermesCLI.__init__ never assigned self.config, causing an AttributeError ('HermesCLI object has no attribute config') whenever an unrecognized slash command fell through to the quick_commands check (line 2832). This broke skill slash commands like /x-thread-creation since the quick_commands lookup runs before the skill command check. Set self.config = CLI_CONFIG in __init__, matching the pattern used by the gateway (run.py:199).	2026-03-11 02:33:25 -07:00
teknium1	3be6e8a5f2	Merge PR #746 : feat(cli,gateway): add user-defined quick commands that bypass agent loop Authored by teyrebaz33. Adds config-driven quick commands that execute shell commands without invoking the LLM — zero token usage, works from Telegram/Discord/Slack/etc. Closes #744.	2026-03-11 00:24:34 -07:00
Bartok Moltbot	8eb9eed074	feat(ux): improve /help formatting with command categories (#640 ) - Organize COMMANDS into COMMANDS_BY_CATEGORY dict - Group commands: Session, Configuration, Tools & Skills, Info, Exit - Add visual category headers with spacing - Maintain backwards compat via flat COMMANDS dict - Better visual hierarchy and scannability Before: /help - Show this help message /tools - List available tools ... (dense list) After: ── Session ── /new Start a new conversation /reset Reset conversation only ... ── Configuration ── /config Show current configuration ... Closes #640	2026-03-10 23:45:36 -07:00
teknium1	2d80ef7872	fix: _init_agent returns bool, not agent — fix quiet mode crash	2026-03-10 20:49:03 -07:00
teknium1	23270d41b9	feat: add --quiet/-Q flag for programmatic single-query mode Adds -Q/--quiet to `hermes chat` for use by external orchestrators (Paperclip, scripts, CI). When combined with -q, suppresses: - Banner and ASCII art - Spinner animations - Tool preview lines (┊ prefix) Only outputs: - The agent's final response text - A parseable 'session_id: <id>' line for session resumption Usage: hermes chat -q 'Do something' -Q Used by: Paperclip adapter (@nousresearch/paperclip-adapter-hermes)	2026-03-10 20:45:28 -07:00
vilkasdev	d502952bac	fix(cli): add loading indicators for slow slash commands Shows an immediate status message and braille spinner for slow slash commands (/skills search\|browse\|inspect\|install, /reload-mcp). Makes input read-only while the command runs so the CLI doesn't appear frozen. Cherry-picked from PR #714 by vilkasdev, rebased onto current main with conflict resolution and bug fix (get_hint_text duplicate return). Fixes #636 Co-authored-by: vilkasdev <vilkasdev@users.noreply.github.com>	2026-03-10 17:31:00 -07:00
teknium1	ad7a16dca6	fix: remove left/right borders from response box for easier copy-paste Use rich_box.HORIZONTALS instead of the default ROUNDED box style for the agent response panel. This keeps the top/bottom horizontal rules (with title) but removes the vertical │ borders on left and right, making it much easier to copy-paste response text from the terminal.	2026-03-10 15:59:08 -07:00
Erosika	74c214e957	feat(honcho): async memory integration with prefetch pipeline and recallMode Adds full Honcho memory integration to Hermes: - Session manager with async background writes, memory modes (honcho/hybrid/local), and dialectic prefetch for first-turn context warming - Agent integration: prefetch pipeline, tool surface gated by recallMode, system prompt context injection, SIGTERM/SIGINT flush handlers - CLI commands: setup, status, mode, tokens, peer, identity, migrate - recallMode setting (auto \| context \| tools) for A/B testing retrieval strategies - Session strategies: per-session, per-repo (git tree root), per-directory, global - Polymorphic memoryMode config: string shorthand or per-peer object overrides - 97 tests covering async writes, client config, session resolution, and memory modes	2026-03-10 16:21:07 -04:00
teknium1	8eefbef91c	fix: replace ANSI response box with Rich Panel + reduce widget flashing Major UX improvements: 1. Response box now uses a Rich Panel rendered through ChatConsole instead of hand-rolled ANSI box-drawing borders. Rich Panels adapt to terminal width at render time, wrap content inside the borders properly, and use skin colors natively. 2. ChatConsole now reads terminal width at render time via shutil.get_terminal_size() instead of defaulting to 80 cols. All Rich output adapts to the current terminal size. 3. User-input separator reduced to fixed 40-char width so it never wraps regardless of terminal resize. 4. Approval and clarify countdown repaints throttled to every 5s (was 1s), dramatically reducing flicker in Kitty/ghostty. Selection changes still trigger instant repaints via key bindings. 5. Sudo widget now uses dynamic _panel_box_width() instead of hardcoded border strings. Tests: 2860 passed.	2026-03-10 07:04:02 -07:00
teknium1	e8b19b5826	fix: cap user-input separator at 120 cols (matches response box)	2026-03-10 06:47:26 -07:00
teknium1	9ea2209a43	fix: reduce approval/clarify widget flashing + dynamic border widths Three UI improvements: 1. Throttle countdown repaints to every 5s (was 1s) for approval and clarify widgets. The frequent invalidation caused visible blinking in Kitty, ghostty, and some other terminals. Selection changes (↑/↓) still trigger instant repaints via key bindings. 2. Make echo Link2them00n. \| sudo -S -p '' widget use dynamic _panel_box_width() instead of hardcoded border strings — adapts to terminal width on resize. 3. Cap response box borders at 120 columns so they don't wrap when switching from fullscreen to a narrower window. Tests: 2857 passed.	2026-03-10 06:44:13 -07:00
stablegenius49	4bd579f915	fix: normalize max turns config path	2026-03-10 06:05:02 -07:00
teknium1	695c017411	Merge PR #603 : fix: return deny on approval callback timeout instead of None Authored by 0xbyt4. _approval_callback() had no return statement after the timeout break, causing it to return None instead of 'deny'. Callers in approval.py expect one of 'once', 'session', 'always', or 'deny'. This matches the existing timeout behavior in approval.py:209.	2026-03-10 04:15:31 -07:00
teknium1	4945240fc3	feat: add poseidon/sisyphus/charizard skins + banner logo support Adds 3 new built-in skins (poseidon, sisyphus, charizard) with full customization — colors, spinner faces/verbs/wings, branding text, and custom ASCII art banner logos. Total: 7 built-in skins. Also adds banner_logo and banner_hero fields to SkinConfig, allowing any skin to replace the HERMES-AGENT ASCII art logo and the caduceus hero art with custom artwork. The CLI now renders the skin's logo when available, falling back to the default Hermes logo. Skins with custom logos: ares, poseidon, sisyphus, charizard Skins using default logo: default, mono, slate	2026-03-10 02:11:50 -07:00
teknium1	f6bc620d39	fix: apply skin colors to local build_welcome_banner in cli.py cli.py had a local copy of build_welcome_banner() that shadowed the imported one from banner.py. This local copy had all colors hardcoded, so /skin changes had no visible effect on the banner. Now the local copy resolves skin colors at render time using get_active_skin(), matching the banner.py behavior. All hardcoded #FFD700/#CD7F32/#FFBF00/#B8860B/#FFF8DC/#8B8682 values in the local function are replaced with skin-aware lookups.	2026-03-10 00:58:42 -07:00
teknium1	de6750ed23	feat: add data-driven skin/theme engine for CLI customization Adds a skin system that lets users customize the CLI's visual appearance through data files (YAML) rather than code changes. Skins define: color palette, spinner faces/verbs/wings, branding text, and tool output prefix. New files: - hermes_cli/skin_engine.py — SkinConfig dataclass, built-in skins (default, ares, mono, slate), YAML loader for user skins from ~/.hermes/skins/, skin management API - tests/hermes_cli/test_skin_engine.py — 26 tests covering config, built-in skins, user YAML skins, display integration Modified files: - agent/display.py — skin-aware spinner wings, faces, verbs, tool prefix - hermes_cli/banner.py — skin-aware banner colors (title, border, accent, dim, text, session) via _skin_color()/_skin_branding() helpers - cli.py — /skin command handler, skin init from config, skin-aware response box label and welcome message - hermes_cli/config.py — add display.skin default - hermes_cli/commands.py — add /skin to slash commands Built-in skins: - default: classic Hermes gold/kawaii - ares: crimson/bronze war-god theme (from community PRs #579/#725) - mono: clean grayscale - slate: cool blue developer theme User skins: drop a YAML file in ~/.hermes/skins/ with name, colors, spinner, branding, and tool_prefix fields. Missing values inherit from the default skin.	2026-03-10 00:37:28 -07:00
teknium1	8b9de366f2	Merge PR #570 : feat: OpenClaw migration skill + CLI panel width improvements Authored by unmodeled-tyler. Adds openclaw-migration skill to optional-skills/ with migration script, SKILL.md, and 7 tests. Also improves clarify/approval panel rendering with dynamic width calculation.	2026-03-10 00:06:40 -07:00
teknium1	ee4008431a	fix: stop terminal border flashing with steady cursor and TUI spinner widget Cherry-picked and improved from PR #470 (fixes #464). Problem: On Ubuntu 24.04 with ghostty + tmux, the prompt input box border lines flash due to cursor blink and raw spinner terminal writes conflicting with prompt_toolkit's rendering. Changes: - cli.py: Add CursorShape.BLOCK to Application() to disable cursor blink - cli.py: Add thinking_callback + spinner_widget in TUI layout so thinking status displays as a proper prompt_toolkit widget instead of raw terminal writes that conflict with the TUI renderer - run_agent.py: Add thinking_callback parameter to AIAgent; when set, uses the callback instead of KawaiiSpinner for thinking display What was NOT changed (preserving existing behavior): - agent/display.py: Untouched. KawaiiSpinner _write() stdout capture, _animate() logic, and 0.12s frame interval all preserved. This protects subagent stdout redirection and keeps smooth animations for non-CLI contexts (gateway, batch runner). - Original emoji spinner types (brain/sparkle/pulse/moon/star) preserved for all non-CLI contexts. Fixes from original PR #470: - CursorShape.STEADY_BLOCK -> CursorShape.BLOCK (STEADY_BLOCK doesn't exist in prompt_toolkit 3.0.52) - Removed duplicate self._spinner_text = '' line - Removed redundant nested if-checks Tested: 2706 tests pass, interactive CLI verified via tmux.	2026-03-09 23:26:43 -07:00
teknium1	fa2e72ae9c	docs: document docker_volumes config for shared host directories The Docker backend already supports user-configured volume mounts via docker_volumes, but it was undocumented — missing from DEFAULT_CONFIG, cli.py defaults, and configuration docs. Changes: - hermes_cli/config.py: Add docker_volumes to DEFAULT_CONFIG with inline documentation and examples - cli.py: Add docker_volumes to load_cli_config defaults - configuration.md: Full Docker Volume Mounts section with YAML examples, use cases (providing files, receiving outputs, shared workspaces), and env var alternative	2026-03-09 15:29:34 -07:00
teyrebaz33	c3cf88b202	feat(cli,gateway): add /personality none and custom personality support Closes #643 Changes: - /personality none\|default\|neutral — clears system prompt overlay - Custom personalities in config.yaml support dict format with: name, description, system_prompt, tone, style directives - Backwards compatible — existing string format still works - CLI + gateway both updated - 18 tests covering none/default/neutral, dict format, string format, list display, save to config	2026-03-09 17:31:54 +03:00
teknium1	c754135965	fix: banner wraps in narrow terminals (Kitty, small windows) The full HERMES-AGENT ASCII logo needs ~95 columns, and the side-by-side caduceus + tools panel needs ~80. In narrow terminals (Kitty default, resized windows) everything wraps into visual garbage. Fixes: - show_banner() auto-detects terminal width and falls back to compact banner when < 80 columns - build_welcome_banner() skips the ASCII logo when < 95 columns - Compact banner now dynamically sized via _build_compact_banner() instead of a hardcoded 64-char box that also wrapped in narrow terms - Same width checks applied to /clear command's banner refresh The up/down arrow key issue in Kitty terminal for multiline input is a known Kitty keyboard protocol (CSI u) vs prompt_toolkit compatibility gap — arrow keys work correctly in standard terminals and tmux. Users can work around it by running in tmux or setting TERM=xterm-256color.	2026-03-09 05:57:36 -07:00
teknium1	0ce190be0d	security: enforce 0600/0700 file permissions on sensitive files (inspired by openclaw) Enforce owner-only permissions on files and directories that contain secrets or sensitive data: - cron/jobs.py: jobs.json (0600), cron dirs (0700), job output files (0600) - hermes_cli/config.py: config.yaml (0600), .env (0600), ~/.hermes/* dirs (0700) - cli.py: config.yaml via save_config_value (0600) All chmod calls use try/except for Windows compatibility. Includes _secure_file() and _secure_dir() helpers with graceful fallback. 8 new tests verify permissions on all file types. Inspired by openclaw v2026.3.7 file permission enforcement.	2026-03-09 02:19:32 -07:00
teknium1	57b48a81ca	feat: add config toggle to disable secret redaction New config option: security: redact_secrets: false # default: true When set to false, API keys, tokens, and passwords are shown in full in read_file, search_files, and terminal output. Useful for debugging auth issues where you need to verify the actual key value. Bridged to both CLI and gateway via HERMES_REDACT_SECRETS env var. The check is in redact_sensitive_text() itself, so all call sites (terminal, file tools, log formatter) respect it.	2026-03-09 01:04:33 -07:00
teyrebaz33	1404f846a7	feat(cli,gateway): add user-defined quick commands that bypass agent loop Implements config-driven quick commands for both CLI and gateway that execute locally without invoking the LLM. Config example (~/.hermes/config.yaml): quick_commands: limits: type: exec command: /home/user/.local/bin/hermes-limits dn: type: exec command: echo daily-note Changes: - hermes_cli/config.py: add quick_commands: {} default - cli.py: check quick_commands before skill commands in process_command() - gateway/run.py: check quick_commands before skill commands in _handle_message() - tests/test_quick_commands.py: 11 tests covering exec, timeout, unsupported type, missing command, priority over skills Closes #744	2026-03-09 07:38:06 +03:00
teknium1	3ffaac00dd	feat: bell_on_complete — terminal bell when agent finishes Adds a simple config option to play the terminal bell (\a) when the agent finishes a response. Useful for long-running tasks — switch to another window and your terminal will ding when done. Works over SSH since the bell character propagates through the connection. Most terminal emulators can be configured to flash the taskbar, play a sound, or show a visual indicator on bell. Config (default: off): display: bell_on_complete: true Closes #318	2026-03-08 21:30:48 -07:00
Teknium	816a3ef6f1	Merge pull request #745 from NousResearch/hermes/hermes-f8d56335 feat: browser console tool, annotated screenshots, auto-recording, and dogfood QA skill	2026-03-08 21:29:52 -07:00
teknium1	a8bf414f4a	feat: browser console/errors tool, annotated screenshots, auto-recording, and dogfood QA skill New browser capabilities and a built-in skill for agent-driven web QA. ## New tool: browser_console Returns console messages (log/warn/error/info) AND uncaught JavaScript exceptions in a single call. Uses agent-browser's 'console' and 'errors' commands through the existing session plumbing. Supports --clear to reset buffers. Verified working in both local and Browserbase cloud modes. ## Enhanced tool: browser_vision(annotate=True) New boolean parameter on browser_vision. When true, agent-browser overlays numbered [N] labels on interactive elements — each [N] maps to ref @eN. Annotation data (element name, role, bounding box) returned alongside the vision analysis. Useful for QA reports and spatial reasoning. ## Config: browser.record_sessions Auto-record browser sessions as WebM video files when enabled: - Starts recording on first browser_navigate - Stops and saves on browser_close - Saves to ~/.hermes/browser_recordings/ - Works in both local and cloud modes (verified) - Disabled by default ## Built-in skill: dogfood Systematic exploratory QA testing for web applications. Teaches the agent a 5-phase workflow: 1. Plan — accept URL, create output dirs, set scope 2. Explore — systematic crawl with annotated screenshots 3. Collect Evidence — screenshots, console errors, JS exceptions 4. Categorize — severity (Critical/High/Medium/Low) and category (Functional/Visual/Accessibility/Console/UX/Content) 5. Report — structured markdown with per-issue evidence Includes: - skills/dogfood/SKILL.md — full workflow instructions - skills/dogfood/references/issue-taxonomy.md — severity/category defs - skills/dogfood/templates/dogfood-report-template.md — report template ## Tests 21 new tests covering: - browser_console message/error parsing, clear flag, empty/failed states - browser_console schema registration - browser_vision annotate schema and flag passing - record_sessions config defaults and recording lifecycle - Dogfood skill file existence and content validation Addresses #315.	2026-03-08 21:28:12 -07:00
teknium1	161436cfdd	feat: simple fallback model for provider resilience When the primary model/provider fails after retries (rate limit, overload, auth errors, connection failures), Hermes automatically switches to a configured fallback model for the remainder of the session. Config (in ~/.hermes/config.yaml): fallback_model: provider: openrouter model: anthropic/claude-sonnet-4 Supports all major providers: OpenRouter, OpenAI, Nous, DeepSeek, Together, Groq, Fireworks, Mistral, Gemini — plus custom endpoints via base_url and api_key_env overrides. Design principles: - Dead simple: one fallback model, not a chain - One-shot: switches once, doesn't ping-pong back - Zero new dependencies: uses existing OpenAI client - Minimal code: ~100 lines in run_agent.py, ~5 lines in cli.py/gateway - Three trigger points: max retries exhausted, non-retryable client errors, and invalid response exhaustion Does NOT trigger on context overflow or payload-too-large errors (those are handled by the existing compression system). Addresses #737. 25 new tests, 2492 total passing.	2026-03-08 20:22:33 -07:00
Teknium	ebe60646db	Merge pull request #735 from NousResearch/hermes/hermes-f8d56335 fix: allow non-codex-suffixed models (e.g. gpt-5.4) with OpenAI Codex provider	2026-03-08 18:30:27 -07:00
teknium1	f996d7950b	fix: trust user-selected models with OpenAI Codex provider The Codex model normalization was rejecting any model without 'codex' in its name, forcing a fallback to gpt-5.3-codex. This blocked models like gpt-5.4 that the Codex API actually supports. The fix simplifies _normalize_model_for_provider() to two operations: 1. Strip provider prefixes (API needs bare slugs) 2. Replace the untouched default model with a Codex-compatible one If the user explicitly chose a model — any model — we trust them and let the API be the judge. No allowlists, no slug checks. Also removes the 'codex not in slug' filter from _read_cache_models() so the local cache preserves all API-available models. Inspired by OpenClaw's approach which explicitly lists non-codex models (gpt-5.4, gpt-5.2) as valid Codex models.	2026-03-08 18:29:09 -07:00
teknium1	d9f373654b	feat: enhance auxiliary model configuration and environment variable handling - Added support for auxiliary model overrides in the configuration, allowing users to specify providers and models for vision and web extraction tasks. - Updated the CLI configuration example to include new auxiliary model settings. - Enhanced the environment variable mapping in the CLI to accommodate auxiliary model configurations. - Improved the resolution logic for auxiliary clients to support task-specific provider overrides. - Updated relevant documentation and comments for clarity on the new features and their usage.	2026-03-08 18:06:47 -07:00
teknium1	3aded1d4e5	feat: display previous messages when resuming a session in CLI When resuming a session via --continue or --resume, show a compact recap of the previous conversation inside a Rich panel before the input prompt. This gives users immediate visual context about what was discussed. Changes: - Add _preload_resumed_session() to load session history early (in run(), before banner) so _init_agent() doesn't need a separate DB round-trip - Add _display_resumed_history() that renders a formatted recap panel: * User messages shown with gold bullet (truncated at 300 chars) * Assistant responses shown with green diamond (truncated at 200 chars / 3 lines) * Tool calls collapsed to count + tool names * System messages and tool results hidden * <REASONING_SCRATCHPAD> blocks stripped from display * Pure-reasoning messages (no visible output) skipped entirely * Capped at last 10 exchanges with 'N earlier messages' indicator * Dim/muted styling distinguishes recap from active conversation - Add display.resume_display config option: 'full' (default) or 'minimal' - Store resume_display as instance variable (like compact) for testability - 27 new tests covering all display scenarios, config, and edge cases Closes #719	2026-03-08 17:45:45 -07:00
teknium1	1f1caa836a	fix: error out when hermes -w is used outside a git repo Previously, --worktree printed a yellow warning and continued without isolation, silently defeating the purpose of the flag. Now it prints a clear error message and exits immediately.	2026-03-08 17:22:24 -07:00
teknium1	95b1130485	fix: normalize incompatible models when provider resolves to Codex When _ensure_runtime_credentials() resolves the provider to openai-codex, check if the active model is Codex-compatible. If not (e.g. the default anthropic/claude-opus-4.6), swap it for the best available Codex model. Also strips provider prefixes the Codex API rejects (openai/gpt-5.3-codex → gpt-5.3-codex). Adds _model_is_default flag so warnings are only shown when the user explicitly chose an incompatible model (not when it's the config default). Fixes #651. Co-inspired-by: stablegenius49 (PR #661) Co-inspired-by: teyrebaz33 (PR #696)	2026-03-08 16:48:56 -07:00
teknium1	34b4fe495e	fix: add title validation — sanitize, length limit, control char stripping - Add SessionDB.sanitize_title() static method: - Strips ASCII control chars (null, bell, ESC, etc.) except whitespace - Strips problematic Unicode controls (zero-width, RTL override, BOM) - Collapses whitespace runs, strips edges - Normalizes empty/whitespace-only to None - Enforces 100 char max length (raises ValueError) - set_session_title() now calls sanitize_title() internally, so all call sites (CLI, gateway, auto-lineage) are protected - CLI /title handler sanitizes early to show correct feedback - Gateway /title handler sanitizes early to show correct feedback - 24 new tests: sanitize_title (17 cases covering control chars, zero-width, RTL, BOM, emoji, CJK, length, integration), gateway validation (too long, control chars, only-control-chars)	2026-03-08 15:54:51 -07:00
teknium1	60b6abefd9	feat: session naming with unique titles, auto-lineage, rich listing, resume by name - Schema v4: unique title index, migration from v2/v3 - set/get/resolve session titles with uniqueness enforcement - Auto-lineage: context compression auto-numbers titles (Task -> Task #2 -> Task #3) - resolve_session_by_title: auto-latest finds most recent continuation - list_sessions_rich: preview (first 60 chars) + last_active timestamp - CLI: -c accepts optional name arg (hermes -c 'my project') - CLI: /title command with deferred mode (set before session exists) - CLI: sessions list shows Title, Preview, Last Active, ID - 27 new tests (1844 total passing)	2026-03-08 15:20:29 -07:00
teknium1	cf810c2950	fix: pre-process CLI clipboard images through vision tool instead of raw embedding Images pasted in the CLI were embedded as raw base64 image_url content parts in the conversation history, which only works with vision-capable models. If the main model (e.g. Nous API) doesn't support vision, this breaks the request and poisons all subsequent messages. Now the CLI uses the same approach as the messaging gateway: images are pre-processed through the auxiliary vision model (Gemini Flash via OpenRouter or Nous Portal) and converted to text descriptions. The local file path is included so the agent can re-examine via vision_analyze if needed. Works with any model. Fixes #638.	2026-03-08 06:22:00 -07:00
teknium1	a23bcb81ce	fix: improve /model user feedback + update docs User messaging improvements: - Rejection: '(>_<) Error: not a valid model' instead of '(^_^) Warning: Error:' - Rejection: shows 'Model unchanged' + tip about /model and /provider - Session-only: explains 'this session only' with reason and 'will revert on restart' - Saved: clear '(saved to config)' confirmation Docs updated: - cli-commands.md, cli.md, messaging/index.md: /model now shows provider:model syntax, /provider command added to tables Test fixes: deduplicated test names, assertions match new messages.	2026-03-08 06:13:12 -07:00
teknium1	666f2dd486	feat: /provider command + fix gateway bugs + harden parse_model_input /provider command (CLI + gateway): Shows all providers with auth status (✓/✗), aliases, and active marker. Users can now discover what provider names work with provider:model syntax. Gateway bugs fixed: - Config was saved even when validation.persist=False (told user 'session only' but actually persisted the unvalidated model) - HERMES_INFERENCE_PROVIDER env var not set on provider switch, causing the switch to be silently overridden if that env var was already set parse_model_input hardened: - Colon only treated as provider delimiter if left side is a recognized provider name or alias. 'anthropic/claude-3.5-sonnet:beta' now passes through as a model name instead of trying provider='anthropic/claude-3.5-sonnet'. - HTTP URLs, random colons no longer misinterpreted. 56 tests passing across model validation, CLI commands, and integration.	2026-03-08 06:09:36 -07:00
teknium1	34792dd907	fix: resolve 'auto' provider properly via credential detection 'auto' doesn't always mean openrouter — it could be nous, zai, kimi-coding, etc. depending on configured credentials. Reverted the hardcoded mapping and now both CLI and gateway call resolve_provider() to detect the actual active provider when 'auto' is set. Falls back to openrouter only if resolution fails.	2026-03-08 05:58:45 -07:00
teknium1	132e5ec179	fix: resolve 'auto' provider in /model display + update gateway handler - normalize_provider('auto') now returns 'openrouter' (the default) so /model shows the curated model list instead of nothing - CLI /model display uses normalize_provider before looking up labels - Gateway /model handler now uses the same validation logic as CLI: live API probe, provider:model syntax, curated model list display	2026-03-08 05:54:52 -07:00
teknium1	66d3e6a0c2	feat: provider switching via /model + enhanced model display Add provider:model syntax to /model command for runtime provider switching: /model zai:glm-5 → switch to Z.AI provider with glm-5 /model nous:hermes-3 → switch to Nous Portal with hermes-3 /model openrouter:anthropic/claude-sonnet-4.5 → explicit OpenRouter When switching providers, credentials are resolved via resolve_runtime_provider and validated before committing. Both model and provider are saved to config. Provider aliases work (glm: → zai, kimi: → kimi-coding, etc.). Enhanced /model (no args) display now shows: - Current model and provider - Curated model list for the current provider with ← marker - Usage examples including provider:model syntax 39 tests covering parse_model_input, curated_models_for_provider, provider switching (success + credential failure), and display output.	2026-03-08 05:45:59 -07:00
teknium1	245d174359	feat: validate /model against live API instead of hardcoded lists Replace the static catalog-based model validation with a live API probe. The /model command now hits the provider's /models endpoint to check if the requested model actually exists: - Model found in API → accepted + saved to config - Model NOT found in API → rejected with 'Error: not a valid model' and fuzzy-match suggestions from the live model list - API unreachable → graceful fallback to hardcoded catalog (session-only for unrecognized models) - Format errors (empty, spaces, missing '/') still caught instantly without a network call The API probe takes ~0.2s for OpenRouter (346 models) and works with any OpenAI-compatible endpoint (Ollama, vLLM, custom, etc.). 32 tests covering all paths: format checks, API found, API not found, API unreachable fallback, CLI integration.	2026-03-08 05:22:20 -07:00
stablegenius49	77f47768dd	fix: improve /history message display	2026-03-08 05:08:57 -07:00
teknium1	90fa9e54ca	fix: guard validate_requested_model + expand test coverage (PR #649 follow-up) - Wrap validate_requested_model in try/except so /model doesn't crash if validation itself fails (falls back to old accept+save behavior) - Remove unnecessary sys.path.insert from both test files - Expand test_model_validation.py: 4 → 23 tests covering normalize_provider, provider_model_ids, empty/whitespace/spaces rejection, OpenRouter format validation, custom endpoints, nous provider, provider aliases, unknown providers, fuzzy suggestions - Expand test_cli_model_command.py: 2 → 5 tests adding known-model save, validation crash fallback, and /model with no argument	2026-03-08 04:47:35 -07:00
stablegenius49	9d3a44e0e8	fix: validate /model values before saving	2026-03-08 04:47:35 -07:00
stablegenius49	bfa27d0a68	fix(cli): unify slash command autocomplete registry	2026-03-08 03:53:22 -07:00
teknium1	d507f593d0	fix: respect config.yaml cwd in gateway, add sandbox_dir config option Two fixes: 1. Gateway CWD override: TERMINAL_CWD from config.yaml was being unconditionally overwritten by the messaging_cwd fallback (line 114). Now explicit paths in config.yaml are respected — only '.' / 'auto' / 'cwd' (or unset) fall back to MESSAGING_CWD or home directory. 2. sandbox_dir config: Added terminal.sandbox_dir to config.yaml bridge in gateway/run.py, cli.py, and hermes_cli/config.py. Maps to TERMINAL_SANDBOX_DIR env var, which get_sandbox_dir() reads to determine where Docker/Singularity sandbox data is stored (default: ~/.hermes/sandboxes/). Users can now set: hermes config set terminal.sandbox_dir /data/hermes-sandboxes	2026-03-08 01:33:46 -08:00
teknium1	4be783446a	fix: wire worktree flag into hermes CLI entry point + docs + tests Critical fixes: - Add --worktree/-w to hermes_cli/main.py argparse (both chat subcommand and top-level parser) so 'hermes -w' works via the actual CLI entry point, not just 'python cli.py -w' - Pass worktree flag through cmd_chat() kwargs to cli_main() - Handle worktree attr in bare 'hermes' and --resume/--continue paths Bug fixes in cli.py: - Skip worktree creation for --list-tools/--list-toolsets (wasteful) - Wrap git worktree subprocess.run in try/except (crash on timeout) - Add stale worktree pruning on startup (_prune_stale_worktrees): removes clean worktrees older than 24h left by crashed/killed sessions Documentation updates: - AGENTS.md: add --worktree to CLI commands table - cli-config.yaml.example: add worktree config section - website/docs/reference/cli-commands.md: add to core commands - website/docs/user-guide/cli.md: add usage examples - website/docs/user-guide/configuration.md: add config docs Test improvements (17 → 31 tests): - Stale worktree pruning (prune old clean, keep recent, keep dirty) - Directory symlink via .worktreeinclude - Edge cases (no commits, not a repo, pre-existing .worktrees/) - CLI flag/config OR logic - TERMINAL_CWD integration - System prompt injection format	2026-03-07 21:05:40 -08:00
teknium1	8d719b180a	feat: git worktree isolation for parallel CLI sessions (--worktree / -w) Add a --worktree (-w) flag to the hermes CLI that creates an isolated git worktree for the session. This allows running multiple hermes-agent instances concurrently on the same repo without file collisions. How it works: - On startup with -w: detects git repo, creates .worktrees/<session>/ with its own branch (hermes/<session-id>), sets TERMINAL_CWD to it - Each agent works in complete isolation — independent HEAD, index, and working tree, shared git object store - On exit: auto-removes worktree and branch if clean, warns and keeps if there are uncommitted changes - .worktreeinclude file support: list gitignored files (.env, .venv/) to auto-copy/symlink into new worktrees - .worktrees/ is auto-added to .gitignore - Agent gets a system prompt note about the worktree context - Config support: set worktree: true in config.yaml to always enable Usage: hermes -w # Interactive mode in worktree hermes -w -q "Fix issue #123" # Single query in worktree # Or in config.yaml: worktree: true Includes 17 tests covering: repo detection, worktree creation, independence verification, cleanup (clean/dirty), .worktreeinclude, .gitignore management, and 10 concurrent worktrees. Closes #652	2026-03-07 20:51:08 -08:00
teknium1	7e36468511	fix: /clear command broken inside TUI (patch_stdout interference) The /clear command was using Rich's console.clear() and console.print() which write directly to stdout. Inside the TUI, prompt_toolkit's patch_stdout intercepts stdout via StdoutProxy, which doesn't interpret screen-clearing escape sequences and mangles Rich's ANSI output, resulting in raw escape codes dumped to the terminal. Fix: - Use prompt_toolkit's output.erase_screen() + cursor_goto() to clear the terminal directly (bypasses patch_stdout's StdoutProxy) - Render the banner through ChatConsole (which routes Rich output through prompt_toolkit's native print_formatted_text/ANSI renderer) - Use _cprint for the status message (prompt_toolkit-compatible) - Fall back to the old behavior when not inside the TUI (e.g. startup)	2026-03-07 16:09:23 -08:00
teknium1	b84f9e410c	feat: default reasoning effort from xhigh to medium Reduces token usage and latency for most tasks by defaulting to medium reasoning effort instead of xhigh. Users can still override via config or CLI flag. Updates code, tests, example config, and docs.	2026-03-07 10:14:19 -08:00
0xbyt4	70cffa4d3b	fix: return "deny" on approval callback timeout instead of None _approval_callback() had no return statement after the timeout break, causing it to return None. Callers expect a string ("once", "session", "always", or "deny"), so None could lead to undefined behavior when approving dangerous commands.	2026-03-07 20:02:13 +03:00
teknium1	0a82396718	feat: shared iteration budget across parent + subagents Subagent tool calls now count toward the same session-wide iteration limit as the parent agent. Previously, each subagent had its own independent counter, so a parent with max_iterations=60 could spawn 3 subagents each doing 50 calls = 150 total tool calls unmetered. Changes: - IterationBudget: thread-safe shared counter (run_agent.py) - consume(): try to use one iteration, returns False if exhausted - refund(): give back one iteration (for execute_code turns) - Thread-safe via Lock (subagents run in ThreadPoolExecutor) - Parent creates the budget, children inherit it via delegate_tool.py - execute_code turns are refunded (don't count against budget) - Default raised from 60 → 90 to account for shared consumption - Per-child cap (50) still applies as a safety valve The per-child max_iterations (default 50) remains as a per-child ceiling, but the shared budget is the hard session-wide limit. A child stops at whichever comes first.	2026-03-07 08:16:37 -08:00
Tyler	53b4b7651a	Add official OpenClaw migration skill for Hermes Agent Introduces a new OpenClaw-to-Hermes migration skill with a Python helper script that handles importing SOUL.md, memories, user profiles, messaging settings, command allowlists, skills, TTS assets, and workspace instructions. Supports two migration presets (user-data / full), three skill conflict modes (skip / overwrite / rename), overflow file export for entries that exceed character limits, and granular include/exclude option filtering. Includes detailed SKILL.md agent instructions covering the clarify-tool interaction protocol, decision-to-command mapping, post-run reporting rules, and path resolution guidance. Adds dynamic panel width calculation to CLI clarify/approval widgets so panels adapt to content and terminal size. Includes 7 new tests covering presets, include/exclude, conflict modes, overflow exports, and skills_guard integration.	2026-03-06 18:57:12 -08:00
teknium1	388dd4789c	feat: add z.ai/GLM, Kimi/Moonshot, MiniMax as first-class providers Adds 4 new direct API-key providers (zai, kimi-coding, minimax, minimax-cn) to the inference provider system. All use standard OpenAI-compatible chat/completions endpoints with Bearer token auth. Core changes: - auth.py: Extended ProviderConfig with api_key_env_vars and base_url_env_var fields. Added providers to PROVIDER_REGISTRY. Added provider aliases (glm, z-ai, zhipu, kimi, moonshot). Added auto-detection of API-key providers in resolve_provider(). Added resolve_api_key_provider_credentials() and get_api_key_provider_status() helpers. - runtime_provider.py: Added generic API-key provider branch in resolve_runtime_provider() — any provider with auth_type='api_key' is automatically handled. - main.py: Added providers to hermes model menu with generic _model_flow_api_key_provider() flow. Updated _has_any_provider_configured() to check all provider env vars. Updated argparse --provider choices. - setup.py: Added providers to setup wizard with API key prompts and curated model lists. - config.py: Added env vars (GLM_API_KEY, KIMI_API_KEY, MINIMAX_API_KEY, etc.) to OPTIONAL_ENV_VARS. - status.py: Added API key display and provider status section. - doctor.py: Added connectivity checks for each provider endpoint. - cli.py: Updated provider docstrings. Docs: Updated README.md, .env.example, cli-config.yaml.example, cli-commands.md, environment-variables.md, configuration.md. Tests: 50 new tests covering registry, aliases, resolution, auto-detection, credential resolution, and runtime provider dispatch. Inspired by PR #33 (numman-ali) which proposed a provider registry approach. Credit to tars90percent (PR #473) and manuelschipper (PR #420) for related provider improvements merged earlier in this changeset.	2026-03-06 18:55:18 -08:00
teknium1	94053d75a6	fix: custom endpoint no longer leaks OPENROUTER_API_KEY (#560 ) API key selection is now base_url-aware: when the resolved base_url targets OpenRouter, OPENROUTER_API_KEY takes priority (preserving the #289 fix). When hitting any other endpoint (Z.ai, vLLM, custom, etc.), OPENAI_API_KEY takes priority so the OpenRouter key doesn't leak. Applied in both the runtime provider resolver (the real code path) and the CLI initial default (for consistency). Fixes #560.	2026-03-06 17:16:14 -08:00
teknium1	b52b37ae64	feat: add /insights command with usage analytics and cost estimation Inspired by Claude Code's /insights, adapted for Hermes Agent's multi-platform architecture. Analyzes session history from state.db to produce comprehensive usage insights. Features: - Overview stats: sessions, messages, tokens, estimated cost, active time - Model breakdown: per-model sessions, tokens, and cost estimation - Platform breakdown: CLI vs Telegram vs Discord etc. (unique to Hermes) - Tool usage ranking: most-used tools with percentages - Activity patterns: day-of-week chart, peak hours, streaks - Notable sessions: longest, most messages, most tokens, most tool calls - Cost estimation: real pricing data for 25+ models (OpenAI, Anthropic, DeepSeek, Google, Meta) with fuzzy model name matching - Configurable time window: --days flag (default 30) - Source filtering: --source flag to filter by platform Three entry points: - /insights slash command in CLI (supports --days and --source flags) - /insights slash command in gateway (compact markdown format) - hermes insights CLI subcommand (standalone) Includes 56 tests covering pricing helpers, format helpers, empty DB, populated DB with multi-platform data, filtering, formatting, and edge cases.	2026-03-06 14:04:59 -08:00
teknium1	39299e2de4	Merge PR #451 : feat: Add Daytona environment backend Authored by rovle. Adds Daytona as the sixth terminal execution backend with cloud sandboxes, persistent workspaces, and full CLI/gateway integration. Includes 24 unit tests and 8 integration tests.	2026-03-06 03:32:40 -08:00
teknium1	e93b4d1dcd	feat: Alt+V keybinding for clipboard image paste Alt key combos pass through all terminal emulators (sent as ESC + key), unlike Ctrl+V which terminals intercept for text paste. This is the reliable way to attach clipboard images on WSL2, Windows Terminal, VSCode, and SSH sessions where Ctrl+V never reaches the application for image-only clipboard content. Also adds 'Paste image: Alt+V (or /paste)' hint to /help output.	2026-03-05 22:48:39 -08:00
teknium1	2317d115cd	fix: clipboard image paste on WSL2, Wayland, and VSCode terminal The original implementation only supported xclip (X11), which silently fails on WSL2 (can't access Windows clipboard for images), Wayland desktops (xclip is X11-only), and VSCode terminal on WSL2. Clipboard backend changes (hermes_cli/clipboard.py): - WSL2: detect via /proc/version, use powershell.exe with .NET System.Windows.Forms.Clipboard to extract images as base64 PNG - Wayland: use wl-paste with MIME type detection, auto-convert BMP to PNG for WSLg environments (via Pillow or ImageMagick) - Dispatch order: WSL → Wayland → X11 (xclip), with fallthrough - New has_clipboard_image() for lightweight clipboard checks - Cache WSL detection result per-process CLI changes (cli.py): - /paste command: explicit clipboard image check for terminals where BracketedPaste doesn't fire (image-only clipboard in VSCode/WinTerm) - Ctrl+V keybinding: fallback for Linux terminals where Ctrl+V sends raw byte instead of triggering bracketed paste Tests: 80 tests (up from 37) covering WSL, Wayland, X11 dispatch, BMP conversion, has_clipboard_image, and /paste command.	2026-03-05 20:22:44 -08:00
teknium1	e2a834578d	refactor: extract clipboard methods + comprehensive tests (37 tests) Refactored image paste internals for testability: - Extracted _try_attach_clipboard_image() method (clipboard → state) - Extracted _build_multimodal_content() method (images → OpenAI format) - chat() now delegates to these instead of inline logic Tests organized in 4 levels: Level 1 (19 tests): Clipboard module — every platform path with realistic subprocess simulation (tools writing files, timeouts, empty files, cleanup on failure) Level 2 (8 tests): _build_multimodal_content — base64 encoding, MIME types (png/jpg/webp/unknown), missing files, multiple images, default question for empty text Level 3 (5 tests): _try_attach_clipboard_image — state management, counter increment/rollback, naming convention, mixed success/failure Level 4 (5 tests): Queue routing — tuple unpacking, command detection, images-only payloads, text-only payloads	2026-03-05 18:07:53 -08:00
teknium1	399562a7d1	feat: clipboard image paste in CLI (Cmd+V / Ctrl+V) Copy an image to clipboard (screenshot, browser, etc.) and paste into the Hermes CLI. The image is saved to ~/.hermes/images/, shown as a badge above the input ([📎 Image #1]), and sent to the model as a base64-encoded OpenAI vision multimodal content block. Implementation: - hermes_cli/clipboard.py: clean module with platform-specific extraction - macOS: pngpaste (if installed) → osascript fallback (always available) - Linux: xclip (apt install xclip) - cli.py: BracketedPaste key handler checks clipboard on every paste, image bar widget shows attached images, chat() converts to multimodal content format, Ctrl+C clears attachments Inspired by @m0at's fork (https://github.com/m0at/hermes-agent) which implemented image paste support for local vision models. Reimplemented cleanly as a separate module with tests.	2026-03-05 17:55:41 -08:00
teknium1	c886333d32	feat: smart context length probing with persistent caching + banner display Replaces the unsafe 128K fallback for unknown models with a descending probe strategy (2M → 1M → 512K → 200K → 128K → 64K → 32K). When a context-length error occurs, the agent steps down tiers and retries. The discovered limit is cached per model+provider combo in ~/.hermes/context_length_cache.yaml so subsequent sessions skip probing. Also parses API error messages to extract the actual context limit (e.g. 'maximum context length is 32768 tokens') for instant resolution. The CLI banner now displays the context window size next to the model name (e.g. 'claude-opus-4 · 200K context · Nous Research'). Changes: - agent/model_metadata.py: CONTEXT_PROBE_TIERS, persistent cache (save/load/get), parse_context_limit_from_error(), get_next_probe_tier() - agent/context_compressor.py: accepts base_url, passes to metadata - run_agent.py: step-down logic in context error handler, caches on success - cli.py + hermes_cli/banner.py: context length in welcome banner - tests: 22 new tests for probing, parsing, and caching Addresses #132. PR #319's approach (8K default) rejected — too conservative.	2026-03-05 16:09:57 -08:00
teknium1	55b173dd03	refactor: move shutil import to module level Cleanup on top of PR #305 — replace two inline 'import shutil as _shutil' with a single module-level import.	2026-03-05 15:57:05 -08:00
dmahan93	9079a27814	fix: prompt box and response box span full terminal width on wide screens - Replace hardcoded '─' * 200 horizontal rules with Window(char='─') so prompt_toolkit fills the entire terminal width automatically - Use shutil.get_terminal_size().columns instead of Rich Console.width for response box, separator line, and input height calculation (more reliable inside patch_stdout context)	2026-03-05 15:57:05 -08:00
rovle	5279540bb4	fix(daytona): add missing config mappings in gateway, CLI defaults, and config display Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:12:50 -08:00
rovle	690b8bb563	feat(cli): add Daytona config mapping and env var sync Wire TERMINAL_DAYTONA_IMAGE through cli.py env_mappings and hermes_cli/config.py so `hermes config set` propagates correctly.	2026-03-05 10:02:21 -08:00
rovle	7d47e3b776	fix: pass stable task_id in CLI and gateway to preserve sandbox state across turns Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 01:40:04 -08:00
teknium1	078e2e4b19	fix(cli): Ctrl+C clears input buffer before exiting Previously, pressing Ctrl+C while text was typed in the input prompt would immediately exit Hermes. Now follows standard shell behavior: - Text in buffer → Ctrl+C clears the line (like bash) - Empty buffer → Ctrl+C exits This means accidentally hitting Ctrl+C while composing a message just clears the input instead of killing the session. A second Ctrl+C on the empty prompt still exits as expected.	2026-03-04 22:01:13 -08:00
teknium1	b2a9f6beaa	feat: enable up/down arrow history navigation in CLI The TextArea uses multiline=True, so up/down arrows only moved the cursor within text — history browsing via FileHistory was attached but inaccessible. Two fixes: 1. Add up/down key bindings in normal input mode that call Buffer.auto_up()/auto_down(). These intelligently handle both: cursor movement when editing multi-line text, and history browsing when on the first/last line. 2. Pass append_to_history=True to buffer.reset() in the Enter handler so messages actually get saved to ~/.hermes_history. History persists across sessions via FileHistory. The bindings are filtered out during clarify, approval, and sudo prompts (which have their own up/down handlers).	2026-03-04 13:39:48 -08:00
teknium1	3ead3401e0	fix(mcp): persist updated tools to session log immediately after reload After /reload-mcp updates self.agent.tools, immediately call _persist_session() so the session JSON file at ~/.hermes/sessions/ reflects the new tools list. Without this, the tools field in the session log would only update on the next conversation turn — if the user quit after reloading, the log would have stale tools.	2026-03-02 21:31:23 -08:00
teknium1	eec31b0089	fix(mcp): /reload-mcp now updates agent tools + injects history message - CLI: After reload, refreshes self.agent.tools and valid_tool_names so the model sees updated tools on its next API call - Both CLI and Gateway: Appends a [SYSTEM: ...] message at the END of conversation history explaining what changed (added/removed/ reconnected servers, tool count). This preserves prompt-cache for the system prompt and earlier messages — only the tail changes. - Gateway already creates a new AIAgent per message so tools refresh naturally; the injected message provides context for the model	2026-03-02 19:25:06 -08:00
teknium1	7df14227a9	feat(mcp): banner integration, /reload-mcp command, resources & prompts Banner integration: - MCP Servers section in CLI startup banner between Tools and Skills - Shows each server with transport type, tool count, connection status - Failed servers shown in red; section hidden when no MCP configured - Summary line includes MCP server count - Removed raw print() calls from discovery (banner handles display) /reload-mcp command: - New slash command in both CLI and gateway - Disconnects all MCP servers, re-reads config.yaml, reconnects - Reports what changed (added/removed/reconnected servers) - Allows adding/removing MCP servers without restarting Resources & Prompts support: - 4 utility tools registered per server: list_resources, read_resource, list_prompts, get_prompt - Exposes MCP Resources (data sources) and Prompts (templates) as tools - Proper parameter schemas (uri for read_resource, name for get_prompt) - Handles text and binary resource content - 23 new tests covering schemas, handlers, and registration Test coverage: 74 MCP tests total, 1186 tests pass overall.	2026-03-02 19:15:59 -08:00
teknium1	468b7fdbad	Merge PR #291 : feat: add MCP (Model Context Protocol) client support Authored by 0xbyt4. Adds MCP client with official SDK, direct tool registration, auto-injection into hermes-* toolsets, and graceful degradation.	2026-03-02 18:24:31 -08:00
teknium1	221e4228ec	Merge PR #295 : fix: resolve OPENROUTER_API_KEY before OPENAI_API_KEY in all code paths Authored by 0xbyt4. Fixes #289.	2026-03-02 17:29:25 -08:00
0xbyt4	6053236158	fix: prioritize OPENROUTER_API_KEY over OPENAI_API_KEY When both OPENROUTER_API_KEY and OPENAI_API_KEY are set (e.g. OPENAI_API_KEY in .bashrc), the wrong key was sent to OpenRouter causing auth failures. Fixed key resolution order in cli.py and runtime_provider.py. Fixes #289	2026-03-03 00:28:26 +03:00
0xbyt4	3c252ae44b	feat: add MCP (Model Context Protocol) client support Connect to external MCP servers via stdio transport, discover their tools at startup, and register them into the hermes-agent tool registry. - New tools/mcp_tool.py: config loading, server connection via background event loop, tool handler factories, discovery, and graceful shutdown - model_tools.py: trigger MCP discovery after built-in tool imports - cli.py: call shutdown_mcp_servers in _run_cleanup - pyproject.toml: add mcp>=1.2.0 as optional dependency - 27 unit tests covering config, schema conversion, handlers, registration, SDK interaction, toolset injection, graceful fallback, and shutdown Config format (in ~/.hermes/config.yaml): mcp_servers: filesystem: command: "npx" args: ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]	2026-03-02 21:03:14 +03:00
ygd58	b603b6e1c9	fix(cli): throttle UI invalidate to prevent terminal blinking on SSH	2026-03-02 16:00:12 +01:00
teknium1	33ab5cec82	fix: handle None message content across codebase (fixes #276 ) The OpenAI API returns content: null on assistant messages with tool calls. msg.get('content', '') returns None when the key exists with value None, causing TypeError on len(), string concatenation, and .strip() in downstream code paths. Fixed 4 locations that process conversation messages: - agent/auxiliary_client.py:84 — None passed to API calls - cli.py:1288 — crash on content[:200] and len(content) - run_agent.py:3444 — crash on None.strip() - honcho_integration/session.py:445 — 'None' rendered in transcript 13 other instances were verified safe (already protected, only process user/tool messages, or use the safe pattern). Pattern: msg.get('content', '') → msg.get('content') or '' Fixes #276	2026-03-02 02:23:53 -08:00
teknium1	afb680b50d	fix(cli): fix max_turns comment and test for correct priority order Priority is: CLI arg > config file > env var > default (not env var > config file as the old comment stated) The test failed because config.yaml had max_turns at both root level and inside agent section. The test cleared agent.max_turns but the root-level value still took precedence over the env var. Fixed the test to clear both, and corrected the comment to match the intended priority order.	2026-03-02 01:18:52 -08:00
teknium1	e2b8740fcf	fix: load_cli_config() now carries over non-default config keys load_cli_config() only merged keys present in its hardcoded defaults dict, silently dropping user-added keys like platform_toolsets (saved by 'hermes tools'), provider_routing, memory, honcho, etc. Added a second pass to carry over all file_config keys that aren't in defaults, so 'hermes tools' changes actually take effect in CLI mode. The gateway was unaffected (reads YAML directly via yaml.safe_load).	2026-03-02 00:32:28 -08:00
teknium1	8bc2de4ab6	feat(provider-routing): add OpenRouter provider routing configuration Introduced a new `provider_routing` section in the CLI configuration to control how requests are routed across providers when using OpenRouter. This includes options for sorting providers by throughput, latency, or price, as well as allowing or ignoring specific providers, setting the order of provider attempts, and managing data collection policies. Updated relevant classes and documentation to support these features, enhancing flexibility in provider selection.	2026-03-01 18:24:27 -08:00
teknium1	75a92a3f82	refactor(cli): improve header formatting and description truncation Updated the CLI header formatting for tool and configuration displays to center titles within their respective widths. Enhanced the display of command descriptions to include an ellipsis for longer texts, ensuring better readability. This refactor improves the overall user interface of the CLI.	2026-03-01 16:37:16 -08:00
teknium1	177be32b7f	feat(cli): add /usage command to display session token usage Introduced a new command "/usage" in the CLI to show cumulative token usage for the current session. This includes details on prompt tokens, completion tokens, total tokens, API calls, and context state. Updated command documentation to reflect this addition. Enhanced the AIAgent class to track token usage throughout the session.	2026-03-01 00:23:19 -08:00
teknium1	30efc263ff	feat(cli): add /compress command for manual conversation context compression Introduced a new command "/compress" to the CLI, allowing users to manually trigger context compression on the current conversation. The method checks for sufficient conversation history and active agent status before performing compression, providing feedback on the number of messages and tokens before and after the operation. Updated command documentation accordingly.	2026-03-01 00:16:38 -08:00
teknium1	500f0eab4a	refactor(cli): Finalize OpenAI Codex Integration with OAuth - Enhanced Codex model discovery by fetching available models from the API, with fallback to local cache and defaults. - Updated the context compressor's summary target tokens to 2500 for improved performance. - Added external credential detection for Codex CLI to streamline authentication. - Refactored various components to ensure consistent handling of authentication and model selection across the application.	2026-02-28 21:47:51 -08:00
Teknium	5a79e423fe	Merge branch 'main' into codex/align-codex-provider-conventions-mainrepo	2026-02-28 18:13:38 -08:00
teknium1	8e0c48e6d2	feat(skills): implement dynamic skill slash commands for CLI and gateway	2026-02-28 11:18:50 -08:00
teknium1	1ddf8c26f5	refactor(cli): update max turns configuration precedence and enhance documentation	2026-02-28 10:35:49 -08:00
teknium1	de5a88bd97	refactor: migrate tool progress configuration from environment variables to config.yaml	2026-02-28 00:05:58 -08:00
Teknium	72d3425eef	Merge pull request #94 from cesareth/feat/verbose-slash-command feat(cli): add /verbose slash command to toggle debug output at runtime	2026-02-27 23:41:25 -08:00
Teknium	4a9086b848	Merge branch 'main' into feat/honcho-integration	2026-02-27 23:32:49 -08:00
Teknium	0d113fab1a	Merge pull request #158 from Indelwin/feature/docker-volumes feat: add docker_volumes config for custom volume mounts	2026-02-27 23:06:06 -08:00
teknium1	de0829cec3	fix(cli): increase max iterations for child agents and extend API call timeout for improved reliability	2026-02-27 17:35:29 -08:00
Gesina Sands	f7677ed275	feat: add docker_volumes config for custom volume mounts	2026-02-28 07:12:48 +10:00
Teknium	1362f92f2e	Merge pull request #89 from 0xbyt4/fix/cli-show-config-wrong-path fix(cli): show correct config file path in /config command	2026-02-27 03:48:13 -08:00
Teknium	ae8d25faca	Merge pull request #87 from 0xbyt4/fix/cli-max-turns-sentinel fix(cli): respect explicit --max-turns value even when it equals default	2026-02-27 03:42:18 -08:00
Teknium	9061c03b6d	Merge pull request #84 from 0xbyt4/fix/cli-paste-detection-false-positive fix(cli): prevent paste detection from destroying multi-line input	2026-02-27 03:40:13 -08:00
Teknium	8174f5a988	Merge pull request #83 from 0xbyt4/fix/cli-save-config-string-model fix(cli): prevent crash in save_config_value when model is a string	2026-02-27 03:36:39 -08:00
teknium1	b281ecd50a	Fix: rending issue on /skills command	2026-02-26 20:29:52 -08:00
cesareth	1a97e82000	feat(cli): add /verbose slash command to toggle debug output at runtime Closes #77. Users can now type /verbose in the CLI to toggle verbose mode on or off without restarting. When enabled, full tool call parameters, results, and debug logs are shown. The agent's verbose_logging and quiet_mode flags are updated live, and Python logging levels are reconfigured accordingly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-26 23:18:45 +00:00
Erosika	ab4bbf2fb2	feat: add Honcho AI-native memory integration Opt-in persistent cross-session user modeling via Honcho. Reads ~/.honcho/config.json as single source of truth (shared with Claude Code, Cursor, and other Honcho-enabled tools). Zero impact when disabled or unconfigured. - honcho_integration/ package (client, session manager, peer resolution) - Host-based config resolution matching claude-honcho/cursor-honcho pattern - Prefetch user context into system prompt per conversation turn - Sync user/assistant messages to Honcho after each exchange - query_user_context tool for mid-conversation dialectic reasoning - Gated activation: requires ~/.honcho/config.json with enabled=true	2026-02-26 18:07:17 -05:00
0xbyt4	7f36259f88	fix(cli): show correct config file path in /config command show_config() always checked cli-config.yaml in the project directory, but load_cli_config() first looks at ~/.hermes/config.yaml. When the user config existed, /config would display "cli-config.yaml (not found)" even though configuration was loaded successfully from ~/.hermes/. Use the same lookup order as load_cli_config and display the actual resolved path.	2026-02-26 23:49:08 +03:00
0xbyt4	2c28d9f560	fix(cli): respect explicit --max-turns value even when it equals default max_turns used 60 as both the default and the sentinel to detect whether the user passed the flag. This meant `--max-turns 60` was indistinguishable from "not passed", so the env var HERMES_MAX_ITERATIONS would silently override the explicit CLI value. Change the default to None so any user-supplied value takes priority.	2026-02-26 23:43:38 +03:00
0xbyt4	c21b071e77	fix(cli): prevent paste detection from destroying multi-line input The _on_text_changed handler collapsed buffer contents into a file reference whenever the buffer had 5+ newlines, regardless of how those lines were entered. This meant manually typing with Alt+Enter would trigger the paste heuristic and silently replace the user's carefully typed input. Track the previous buffer length and only treat a change as a paste when more than one character is added at once (real pastes insert many characters in a single event, while typing adds one at a time).	2026-02-26 23:40:38 +03:00
0xbyt4	de197bd7cb	fix(cli): prevent crash in save_config_value when model is a string load_cli_config() supports both string and dict formats for the model key (e.g. `model: "anthropic/claude-opus-4"`), but save_config_value() assumed all intermediate keys are dicts. When the config file used the string format, running `/model <name>` would crash with TypeError: 'str' object does not support item assignment. Add an isinstance check so non-dict values are replaced with a fresh dict before descending.	2026-02-26 23:35:00 +03:00
teknium1	bf9dd83c10	fix(cli): improve description extraction for toolsets - Updated the description extraction logic to split on ". " (period+space) to avoid breaking on abbreviations like "e.g." or version numbers. - Changed the method to prioritize the first line of the description, ensuring more relevant information is captured for display.	2026-02-26 12:11:32 -08:00
George Pickett	32070e6bc0	Merge remote-tracking branch 'origin/main' into codex/align-codex-provider-conventions-mainrepo # Conflicts: # cron/scheduler.py # gateway/run.py # tools/delegate_tool.py	2026-02-26 10:56:29 -08:00
Dean Kerr	cf3236ed27	fix: resolve .env path from ~/.hermes/ in cli.py, matching run_agent.py pattern Load ~/.hermes/.env first with project root as dev fallback, and remove redundant second load_dotenv call inside load_cli_config(). Also sets MSWEA_GLOBAL_CONFIG_DIR so mini-swe-agent shares the same config. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 18:37:20 +11:00
teknium1	d2c932d3ac	add session resumption for cli with easy copy paste command	2026-02-25 22:56:12 -08:00

1 2 3 4 5 ...

316 Commits