From 25caaa4a709f71026b1f419ec61adcaf0f41f914 Mon Sep 17 00:00:00 2001
From: Teknium <127238744+teknium1@users.noreply.github.com>
Date: Thu, 30 Apr 2026 02:30:36 -0700
Subject: [PATCH] feat(tips): add cost-saving tips from April 30 tip-of-the-day (#17841)

Seed the tips corpus with the knobs users can turn to reduce token spend:
hermes tools / hermes skills config to trim surface area, /reasoning
low|minimal to dial thinking depth down from the medium default, and
hermes models to route auxiliary tasks (vision, compression, title gen,
session_search) to cheaper backends while the main chat model stays intact.

Requested by @micheltamanda under Teknium's tip-of-the-day tweet.
---
 hermes_cli/tips.py | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/hermes_cli/tips.py b/hermes_cli/tips.py
index 8e07323b..62fad2eb 100644
--- a/hermes_cli/tips.py
+++ b/hermes_cli/tips.py
@@ -100,6 +100,9 @@ TIPS = [
     "hermes gateway install sets up Hermes as a system service (systemd/launchd).",
     "hermes memory setup lets you configure an external memory provider (Honcho, Mem0, etc.).",
     "hermes webhook subscribe creates event-driven webhook routes with HMAC validation.",
+    "Save money: hermes tools disables unused tools, hermes skills config trims skills down.",
+    "/reasoning low or /reasoning minimal cuts thinking depth below the default (medium) — faster, cheaper responses.",
+    "hermes models routes vision, compression, and aux tasks to cheaper models — cuts background token cost 85%+ without downgrading your main chat model.",
 
     # --- Configuration ---
     "Set display.bell_on_complete: true in config.yaml to hear a bell when long tasks finish.",