Hermes Agent Tools

A toolset is the unit that gives Hermes a class of capabilities. If the agent needs to edit files, it needs file tools. If it needs to run builds, it needs terminal tools. If it needs to inspect a web UI, it needs browser tools.

Toolsets

Enable or disable a toolset interactively with hermes tools, or directly with hermes tools enable NAME or hermes tools disable NAME.

Toolset         What it provides
web             Web search and content extraction
browser         Browser automation (Browserbase, Camofox, or local Chromium)
terminal        Shell commands and process management
file            File read/write/search/patch
code_execution  Sandboxed Python execution
vision          Image analysis
image_gen       AI image generation
tts             Text-to-speech
skills          Skill browsing and management
memory          Persistent cross-session memory
session_search  Search past conversations
delegation      Subagent task delegation
cronjob         Scheduled task management
clarify         Ask user clarifying questions
messaging       Cross-platform message sending
search          Web search only (subset of web)
todo            In-session task planning and tracking
rl              Reinforcement learning tools (off by default)
moa             Mixture of Agents (off by default)
homeassistant   Smart home control (off by default)

Tool changes take effect on /reset, which starts a new session. They are not applied mid-conversation, because changing the tool list partway through would invalidate the prompt cache.
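The reset behavior can be pictured with a small model. This is only a sketch under assumptions: the Session class, its fields, and its methods are hypothetical names invented for illustration and are not part of Hermes. It shows one way a tool list could stay frozen for the life of a session while changes are staged for the next one.

```python
from dataclasses import dataclass, field
from typing import FrozenSet, Set


@dataclass
class Session:
    """Hypothetical model: the active toolsets are fixed when the session starts."""
    active: FrozenSet[str]
    pending: Set[str] = field(init=False)  # what the next session will receive

    def __post_init__(self) -> None:
        # Start the staged set as a copy of what is currently active.
        self.pending = set(self.active)

    def enable(self, name: str) -> None:
        # Staged only; the running session is not touched.
        self.pending.add(name)

    def disable(self, name: str) -> None:
        self.pending.discard(name)

    def reset(self) -> "Session":
        # A new session picks up every staged change at once.
        return Session(active=frozenset(self.pending))


session = Session(active=frozenset({"file", "terminal"}))
session.enable("browser")
mid_conversation = sorted(session.active)  # still the original toolsets

session = session.reset()
after_reset = sorted(session.active)       # now includes "browser"
```

Keeping the active set immutable mirrors the reason given above: the tool list is part of the prompt, so it stays stable until a new session begins.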


Callable API Tools

The exact callable tool surface depends on profile, platform, and enabled toolsets.

API tool                 What it does
browser_navigate         Open a URL and return a compact page snapshot.
browser_snapshot         Read the current page’s accessibility tree.
browser_click            Click an element from a browser snapshot.
browser_type             Type text into a browser input.
browser_press            Press a keyboard key in the browser.
browser_scroll           Scroll the page up or down.
browser_back             Navigate back in browser history.
browser_console          Read console logs or evaluate JavaScript in the page.
browser_get_images       Collect images on the current page.
browser_vision           Screenshot and visually analyze the current page.
clarify                  Ask the user a structured clarification question.
cronjob                  Create, list, update, pause, resume, remove, or run scheduled jobs.
delegate_task            Spawn isolated subagents for independent work.
image_generate           Generate images from text prompts.
memory                   Add, replace, or remove durable user/profile memory.
patch                    Apply targeted fuzzy replacements or multi-file patches.
process                  Manage background terminal processes.
read_file                Read text files with line numbers and pagination.
search_files             Search file names or file contents.
send_message             Send messages to connected platforms.
session_search           Search or browse prior conversation sessions.
skills_list              List installed skills.
skill_view               Read skill instructions and linked files.
skill_manage             Create, patch, edit, or delete skills.
terminal                 Execute shell commands and builds.
text_to_speech           Convert text into audio.
todo                     Track the current session’s task list.
vision_analyze           Analyze images from URLs or local files.
write_file               Create or overwrite text files.
multi_tool_use.parallel  Run independent developer tool calls in parallel.
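To illustrate how the callable surface could follow from the enabled toolsets, here is a rough sketch. The grouping in TOOLSET_TOOLS is an assumption inferred from the tool names alone; this page does not state which toolset owns which tool, and profile and platform effects are ignored entirely. The function name callable_surface is also hypothetical.

```python
from typing import Dict, Iterable, List

# ASSUMED grouping, guessed from tool names; the real mapping may differ.
TOOLSET_TOOLS: Dict[str, List[str]] = {
    "file": ["read_file", "write_file", "patch", "search_files"],
    "terminal": ["terminal", "process"],
    "skills": ["skills_list", "skill_view", "skill_manage"],
    "todo": ["todo"],
}


def callable_surface(enabled: Iterable[str]) -> List[str]:
    """Return the sorted union of tools exposed by the enabled toolsets."""
    tools = set()
    for name in enabled:
        # Toolsets missing from the assumed mapping contribute nothing here.
        tools.update(TOOLSET_TOOLS.get(name, []))
    return sorted(tools)


surface = callable_surface(["file", "todo"])
```

Under these assumptions, enabling only file and todo would leave terminal and process out of the surface, which is the kind of narrowing the toolset switches are meant to provide.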

Practical routing

Situation                                  Start with
Read a file                                read_file
Search file names or contents              search_files
Create or edit files                       write_file, patch
Run builds, tests, or deployment commands  terminal, process
Inspect a web UI                           browser_*
Analyze images or screenshots              vision_analyze, browser_vision
Schedule repeated work                     cronjob
Split independent workstreams              delegate_task
Recall prior conversations                 session_search
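The routing table above can be read as a lookup from situation to starting tools, with browser_* expanding to every browser tool. The sketch below is one possible encoding, not something Hermes ships: the short situation keys and the start_with helper are hypothetical, while the tool names are taken from the callable tool list.

```python
from fnmatch import fnmatchcase
from typing import Dict, List

# Tool names from the callable tool list that the routing table refers to.
KNOWN_TOOLS: List[str] = [
    "browser_navigate", "browser_snapshot", "browser_click", "browser_type",
    "browser_press", "browser_scroll", "browser_back", "browser_console",
    "browser_get_images", "browser_vision",
    "read_file", "search_files", "write_file", "patch",
    "terminal", "process", "vision_analyze", "cronjob",
    "delegate_task", "session_search",
]

# Hypothetical short keys for each row of the routing table.
ROUTES: Dict[str, List[str]] = {
    "read": ["read_file"],
    "search": ["search_files"],
    "edit": ["write_file", "patch"],
    "build": ["terminal", "process"],
    "web_ui": ["browser_*"],
    "images": ["vision_analyze", "browser_vision"],
    "schedule": ["cronjob"],
    "split_work": ["delegate_task"],
    "recall": ["session_search"],
}


def start_with(situation: str) -> List[str]:
    """Expand a route's patterns into concrete tool names, in KNOWN_TOOLS order."""
    patterns = ROUTES[situation]
    return [tool for tool in KNOWN_TOOLS
            if any(fnmatchcase(tool, pattern) for pattern in patterns)]


web_ui_tools = start_with("web_ui")
edit_tools = start_with("edit")
```

Using a glob pattern for the browser row keeps the table short while still resolving to all ten browser tools.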

Operating principle

More tools are not always better. Enable the capabilities required by the task, verify side effects, and treat deployment, file writes, external messages, and public posts as actions that deserve explicit checks.
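One way to make explicit checks concrete is a small gate in front of side-effecting tools. This is a sketch only: the SIDE_EFFECT_TOOLS set is an assumed classification (terminal for deployment, write_file and patch for file writes, send_message for external messages), and run_tool with its confirm callback is a hypothetical helper, not a Hermes API.

```python
from typing import Callable

# ASSUMED classification of tools whose effects outlive the conversation.
SIDE_EFFECT_TOOLS = {"terminal", "write_file", "patch", "send_message"}


def needs_explicit_check(tool_name: str) -> bool:
    """True when a tool call should be confirmed before it runs."""
    return tool_name in SIDE_EFFECT_TOOLS


def run_tool(tool_name: str,
             call: Callable[[], str],
             confirm: Callable[[str], bool]) -> str:
    """Run the call, asking for confirmation first if the tool has side effects."""
    if needs_explicit_check(tool_name) and not confirm(tool_name):
        return "skipped: " + tool_name
    return call()


# Read-only tools run directly; side-effecting ones wait for approval.
read_result = run_tool("read_file", lambda: "contents", lambda name: False)
blocked = run_tool("send_message", lambda: "sent", lambda name: False)
approved = run_tool("send_message", lambda: "sent", lambda name: True)
```

The confirm callback is where a human prompt or a policy check would plug in; read-only tools never reach it.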