OMX
Oh My CodeXv0.18.9

Tools

An overview of the runtime capability surfaces OMX uses internally while workflows are running.

What are tools?

Tools are the runtime capability surfaces OMX uses internally while work is in progress. Unlike skills, which users invoke directly, tools are the lower-level capabilities that skills, hooks, and agents rely on underneath.

In other words, if skills decide what work should happen, tools are part of what makes that work actually possible.

Why do tools matter?

Tools are what let OMX:

  • store and restore runtime state
  • show progress through the HUD
  • accumulate knowledge in the wiki
  • perform visual QA such as screenshot comparison
  • delegate bounded cross-CLI questions to other models

So the Tools section is less about “what button do I press?” and more about understanding how OMX behaves under the hood.

Main tool categories

State

State is the surface for reading and writing active runtime mode state.

For example:

  • checking which mode is active
  • storing the current phase or iteration
  • clearing stale runtime state

It is one of the core building blocks behind long-running workflows such as $ralph, $team, and $ultraqa.

HUD

HUD is the monitoring surface for the current session.

For example:

  • checking which phase is currently active
  • opening a tmux status pane
  • exporting status as JSON

HUD is effectively the quickest answer to “what is happening right now?”.

Wiki

Wiki is the surface for persistent project knowledge stored under .omx/wiki/.

For example:

  • accumulating structured knowledge
  • reusing findings across sessions
  • querying, reading, and curating durable notes

If State is for the current run, Wiki is for information that should remain useful in later runs too.

Catalog

Catalog explains how OMX groups prompts, skills, and tools into a stable capability map.

This matters less as an execution surface and more as a way to understand how the docs and runtime are organized.

Visual

Visual is the lower-level comparison capability behind visual QA.

For example:

  • comparing screenshots against reference images
  • identifying layout, spacing, and hierarchy differences
  • producing evidence for the next UI iteration

In practice, users usually reach this through a higher-level workflow such as $visual-verdict.

Ask Claude / Ask Gemini

Ask Claude and Ask Gemini are bridge surfaces for delegating bounded sub-questions to other models outside the current Codex context.

For example:

  • getting a second opinion from another model
  • running a clean-room summary or lookup
  • comparing perspectives across CLIs intentionally

How tools differ from skills

A simple distinction is:

SurfaceMeaning
SkillsUser-facing workflows you explicitly start
ToolsLower-level capabilities those workflows depend on

For example, one workflow might:

  • update runtime state
  • write results into the wiki
  • show progress in the HUD
  • validate outputs through visual comparison

without forcing the user to invoke each of those capabilities as separate top-level commands.

When should you read the tools docs first?

The Tools section is especially useful when you want to:

  • understand where OMX stores and restores state
  • sort out the difference between HUD, wiki, and state
  • understand where visual QA or provider bridges sit in the architecture
  • understand what capability layer sits underneath the skills you already know

You do not need to begin with Tools, but once you want to understand OMX internals, it becomes a very important section.

On this page