Harness

Invest in your harness

Your harness is where your unique knowledge lives. Own it, invest in it, keep it portable across whichever model lands next.

Five pillars

What goes into a harness

An agent harness is everything around the model that helps it do real work on your project: rules, tools, permissions, and the linked workspace it pulls context from.

Claude Code and Codex are themselves harnesses. You and your team provide a second harness on top of them.

Five pillars. Three get the attention: context, restraint, empowerment. Two get less and matter just as much: a linked context graph, and a visual interface.

Read the full essay: Five pillars of the agent harness above Claude Code and Codex →

Invest in a harness that you own: prompt, five-pillar harness, agent harness, and the underlying model

What your agent needs to know

Context

CLAUDE.md, AGENTS.md, path-scoped rules, examples and recipes, your data model, your past decisions.

Without it, the agent re-derives your team's choices on every prompt and ships something close but subtly wrong.

Context pillar with concrete example entries

How the context connects

Context Graph

Typed links between tracker items, plans, specs, diagrams, sessions, diffs, files, and decisions.

Without it, connections live only in human heads. With it, the agent traverses the graph in one call instead of seven lookups.

Context Graph pillar with concrete example entries

What your agent must not do

Restraint

Hard rules, approval boundaries, permission scopes, tool allowlists, an audit trail.

A capable agent without restraint will eventually push to main, drop a table, or burn through a paid API.

Restraint pillar with concrete example entries

What your agent can actually do

Empowerment

Tools that read logs, query the running database, drive the UI, take screenshots, and run end-to-end test loops.

An agent that can verify its own output closes the loop without a human in the middle of every step.

Empowerment pillar with concrete example entries

How you and your agent share the work

Visual Interface

Markdown, mockups, diagrams, data models, red and green diffs, screenshots, a canvas to sketch against.

When visual artifacts are first-class, the agent can render a mockup, screenshot it, and check whether it looks right.

Visual Interface pillar with concrete example entries

A worked example

A harness in action

Here is what those five pillars look like filled in for a single concrete prompt, all the way through to the resulting outcome.

How to think about your harness

Prioritizing your harness

Own your harness

If you cannot read it, edit it, take it with you, and run it under any agent you choose, it is not yours.

Invest in your harness

Spend a meaningful share of your token budget on better rules, better tools, recorded decisions, tighter loops. Treat the harness as a product your team ships to itself.

Keep it portable across models

Same files, same rules, same tools, same graph, whatever model lands next. If switching agents means re-creating the harness, you do not have choice.

An example you can adopt

Nimbalyst is an open-source harness built on these five pillars

Visual interface, context graph, empowerment tools, cross-model CLAUDE.md and skills, all in one workspace. Claude Code and Codex run as first-class agents. The agent layer is pluggable for whatever lands next.

The desktop and iOS apps are MIT licensed. Study how they are wired, copy what is useful, or run Nimbalyst as your workspace.

Download

Also available for: macOS Apple Silicon macOS Intel Windows Linux

Read about the context graph

FAQ

Questions about agent harnesses

What is an agent harness?

An agent harness is everything around the AI model that helps it do the right thing in your project. It is made of context (what the agent knows about your code and conventions), a context graph (how that knowledge connects across tracker items, plans, diagrams, sessions, and files), restraint (rules, permissions, allowlists), empowerment (tools that touch live state and let the agent verify its own work), and a visual interface (how you and the agent share work). The model is interchangeable. The harness is where your durable investment lives.

How is a harness different from Claude Code or Codex?

Claude Code and Codex are themselves harnesses. They wrap a frontier model with a system prompt, a tool set, a permission system, and an execution loop. Your team provides a second harness on top of that: the workspace, the linked context, the rules, and the tools that are specific to your project.

Why does the harness matter more than the model?

Frontier models flip the leaderboard every few weeks. Two recent studies from Stanford and Tsinghua have shown that the orchestration code around the model drives more performance variation than the model itself: the same model can produce a six-times gap in result quality depending on the harness it runs in. The investment you make in your harness compounds and survives model churn. The investment you make in tuning prompts for last quarter's model does not.

How do I start building a harness for Claude Code and Codex?

Start with a CLAUDE.md or AGENTS.md at the root of your project that captures your real conventions and hard rules. Add path-scoped rule files for areas with special concerns. Wire up at least one tool that lets the agent verify its own work, like a test loop or a screenshot tool. Adopt a workspace like Nimbalyst that gives you a linked context graph and visual editors out of the box, so the agent and the human can work from the same artifacts.

What is the context graph in a harness?

The context graph is the part of the harness that records persistent, typed links between the artifacts that matter. Tracker item to plan, plan to spec, spec to diagram, diagram to session, session to diff, diff to files, decision to the work that forced it. Without it, the connections between work live only in human heads and an agent cannot traverse them. With it, both human and agent can pick up where the last session left off in a single traversal.

Is Nimbalyst the only way to build a harness?

No. Many of the pieces of a good harness, like CLAUDE.md, path-scoped rules, and tool definitions, can be built up inside any project. Nimbalyst is one open-source example of a harness that already includes the context graph and visual interface pillars. You can adopt it whole, copy ideas from it, or use it as a reference while building your own.

Nimbalyst: the open-source visual workspace for building with Codex, Claude Code, and more

Download

Also available for: macOS Apple Silicon macOS Intel Windows Linux