Harness Engineering

Harness Engineering

Harness engineering is the practice of making a codebase, toolchain, and team process legible enough that coding agents can work, verify, review, and hand off changes with less human babysitting.

Key points

  • OpenAI frames the move from autocomplete and pair programming into agent delegation as requiring a better "harness" around the model, not only a better model [src-084].
  • The API & Codex Build Hour demo builds an "agent legibility score" for repositories, with criteria such as bootstrap self-sufficiency, task entry points, validation harnesses, linting, formatting, documentation, and modular boundaries [src-084].
  • Worktrees are part of the harness because they let multiple agent tasks run in parallel on separate branches without clobbering each other [src-084].
  • High-quality validation is central: tests, lint, build commands, browser checks, logs, and other acceptance signals let Codex decide whether the work is actually done [src-084].
  • Skills and subagents encode repeatable team standards, such as PR creation, commit style, code review, architecture review, standards enforcement, and pull-request follow-up [src-084].
  • Team knowledge should move from individual heads and chat history into version-controlled files, docs, specs, skills, notes, and local decision records so every agent benefits from the same context [src-084].
  • The source also adds a company-context version: an "Atlas" style repo for operating principles, strategy, and non-code context can make Codex useful beyond production code [src-084].

Related entities

Related concepts

Source references

  • [src-084] OpenAI Codex, Workspace Agents, Prompt Caching, and Superintelligence Policy cluster (2026-02-09 to 2026-05-08)