Harness Engineering

Harness Engineering

Harness engineering is the practice of making a codebase, toolchain, and team process legible enough that coding agents can work, verify, review, and hand off changes with less human babysitting.

Key points

  • OpenAI frames the move from autocomplete and pair programming into agent delegation as requiring a better "harness" around the model, not only a better model [src-084].
  • The API & Codex Build Hour demo builds an "agent legibility score" for repositories, with criteria such as bootstrap self-sufficiency, task entry points, validation harnesses, linting, formatting, documentation, and modular boundaries [src-084].
  • Worktrees are part of the harness because they let multiple agent tasks run in parallel on separate branches without clobbering each other [src-084].
  • High-quality validation is central: tests, lint, build commands, browser checks, logs, and other acceptance signals let Codex decide whether the work is actually done [src-084].
  • Skills and subagents encode repeatable team standards, such as PR creation, commit style, code review, architecture review, standards enforcement, and pull-request follow-up [src-084].
  • Team knowledge should move from individual heads and chat history into version-controlled files, docs, specs, skills, notes, and local decision records so every agent benefits from the same context [src-084].
  • The source also adds a company-context version: an "Atlas" style repo for operating principles, strategy, and non-code context can make Codex useful beyond production code [src-084].

Related entities

Related concepts

Source references

  • [src-084] OpenAI Codex, Workspace Agents, Prompt Caching, and Superintelligence Policy cluster (2026-02-09 to 2026-05-08)

Robin Cartier perspective

This page is part of Robin Cartier's working AI knowledge graph: a practical research layer for production AI, recommendation systems, experimentation, GEO, and agentic web readiness.

The useful next step is to connect this concept back to applied product leadership and operating models.

Recommended next

Keep reading from this thread

From 494 indexed pages and articles.

  1. Wiki concept GPT-5.4 An OpenAI reasoning model presented in the API & Codex Build Hour as a token-efficient coding and agent model with native computer-use capabilities Related by 084
  2. Wiki concept OpenAI Workspace Agents Codex-powered agents in ChatGPT for shared team workflows that run in the cloud across files, apps, skills, schedules, Slack Related by 084
  3. Insight AI Beyond POCs How enterprise AI moves beyond proofs of concept through ownership, governance, measurement, adoption, and production operating models Readers have engaged with this next