Karpathy LLM Wiki Pattern

Karpathy LLM Wiki Pattern

A knowledge-base architecture popularised by Andrej Karpathy in April 2026 as a RAG alternative for small-to-medium personal corpora. Uses flat markdown files inside a raw/ folder for sources and a wiki/ folder for synthesised entity/concept/analysis pages, plus an index.md and log.md. Claude Code reads, indexes, compacts, and links pages by following markdown links rather than using vector similarity. No embeddings, no vector DB, no chunking pipeline — just markdown and a claude.md that explains the routing. Obsidian is the recommended viewer for the graph view, though not required.

Key points

  • Flat markdown structure: raw/ (sources), wiki/ (synthesised pages), index.md, log.md
  • Optional subfolders: entities, concepts, sources, analysis, people, organisations
  • Hot cache (hot.md) stores recent context summary, useful for executive-assistant style wikis
  • Claude Code performs data ingest, organisation, linting, and Q&A phases
  • One user dropped token usage 95% vs vector RAG on 383 files + 100 meeting transcripts
  • Relationships via markdown backlinks, not similarity search
  • Linting phase runs LLM health checks for inconsistent data and gap-filling
  • Scales cleanly up to hundreds of pages; vector RAG wins above millions of docs
  • Obsidian Web Clipper extension can ingest web articles directly into the raw folder
  • In Karpathy’s Sequoia interview, the pattern is reframed as an understanding tool: LLMs can project fixed information into new wiki structures, summaries, links, and Q&A surfaces that help a human understand the material better [src-055]
  • Roberts contrasts the wiki/Obsidian style with Pinecone: use markdown when you want readable, editable files and graph backlinks; use vector memory when scalable semantic search matters more [src-059].

Related entities

Related concepts

Source references

  • [src-004] Nate Herk cluster — Nate Herk — Claude Code cluster (21 videos)

– Videos referenced: sboNwYmH3AY

  • [src-013] Nate Herk — “Build & Sell Claude Code Operating Systems (2+ Hour Course)” (2026-05-01)

– Live demo: Karpathy’s original prompt used verbatim to build a wiki from the AI2027 article, producing 23 pages (6 people, 5 organisations, multiple concept pages) in one session. Hot.md demonstrated as a session cache. Obsidian Web Clipper shown as the one-click ingestion path for web articles.

  • [src-055] Sequoia Capital — “Andrej Karpathy: From Vibe Coding to Agentic Engineering” (2026-04-29)
  • [src-059] Jack Roberts — “This Memory System just 10x’d Claude Code” (2026-05-03)

Robin Cartier perspective

This page is part of Robin Cartier's working AI knowledge graph: a practical research layer for production AI, recommendation systems, experimentation, GEO, and agentic web readiness.

The useful next step is to connect this concept back to applied product leadership and operating models.

Recommended next

Keep reading from this thread

From 494 indexed pages and articles.

  1. Wiki concept LLM Knowledge Bases (Karpathy pattern) A practical guide to LLM knowledge bases, the Karpathy markdown wiki pattern, and what it teaches production AI teams about durable memory and governance Related by karpathy
  2. Wiki concept LLM Wiki vs Semantic RAG A comparison framework for choosing between two knowledge-base architectures: the LLM Wiki Pattern (markdown + index + LLM reader) and semantic RAG (embeddings + vector database Related by karpathy
  3. Insight Generative Engine Optimization for AI Search A practical GEO guide for becoming visible in AI-generated answers through machine-scannable content, authority, schema, and monitoring Related by sources