Inside the Article Library: How the Writing Engine Produces Long-Form at Scale
12 prompts. Drop cap to footnote. How the IO Article Library chains decomposed prompts to produce publication-ready editorial without a single revision loop — and exactly which model handles each step.
IO-CB-2026-001
How does the IO Article Library produce publication-ready long-form content?
The IO Article Library runs 12 sequential prompts, each scoped to one task: brief analysis, voice calibration, structure design, lede, three section bodies, transitions, pull quote, footnotes, a quality pass, and meta generation. Prompts 1–3 run on Claude Sonnet for reasoning-heavy analysis. Prompts 4–10 run on Claude Haiku for fast execution. Prompts 11–12 return to Sonnet for coherence review. The full chain completes in 90–110 seconds and is returned to the Orchestrator as a single structured episode — no revision loops.
Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of energy and caffeine at the same moment. That is not a people problem. It is an architecture problem.
A single "write me a great article about X" prompt hands the model too many responsibilities at once: understand the brief, choose a structure, establish a voice, write a compelling lede, maintain quality across 2,000 words, end well. Each is a separate cognitive task. Bundling them means each gets a fraction of the model's attention — and the fraction allocated to sections three through five is smaller, because the context window is full of everything that came before.
The IO Article Library solves this with prompt decomposition. Each of the 12 prompts has one job. The brief analysis prompt extracts 6 structured parameters. The voice calibration prompt outputs a 200-token style specification. The structure design prompt outputs a locked outline. No subsequent prompt writes freeform.
01Why 12 Prompts and Not One
The number 12 is not arbitrary. It is the result of decomposing a publication-ready article into its minimum set of non-overlapping, single-responsibility tasks. Remove any one prompt and you either push its work onto an adjacent prompt or skip the step entirely — producing a detectably worse article.
The key decomposition decisions are three. First, structure before copy: the outline is locked in prompt 3 before any body copy is written. Second, sections receive only their brief: each section-body prompt receives the locked outline and its specific section brief, not the full text of prior sections. Third, quality pass at the end: prompt 11 reads the assembled article as a whole and flags coherence issues for prompt-level correction.
“Each prompt has one job. Structure before copy. Sections receive only their brief. A quality pass at the end. This is why section five reads as well as section one.”
Tommy Saunders · Founder, IntelligentOperations.ai
02The 12-Prompt Chain — Interactive
Click any step to see its model assignment, token budget, and role. The indigo nodes run on Claude Sonnet 4 (reasoning-heavy). The green nodes run on Claude Haiku (fast execution).
03Model Routing: Sonnet vs. Haiku
The Article Library routes each prompt to the model whose capabilities match the task — Sonnet for reasoning-heavy analysis and quality review, Haiku for high-volume execution. This is not cost-cutting; it produces better output. Haiku writes section bodies more cleanly because its focused attention window keeps it on the locked style spec.
Claude Sonnet 4
Claude Haiku
The counterintuitive finding: Haiku-generated section bodies score higheron voice consistency than Sonnet, because Sonnet's tendency to elaborate pushes it off the locked spec. The best model for a task is not always the most capable — it is the one whose failure modes are most compatible with the constraint structure.
04Before / After: Single Prompt vs. Chain
The same article brief — one generated with a single “write a full article” prompt, one through the 12-step chain. Tabs show the Lede, Section 2 (the degradation zone), and the Conclusion.
Artificial intelligence is transforming the way businesses approach content creation. In today’s rapidly evolving digital landscape, companies are increasingly turning to AI tools to streamline their content workflows…
Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of caffeine. That is not a people problem. It is an architecture problem.
05Voice Consistency Matrix
The matrix scores five voice attributes across the chain — measuring how consistently each holds from prompt 4 (lede) through prompt 10 (footnotes). A perfect score is 5.0. The single-prompt baseline is shown for comparison.
| Voice Attribute | P04 Lede | P05–07 Body | P08 Trans. | P09–10 Close | Single Prompt |
|---|---|---|---|---|---|
| Direct register (no hedging) | 5.0 | 5.0 | 4.7 | 5.0 | 3.2 |
| Structural argument | 5.0 | 4.8 | 5.0 | 5.0 | 2.4 |
| Banned vocabulary avoidance | 5.0 | 5.0 | 5.0 | 5.0 | 3.0 |
| Audience-appropriate specificity | 4.8 | 4.8 | 5.0 | 5.0 | 3.5 |
| Conclusion closes argument | N/A | N/A | N/A | 5.0 | 1.4 |
The baseline collapses most dramatically on “conclusion closes argument” — 1.4/5 — because a single prompt, having written 1,800 words, has almost no context window left for strategic thinking. The chain dedicates an entire Sonnet prompt (P11) to reading the complete article and writing a conclusion that responds to its own lede.