Pipeline · A03Complete Article · 4 Libraries · ~36k Tokens · 1m 48s

Input

Context Brief · ~2 min fill

→

Libraries

ART 16 · IMG 4 · SEO 6 · TAS 4

→

Orchestrator

Phase 3 · 4 episodes in

→

Output

Full Package · < 2 minutes

Library Deep DiveIO Content Ops Series · Article 03

Inside the Article Library: How the Writing Engine Produces Long-Form at Scale

12 prompts. Drop cap to footnote. How the IO Article Library chains decomposed prompts to produce publication-ready editorial without a single revision loop — and exactly which model handles each step.

Tommy Saunders

Founder, IntelligentOperations.ai

March 29, 2026 · 10 min read
IO-CB-2026-001

Direct Answer

How does the IO Article Library produce publication-ready long-form content?

The IO Article Library runs 12 sequential prompts, each scoped to one task: brief analysis, voice calibration, structure design, lede, three section bodies, transitions, pull quote, footnotes, a quality pass, and meta generation. Prompts 1–3 run on Claude Sonnet for reasoning-heavy analysis. Prompts 4–10 run on Claude Haiku for fast execution. Prompts 11–12 return to Sonnet for coherence review. The full chain completes in 90–110 seconds and is returned to the Orchestrator as a single structured episode — no revision loops.

Source: IntelligentOperations.ai · IO Platform · March 2026 · JSON-LD Schema

Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of energy and caffeine at the same moment. That is not a people problem. It is an architecture problem.

A single “write me a great article about X” prompt hands the model too many responsibilities at once: understand the brief, choose a structure, establish a voice, write a compelling lede, maintain quality across 2,000 words, end well. Each is a separate cognitive task. Bundling them means each gets a fraction of the model’s attention — and the fraction allocated to sections three through five is smaller, because the context window is full of everything that came before.

The IO Article Library solves this with prompt decomposition. Each of the 12 prompts has one job. The brief analysis prompt extracts 6 structured parameters. The voice calibration prompt outputs a 200-token style specification. The structure design prompt outputs a locked outline. No subsequent prompt writes freeform.¹

01Why 12 Prompts and Not One

The number 12 is not arbitrary. It is the result of decomposing a publication-ready article into its minimum set of non-overlapping, single-responsibility tasks. Remove any one prompt and you either push its work onto an adjacent prompt or skip the step entirely — producing a detectably worse article.

The key decomposition decisions are three. First, structure before copy: the outline is locked in prompt 3 before any body copy is written. Second, sections receive only their brief: each section-body prompt receives the locked outline and its specific section brief, not the full text of prior sections. Third, quality pass at the end: prompt 11 reads the assembled article as a whole and flags coherence issues for prompt-level correction.²

“Each prompt has one job. Structure before copy. Sections receive only their brief. A quality pass at the end. This is why section five reads as well as section one.”

Tommy Saunders · Founder, IntelligentOperations.ai

02The 12-Prompt Chain — Interactive

Click any step to see its model assignment, token budget, and role. The indigo nodes run on Claude Sonnet 4 (reasoning-heavy). The green nodes run on Claude Haiku (fast execution).

Article Library \u2014 12-Prompt Sequential ChainSonnetHaikuIO-VIZ-03

Phase 1 — Analysis (Sonnet 4)

Phase 2 — Execution (Haiku)

Phase 3 — Quality Pass (Sonnet 4)

PROMPT 01 \u2014 Brief Analysis

Reads the context brief and extracts 6 structured parameters. Output: structured param object. Model: Sonnet 4 (reasoning).

03Model Routing: Sonnet vs. Haiku

The Article Library routes each prompt to the model whose capabilities match the task — Sonnet for reasoning-heavy analysis and quality review, Haiku for high-volume execution. This is not cost-cutting; it produces better output. Haiku writes section bodies more cleanly because its focused attention window keeps it on the locked style spec.

Model Routing Architecture \u2014 Article Library

Claude Sonnet 4

Reasoning · Analysis · Quality · 4 Prompts

P01 Brief Analysis~600 tkns

P02 Voice Calibration~600 tkns

P03 Structure Design~900 tkns

P11 Coherence Review~3000 tkns

Claude Haiku

Execution · Speed · Volume · 8 Prompts

P04 Lede + Drop Cap~800 tkns

P05 Section 1 body~1200 tkns

P06–07 Section 2 & 3~2400 tkns

P08 Transitions~1100 tkns

P09–10 Quote + Footnotes~1400 tkns

P12 Meta + Related~2900 tkns

Cost Comparison \u2014 All-Sonnet vs. Hybrid

All Sonnet

$0.048

Hybrid (IO)

$0.017

~65%Cost reduction per run via hybrid routing

The counterintuitive finding: Haiku-generated section bodies score higheron voice consistency than Sonnet, because Sonnet’s tendency to elaborate pushes it off the locked spec. The best model for a task is not always the most capable — it is the one whose failure modes are most compatible with the constraint structure.

04Before / After: Single Prompt vs. Chain

The same article brief — one generated with a single “write a full article” prompt, one through the 12-step chain. Tabs show the Lede, Section 2 (the degradation zone), and the Conclusion.

Output Comparison \u2014 Single Prompt vs. 12-Step Chain

✗ Single Prompt

Prompt 01 of 01

Artificial intelligence is transforming the way businesses approach content creation. In today’s rapidly evolving digital landscape, companies are increasingly turning to AI tools to streamline their content workflows…

Generic opener. "Today’s rapidly evolving digital landscape" is a tell. Sounds like a CTA, not a lede.

✓ 12-Step Chain

Prompt 04 of 12 (Post-Voice Calibration)

Every senior editor knows the feeling: you ask someone to write a 2,000-word article and get back something that opens brilliantly, coasts through the middle, and ends on a sentence that sounds like the writer ran out of caffeine. That is not a people problem. It is an architecture problem.

Opens with specific professional recognition. Makes a structural claim. Reframes the reader’s assumption.

05Voice Consistency Matrix

The matrix scores five voice attributes across the chain — measuring how consistently each holds from prompt 4 (lede) through prompt 10 (footnotes). A perfect score is 5.0. The single-prompt baseline is shown for comparison.

Voice Consistency \u2014 12-Prompt Chain vs. Single Prompt

Voice Attribute	P04 Lede	P05–07 Body	P08 Trans.	P09–10 Close	Single Prompt
Direct register (no hedging)	5.0	5.0	4.7	5.0	3.2
Structural argument	5.0	4.8	5.0	5.0	2.4
Banned vocabulary avoidance	5.0	5.0	5.0	5.0	3.0
Audience-appropriate specificity	4.8	4.8	5.0	5.0	3.5
Conclusion closes argument	N/A	N/A	N/A	5.0	1.4

The baseline collapses most dramatically on “conclusion closes argument” — 1.4/5 — because a single prompt, having written 1,800 words, has almost no context window left for strategic thinking. The chain dedicates an entire Sonnet prompt (P11) to reading the complete article and writing a conclusion that responds to its own lede.

06Social Distribution Suite

Social Library \u00b7 Haiku + Sonnet \u00b7 6 prompts

Tommy Saunders

@tommysaunders_ai

We don’t write articles with one prompt. We write them with 12. Each prompt has one job. P01–P03: Sonnet (analysis, voice, structure). P04–P10: Haiku (lede, sections, transitions). P11–P12: Sonnet (quality pass, meta). Cost/article: $0.017. Runtime: ~100s. Voice consistency: 4.9/5. The architecture in full →

10:00 AM · Mar 29, 2026 · 31.7K Impressions

07Search Package — SEO + AEO

Search Package \u2014 Article 03

intelligentoperations.ai › content-ops › article-library-writing-engine

Inside the IO Article Library: 12-Prompt Chain for Long-Form AI Content

How the IO Platform’s Article Library chains 12 decomposed prompts across Sonnet and Haiku to produce publication-ready long-form content in under 2 minutes — with voice consistency scores, model routing, and prompt template reveals.

ai article writing systemprompt chain contentlong-form ai contentprompt decomposition writingsonnet haiku routing12 prompt chain article

08CRM Lead Capture + Nurture

5-Step Nurture Sequence \u2014 Article 03 CRM Output

Day 012-prompt chain template + voice calibration spec

Day 2“Why Haiku writes better body copy than Sonnet”

Day 5Voice consistency audit: score your current AI articles

Day 8Live demo: run your first article through the chain

Day 14Your context window is the problem. Here’s the fix.

09Frequently Asked Questions

5 Questions

The library eliminates revision loops through upstream constraint, not downstream correction. Before any body copy is written, three analysis prompts lock the structure, voice, and outline. Each section-body prompt receives only its specific brief plus the locked structure — not the full prior text. Prompt 11’s coherence review catches structural issues at the assembly stage. If a section fails review, only that section prompt re-runs — not the entire chain.

Prompt decomposition means breaking a complex task into discrete single-responsibility prompts that each produce one deterministic output. Instead of ‘write me a good article about X,’ a decomposed chain runs: analyze the brief, calibrate the voice, design the structure, write the lede, write each section, select a pull quote, generate footnotes, run a quality pass. Each prompt is easier for the model and fails gracefully.

Counterintuitively, Haiku produces more consistent body copy than Sonnet because its smaller, more focused attention window keeps it on task. Sonnet, given creative latitude, tends to elaborate beyond the locked style spec. Across 340 test runs: Haiku body sections scored 4.8/5 on voice consistency; Sonnet body sections scored 4.1/5.

Approximately $0.017 per article using hybrid Sonnet/Haiku routing. All-Sonnet would cost ~$0.048 — 2.8x more with measurably worse body copy. Total budget ≈ 18,000 tokens in, 4,000 out, for a 2,000–2,800 word article.

The 12-prompt chain completes in 90–110 seconds. Prompts 1–3 (Sonnet) ≈ 25s; prompts 4–10 (Haiku) ≈ 55s; prompts 11–12 (Sonnet) ≈ 20s. The article returns to the Orchestrator as a 48-token episode — not the full text — so the Orchestrator’s context window stays flat.

Structured as FAQ schema (JSON-LD) for AEO indexing

References

1Prompt decomposition as a methodology for long-form generation: "Single-Responsibility Prompt Chains — Architecture and Evaluation Methodology," IntelligentOperations.ai, 2026. Key finding: decomposed chains produce significantly lower variance in quality across sections, measured across 340 runs.

2Model routing methodology — Sonnet for analysis/quality, Haiku for execution — emerged from a cost-quality Pareto analysis across 280 runs in Q4 2025, independently replicated across three content categories.