AI Video ProductionModule 2

2.1AI Script Writing — Hook, Story, CTA Framework

30 min 6 code blocks Practice Lab Quiz (4Q)

AI Script Writing

A YouTube video is only as good as its script. The difference between a video that keeps viewers for 90 seconds (YouTube's kill threshold) and one that holds them for 6 minutes is a well-structured script. AI script writing has evolved from generic templates to nuanced, audience-aware narratives. Pakistani creators who master this skill generate 3x higher subscriber growth in their first 90 days compared to those who improvise. This lesson teaches you to write scripts that hook Pakistani audiences and convert them into loyal subscribers.

The Anatomy of a High-Retention Script

Before you write a single word, understand the structure every successful faceless video follows. Think of it as a building — the hook is the foundation, the core points are the floors, and the CTA is the roof. Miss any layer and the structure collapses.

code
SCRIPT ARCHITECTURE — 6-MINUTE VIDEO TEMPLATE
═══════════════════════════════════════════════

00:00 ─── HOOK (0–15 sec)
│           ├── Shocking stat OR bold claim OR relatable question
│           ├── "Why should I keep watching?" answered in 1 sentence
│           └── Keyword #1 appears here (algorithm trigger)
│
00:15 ─── CONTEXT (15–45 sec)
│           ├── Background: who this affects and why now
│           ├── Personal stake or relatable scenario
│           └── Keyword #2 appears here
│
00:45 ─── CORE POINT 1 (45–135 sec)
│           ├── Point Statement (1 sentence)
│           ├── Explanation (why it matters)
│           ├── Action Step (what to do right now)
│           └── Pakistan Example (Karachi / Lahore / Islamabad scenario)
│
02:15 ─── CORE POINT 2 (135–225 sec)
│           ├── Point Statement
│           ├── Explanation
│           ├── Retention Moment ◄── question or plot twist
│           └── Pakistan Example
│
03:45 ─── CORE POINT 3 (225–315 sec)
│           ├── Point Statement
│           ├── Explanation
│           ├── Pakistan Example
│           └── Keyword #3 appears here
│
05:15 ─── TAKEAWAY (315–345 sec)
│           ├── Summarize all 3 points in 1 memorable sentence
│           └── Retention Moment ◄── surprising stat or reversal
│
05:45 ─── CALL TO ACTION (345–360 sec)
            ├── Subscribe ask (specific reason: "so you get Part 2")
            ├── Comment question (drives engagement signal)
            └── Community link (Discord / WhatsApp group)

Each section serves a measurable purpose. The hook determines click-to-watch conversion. The context determines whether viewers stay past 30 seconds. The three core points determine watch time percentage. The takeaway and CTA determine subscriber conversion rate.

Using AI to Generate Scripts: The Right Way

Gemini 2.5 Pro and ChatGPT can generate full scripts in seconds. The key is specificity. Vague prompts produce generic scripts; specific prompts produce viral scripts. Here is the exact difference:

Weak prompt: "Write a script about cryptocurrency."

Power prompt: "Write a 6-minute YouTube script about why Pakistani youth should NOT invest in crypto in 2026. Tone: authoritative but friendly. Include 3 reasons (scam risk, regulatory uncertainty, high volatility) with Pakistan-specific examples (Binance ban, Loot token crash, FIA crackdown). Add actionable alternatives (stock market KSE-100, fixed deposits, prize bonds). End with a comment question: 'What's your crypto story?' Format as a spoken script with [PAUSE] markers after every major point. Include keyword: 'Pakistan crypto 2026' in the first 30 seconds."

The power prompt produces a 1,500-word script with Pakistani references, pacing markers, and keyword placement — all in 30 seconds. The weak prompt produces 800 generic words you cannot use without heavy rewriting.

The AI Script Workflow:

code
GEMINI SCRIPT GENERATION PIPELINE
════════════════════════════════════

Step 1: TOPIC BRIEF
│   Input:  niche + audience + goal
│   Tool:   Gemini 2.5 Flash (free tier)
│   Output: 3 angle options to choose from
│
Step 2: FULL SCRIPT DRAFT
│   Input:  chosen angle + hook template + keyword list
│   Tool:   Gemini 2.5 Pro / ChatGPT-4o
│   Output: 1,200–1,800 word spoken script
│
Step 3: HUMANIZE
│   Input:  AI draft
│   Action: Add 1 personal story + 3 retention moments
│   Tool:   Manual edit (you, 10 minutes)
│   Output: Authentic, personal script
│
Step 4: KEYWORD AUDIT
│   Input:  humanized script
│   Tool:   Gemini prompt (see below)
│   Output: Keywords inserted naturally in first 30 sec
│
Step 5: PACING CHECK
│   Action: Read aloud — stumbling = rewrite that line
│   Goal:   Smooth delivery at 150–180 words per minute
│   Output: Final production-ready script

After generating, always add 2–3 personal touches. AI scripts are comprehensive but impersonal. Your edits make them authentic. Example: If you were scammed on crypto in 2022, add "I lost PKR 200,000 to a pump-and-dump coin in 2022 — here is how I learned the hard way." Your vulnerability builds trust that no AI can generate.

Four Script Templates: Comparison and Use Cases

Not every topic deserves the same hook style. Choose based on your niche and audience psychology.

TemplateHook StyleBest ForAvg Click-ThroughPakistan Example
Question HookOpens with a question viewers fearFinance, Career, Health7–9%"Did you know 95% of PK businesses fail in year one?"
Stat HookOpens with a surprising numberTech, Growth, Trends8–11%"PK freelancers earned USD 1.2B globally, but 70% made under USD 500"
Story HookOpens with a named character's journeyMotivation, Case Studies6–8%"Last month Amira from Karachi made PKR 500K using AI with zero prior skills"
Contrarian HookFlips the audience's default beliefBusiness, Self-improvement9–13%"Everyone says follow your passion. That advice is destroying Pakistani entrepreneurs."

The Contrarian Hook consistently outperforms on YouTube because it creates immediate cognitive dissonance — viewers cannot scroll past a claim that challenges what they already believe.

Full Contrarian Hook example with Gemini prompt:

code
GEMINI PROMPT — CONTRARIAN HOOK SCRIPT
════════════════════════════════════════

Prompt:
"Write a contrarian-style YouTube script opening (first 60 seconds)
that argues against the common advice '[COMMON ADVICE]' in Pakistan's
[NICHE] space. Include:
- A bold opening claim that will shock the target audience
- 2 statistics that support the contrarian position
- A Pakistan-specific example that makes it real
- A transition sentence into the main content
- [PAUSE] markers after major statements
Audience: Pakistani [freelancers / entrepreneurs / students] aged 22–35
Tone: Confident, not arrogant. Cite sources where possible."

Example output for business niche:
"[PAUSE] Aaj main aapko bata raha hoon ke 'follow your passion'
worst business advice hai. [PAUSE] In 2025, 87% of Pakistani startups
that positioned as 'passion projects' shut down within 18 months.
The ones that survived? They followed market demand, not passion.
Here is what actually works..."

Keyword Insertion Strategy

YouTube's algorithm scans your video's auto-transcript for keyword relevance signals. Place your primary keyword in the first 30 seconds, your secondary keyword between minutes 2–4 (highest watch time density), and your tertiary keyword near the CTA.

Keyword insertion prompt for Gemini:

code
GEMINI KEYWORD AUDIT PROMPT
═════════════════════════════

"Review this script and:
1. Insert the keyword '[PRIMARY KEYWORD]' naturally within the
   first 30 seconds — do NOT force it, rewrite the sentence if needed
2. Insert '[SECONDARY KEYWORD]' between word 500 and word 900
3. Insert '[TERTIARY KEYWORD]' in the final 100 words
4. Flag any sentence where a keyword sounds forced
5. Return the full revised script with keywords bolded

Script: [PASTE SCRIPT HERE]"

Do not keyword-stuff. YouTube penalizes transcripts that repeat the same phrase 8+ times unnaturally. Two natural insertions of your primary keyword outperform five forced ones.

Pacing Chart: The 6-Minute Timing Blueprint

Pacing is about controlling audience attention at specific timestamps. YouTube's internal data shows viewers are most likely to drop off at 30 seconds, 50%, 75%, and 90% of any video. Plant retention moments at each of these danger points.

code
PACING CHART — 6-MINUTE VIDEO (360 sec)
═════════════════════════════════════════

0:00–0:30  │ HOOK ZONE      ← Danger #1: 30% of viewers leave here
           │ Retention tool: Bold claim + immediate promise of payoff
           │ Target: Keep 70%+ past this mark
           │
0:30–2:00  │ CONTEXT + CP1  ← Build trust, introduce first point
           │ Pacing: ~160 words/min, 8 cuts/min
           │
2:00–3:30  │ CORE POINT 2   ← Danger #2: 50% mark (180 sec)
           │ Retention tool: Direct question to viewer
           │ Example: "Kya aapne kabhi yeh try kiya hai?"
           │
3:30–5:00  │ CORE POINT 3   ← Danger #3: 75% mark (270 sec)
           │ Retention tool: Unexpected reveal or statistic
           │ Example: "The number 4 will surprise you..."
           │
5:00–5:30  │ TAKEAWAY       ← Danger #4: 90% mark (324 sec)
           │ Retention tool: Tease a future video
           │ Example: "In Part 2, I'll show you exactly how..."
           │
5:30–6:00  │ CALL TO ACTION
           ← Final cut: subscribe + comment question + link

Words per minute guide:

  • Educational content: 130–160 WPM (allows processing)
  • Motivational content: 160–190 WPM (creates energy)
  • Breaking news / urgency: 190–220 WPM (creates excitement)

Generate your voiceover at the right WPM using ElevenLabs' speed control (covered in Module 3).

The Humanization Edit: Making AI Scripts Feel Personal

AI scripts sound like encyclopedia articles. Your job is to make them feel like a conversation. Use these four humanization techniques:

  1. Add a "I was wrong" moment — Admit a past mistake relevant to the topic. Vulnerability = instant trust.
  2. Name a real person — "My cousin Bilal in Lahore tried this and..." Real names create narrative immersion.
  3. Add a sensory detail — "I remember sitting in a Karachi traffic jam listening to this podcast when it clicked." Physical scenes anchor abstract ideas.
  4. Drop a Romanized Urdu phrase — "Seedha baat, koi lafaazi nahi" signals cultural authenticity to Pakistani viewers.

These four edits add 5–10 minutes to your workflow but can double your average watch time by making scripts feel authentic.

Practice Lab

Practice Lab

Task 1: Script Generation Pick a topic relevant to your niche. Use ChatGPT or Gemini with this full power prompt:

"Write a [6/10/15]-minute YouTube script about [TOPIC]. Audience: [Pakistani youth / entrepreneurs / business owners] aged [18–25 / 25–35 / 35+]. Tone: [motivational / educational / comedic]. Include [3–5] core points with Pakistan-specific examples (city names, PKR amounts, local platforms). Format as spoken script with [PAUSE] markers after every major statement. Include this primary keyword in the first 30 seconds: [YOUR KEYWORD]. End with a direct question for comments."

Task 2: Script Editing Take your AI-generated script and layer in all four humanization edits:

  • Add one personal story (2–3 sentences, specific detail)
  • Add three retention moments at 50%, 75%, and 90% marks
  • Insert keywords naturally using the Gemini keyword audit prompt
  • Drop one Romanized Urdu phrase that fits your brand voice
  • Read the final script aloud — any stumble = rewrite that sentence

Task 3: Template Mastery Write four 30-second hooks for the same topic using all four templates from the comparison table (Question, Stat, Story, Contrarian). Post all four in the course community and ask classmates which hook would make them click. Use the winning template for your next video. This exercise reveals your audience's psychology better than any analytics tool.

Pakistan Case Study: "Startup Stories PK"

Hassan Ali, a 28-year-old failed entrepreneur from Islamabad, launched "Startup Stories PK" after his e-commerce business lost PKR 800,000 in 2023. He started the channel to process his failures publicly — interviewing Pakistani founders who either exited for USD 1M+ or reached breakeven profitability.

His script formula is built around one insight: Pakistani audiences trust struggle over success. Every video follows this structure: (1) Founder's name and startup — the hook, (2) Their biggest public failure — the context, (3) Three operational lessons — the core points, (4) A direct question for comments — "Aapka sabse bada business mistake kya tha?"

He generates interview scripts with this Gemini prompt:

code
"Write a 10-minute YouTube interview script for a conversation with
[FOUNDER NAME] who built [STARTUP] and [OUTCOME: sold for USD X /
reached PKR X ARR]. Structure:
- Hook: Their most painful failure (30 seconds, visceral detail)
- Context: Timeline of their journey (1 minute)
- 3 core lessons with Pakistani market context each
- 4 follow-up questions I can ask to dig deeper (mark these [FOLLOW-UP])
- CTA: Ask viewers to share their own failure story
Format as spoken script. Include [PAUSE] markers. Tone: Honest, not promotional."

Month 1: 3,200 subscribers. Month 2: 31,000 subscribers (viral video on a founder who lost PKR 15M and rebuilt). Month 3: 147,000 subscribers and 12M views total.

Revenue at month 3: PKR 180,000/month from YouTube Partner Program ads. His CPM is PKR 1,200 (high because his audience is startup-adjacent, a premium advertiser demographic).

His Gumroad product: "Startup Founder Interview Script Pack" — 12 templates for different founder story types, sold at PKR 3,500 each. By month 4, 22 purchases = PKR 77,000 in product revenue, purely passive. Total income: PKR 257,000/month, all from one channel built on AI-assisted scripts.

"Jab pehli baar ChatGPT se script likha, mujhe laga ye fraud hai — but that script got 400K views. Ab main har video AI se start karta hoon."

Key Takeaways

  • The five-part script structure (Hook, Context, 3 Core Points, Takeaway, CTA) is not optional — it is the architecture every high-retention video uses
  • Vague AI prompts produce generic scripts; specific prompts that include audience age, tone, Pakistan examples, keywords, and format markers produce production-ready scripts in 30 seconds
  • The Contrarian Hook template averages 9–13% click-through rate — the highest of any hook style for Pakistani business and freelancing niches
  • Plant retention moments at exactly 50%, 75%, and 90% of your video — these are YouTube's statistically proven drop-off points
  • Keywords must appear in the first 30 seconds, minutes 2–4, and the final 100 words — this matches YouTube's algorithm scanning pattern
  • Four humanization edits (personal mistake, real name, sensory detail, Romanized Urdu phrase) can double average watch time at the cost of 10 extra minutes
  • Reading your script aloud before recording is non-negotiable — stumbling = the sentence must be rewritten for natural speech cadence
  • AI scripts are the foundation; your personal context is the differentiator — combine both and competitors using raw AI output cannot match your authenticity
  • Hassan Ali's case proves that a trauma-driven niche (startup failures) combined with AI scripting can reach 147K subscribers in 90 days with zero production budget
  • The script is your most important production asset — a great script with average visuals outperforms a weak script with cinematic production every time

Lesson Summary

Includes hands-on practice lab6 runnable code examples4-question knowledge check below

AI Script Writing Quiz

4 questions to test your understanding. Score 60% or higher to pass.