Back to Projects
Chitra: GenAI Video Pipeline
GenAIVideoLLMsContent

Chitra: GenAI Video Pipeline

Multi-stage GenAI pipeline that converts text articles into short-form videos using LLM summarization, scriptwriting, image generation, AI avatars, and human-in-the-loop QA

Overview

AI Chitra was an experimental content production system designed to answer a specific question: can we reliably convert text-first content (news articles and story prompts) into high-quality, publishable videos with a largely automated workflow? The project combined LLM-driven understanding and writing with generative visuals and avatar-based presentation, backed by human review before publishing.

Motivation

High-quality video creation is expensive and slow. Most content pipelines require scripting, creative direction, asset sourcing, editing, and publishing—often taking hours per video. AI Chitra explored whether a multi-stage GenAI pipeline + targeted human checkpoints could reduce cost/time while maintaining quality for:

  1. News shorts from current articles
  2. Narrative storytelling (e.g., India mythology & history for kids) in a consistent style

Product

  • Article ingestion & summarization: selected articles parsed and summarized into structured key points
  • Script generation: LLM-written scripts optimized for short-form pacing and clarity
  • Visual prompt generation: image prompts generated per script segment/scene
  • Generative image creation: scene images via Midjourney and DALL·E
  • AI avatar anchoring: avatar-based presenter for an "anchor-style" delivery
  • Human-in-the-loop quality gates:
    • Verify narrative correctness and flow
    • Curate/override generated images where needed
    • Final QA before publishing

Technical Stack

  • LLM layer: OpenAI + Gemini for extraction and summarization; OpenAI for scriptwriting + image prompt generation
  • Generative visuals: Midjourney + DALL·E for scene assets
  • Avatar/video tooling: AI avatar platform to generate presenter segments
  • Orchestration: ingest → summarize → script → prompt → images → avatar → assemble → QA → publish

Deployment

  • Operated as a semi-automated internal production workflow
  • Outputs were manually published after QA
  • Supported two tracks (news + storytelling) with repeatable "content runs"

Outcomes

Collaborators

Apurv Mehra

Partners

OpenAIGoogle Gemini