Retold AI transforms full movies, long-form videos, or raw .mp4 files into AI-generated cinematic trailers and human-style story retellings.

Retold AI is an experimental multimodal system that transforms full-length movies and long-form videos into cinematic AI-generated trailers and human-style narrative summaries. Built on top of OpenAI's multimodal models, Retold analyzes an entire film, identifies emotional and narrative peaks, and retells the story in concise prose that resembles a film critic or storyteller.
The system also generates a short video edit using selected scenes, paired narration, and optional AI-generated music. The output includes a narrative text summary and a cinematic trailer suitable for archives, short-form content, and creative research.
# Windows only
git clone https://github.com/dfordp/retold-ai.git
cd retold-ai
python3 -m pip install -r requirements.txt
# Remove placeholder files
Get-ChildItem -Recurse -Filter ".placeholder" | Remove-Item
Retold uses OpenAI for multimodal story comprehension and ElevenLabs for optional voice narration. Users must provide their own API keys.
{
"openai_api_key": "YOUR_API_KEY",
"elevenlabs_api_key": "YOUR_ELEVENLABS_KEY"
}
Retold follows a four-stage pipeline inspired by video understanding research: ingest, extract, retell, and render.
output/
├── MovieName_summary.txt
└── MovieName_trailer.mp4
python3 main.pyA small GUI appears, allowing users to select movies, start generation, and view outputs in the /output directory.
Users may configure the YouTube API to automatically upload generated trailers. Only audited YouTube applications can publish public videos; unaudited workflows default to private uploads.
Each Retold run outputs a narrative summary and a one-minute cinematic edit based on selected scenes. These outputs support storytelling research, film metadata, content creation, and automated summarization workflows.
Retold emerged from a weekend experiment exploring whether short-form cinematic explainers could be automated using modern multimodal models. The project draws on research in video segmentation, emotion detection, and narrative extraction.
Retold was built as a creative exploration into whether AI can capture the emotional rhythm of a story, not just summarize plot points. The system analyzes entire films, identifies narrative beats, rewrites them in human-like prose, and then pairs the text with relevant scenes to form a cohesive short trailer. The result is an early experiment in automated cinematic storytelling.