Sep 16, 2025

StorySpark AI

devfestdelhi2025 elevenlabs gemini nano-banana google-cloud cloud-run

StorySpark AI: Text-to-Video Narrated Stories

 

Our hackathon project, StorySpark AI, transforms text prompts into narrated storybook videos. It uses Gemini 2.5 Flash Image for visuals and ElevenLabs for the audio.

Key features include:

  • Gemini Integration: We use Gemini's character consistency to ensure characters maintain the same look throughout the story and prompt-based image editing for minor scene adjustments.
  • ElevenLabs Audio: High-quality, human-like voice narration is generated for each page.
  • Customization: Users can reorder or delete story frames to edit the flow.
  • Background Music: The app offers three pre-made background music options and a custom upload feature.
  • Video Assembly: Due to a persistent FFmpeg CORS error, automatic video creation was not possible. Instead, the application generates a downloadable ZIP file containing all the images and audio, allowing users to assemble the final video in their own editing software. A planned GIF creation feature also faced the same technical issue.
 
Please note the repo is private because of a leaked API key there, I can share your team members indiviudally as a collaborator. 
Give a star to encourage!Discussion
Start a new conversation!
Login to join the discussion

More Builds by Manish Dwibedy

genai ai education edtech aiineducation