StorySpark AI
Link to open source: https://github.com/manishdwibedy/StorySpark-AI/
Link to Live Project: https://storyspark-ai-805085509940.us-west1.run.app
StorySpark AI: Text-to-Video Narrated Stories
Our hackathon project, StorySpark AI, transforms text prompts into narrated storybook videos. It uses Gemini 2.5 Flash Image for visuals and ElevenLabs for the audio.
Key features include:
- Gemini Integration: We use Gemini's character consistency to ensure characters maintain the same look throughout the story and prompt-based image editing for minor scene adjustments.
- ElevenLabs Audio: High-quality, human-like voice narration is generated for each page.
- Customization: Users can reorder or delete story frames to edit the flow.
- Background Music: The app offers three pre-made background music options and a custom upload feature.
- Video Assembly: Due to a persistent FFmpeg CORS error, automatic video creation was not possible. Instead, the application generates a downloadable ZIP file containing all the images and audio, allowing users to assemble the final video in their own editing software. A planned GIF creation feature also faced the same technical issue.
Please note the repo is private because of a leaked API key there, I can share your team members indiviudally as a collaborator.


