Subgenie is an automated subtitle generation and embedding system for videos. It aims to make content more accessible for hearing-impaired and non-native speakers by providing an easy-to-use interface for generating and embedding subtitles in videos.
- Videos without subtitles are inaccessible to hearing-impaired and non-native speakers.
- Manual transcription is time-consuming and resource-intensive.
Develop an automated system to:
- Generate subtitles from any language input
- Embed subtitles into videos
- Provide an easy interface for users to upload videos, automatically generate subtitles, and download the final video with embedded subtitles
- 📤 Video upload and processing
- 🗣️ Automatic subtitle generation
- 📺 Subtitle embedding
- 🌐 Multi-language support
- ✂️ Video trimming and splitting
- 🎨 User-friendly design
- 🔄 Language translations
- ✏️ AI-generated subtitle editing
- ⚛️ Electron.js: Enables desktop application features
- ⚛️ React: Used for creating the UI components
- 🎞️ FFMPEG: Used for video-to-audio conversion and embedding subtitles into videos
- 🐍 Python (or 🟩 Node.js): Handles MP3-to-SRT conversion and communicates with the frontend
- ⚡ FastAPI: For handling API endpoints to model
- 🗣️ Speech-to-Text Model: Converts audio into subtitle format (SRT)
- User uploads a video file
- Audio is extracted and converted to MP3
- MP3 is sent to the backend for speech-to-text conversion into SRT
- The SRT file is sent back to the frontend
- The SRT is embedded into the video, and the user can download the subtitled video
- 📹 Aspiring Vloggers
- 🌟 Social Media Stars
- 🚀 Startup Companies
- Tech stack selection: Choosing the best tools for making native desktop apps
- UI Design Completion: User Interactive design in FIGMA
- Finding the Translational Models: Research on best language translational models
- Defining Features: Feature planning based on tech stack capabilities
- Development Started: Team split into frontend and backend development
- Completion of MVP: Development of all basic product features
- Burning subtitles to video using ffmpeg
- Limited access to premium translational models
- Deployment issues in various cloud providers