SmartSRT is my attempt, at an community run end-to-end solution for automatically generating subtitles for videos with a maximum subtitle length constraint, along with speaker diarization. It uses machine learning models for speech recognition, text summarization, face regocnition, and diarization, and will run on a CUDA GPU for faster performance.
# Getting Started You need to install the required dependencies and download the necessary models.\Clone the repository:
git clone https://github.com/KevinGeLe/SmartSRT.git
Install the required dependencies:
cd SmartSRT
pip install -r requirements.txt
The necessary models will be downloaded, by specifying the models that you want to use.
- Add Face recognition (ยฑ1 second)๐ฑ
- Add Max-Subtitle-Lenght with Per-Word-Timestamps:๐ฐ๏ธ
- Add new parsers for Max-Lenght, Output, Input and Model๐
- Refactor code to improve readability๐
- Work on improving performance๐ฆ
- Update README.md๐
SmartSRT is released under the MIT license. See the LICENSE file for more information.