SmartSRT

SmartSRT is my attempt at a community-run, end-to-end solution for automatically generating subtitles for videos, with a maximum subtitle length constraint and speaker diarization. It uses machine learning models for speech recognition, text summarization, face recognition, and diarization, and runs on a CUDA GPU for faster performance.

# Getting Started

You need to install the required dependencies and download the necessary models.

⚠️ Remember that you need a CUDA-capable GPU! ⚠️

Clone the repository:
git clone https://github.com/KevinGeLe/SmartSRT.git
Install the required dependencies:
cd SmartSRT
pip install -r requirements.txt
The necessary models are downloaded once you specify which models you want to use.
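
Since a CUDA-capable GPU is required, it can help to check for one before installing anything heavy. The following is only a minimal sketch, not part of SmartSRT: the helper name `has_nvidia_gpu` is hypothetical, and it assumes the NVIDIA driver's `nvidia-smi` tool is on your `PATH`. It confirms that a driver and GPU are visible, not that the CUDA version matches what the dependencies expect.

```python
import shutil
import subprocess


def has_nvidia_gpu() -> bool:
    """Return True if nvidia-smi is on PATH and lists at least one GPU.

    Hypothetical helper for illustration; SmartSRT itself may check differently.
    """
    exe = shutil.which("nvidia-smi")
    if exe is None:
        return False
    try:
        # `nvidia-smi -L` prints one line per GPU, e.g. "GPU 0: <name> (UUID: ...)"
        result = subprocess.run(
            [exe, "-L"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0 and "GPU" in result.stdout


if __name__ == "__main__":
    print("NVIDIA GPU detected" if has_nvidia_gpu() else "No NVIDIA GPU detected")
```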

TODO:

Add Face recognition (±1 second)👱
Add Max-Subtitle-Length with Per-Word-Timestamps🕰️
Add new parsers for Max-Length, Output, Input and Model🔍
Refactor code to improve readability🐊
Work on improving performance🦈
Update README.md📑
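
To make the Max-Subtitle-Length item more concrete, here is a rough sketch of how per-word timestamps could be grouped into subtitles that respect a character limit and then written out in SRT form. This is not SmartSRT's implementation: the `Word` class, `group_words`, `format_timestamp`, `to_srt`, and the default `max_chars=42` are all assumptions made for illustration, and the greedy grouping ignores sentence boundaries and pauses.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """One recognized word with its start/end time in seconds (hypothetical type)."""
    text: str
    start: float
    end: float


def group_words(words: List[Word], max_chars: int = 42) -> List[List[Word]]:
    """Greedily pack words into cues whose joined text is at most max_chars long.

    A single word longer than max_chars still gets its own cue.
    """
    cues: List[List[Word]] = []
    current: List[Word] = []
    for word in words:
        candidate = " ".join(w.text for w in current + [word])
        if current and len(candidate) > max_chars:
            cues.append(current)
            current = [word]
        else:
            current.append(word)
    if current:
        cues.append(current)
    return cues


def format_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used in SRT files."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02}:{minutes:02}:{secs:02},{ms:03}"


def to_srt(cues: List[List[Word]]) -> str:
    """Render cues as SRT text: index, time range, then the subtitle line."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        text = " ".join(w.text for w in cue)
        start = format_timestamp(cue[0].start)
        end = format_timestamp(cue[-1].end)
        blocks.append(f"{index}\n{start} --> {end}\n{text}")
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    demo = [Word("Hello", 0.0, 0.4), Word("world", 0.5, 0.9), Word("again", 1.0, 1.5)]
    print(to_srt(group_words(demo, max_chars=11)))
```

Each cue takes its start time from its first word and its end time from its last word, which is why per-word timestamps are needed for this kind of splitting.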

License

SmartSRT is released under the MIT license. See the LICENSE file for more information.
