Skip to content

๐Ÿ“„ SmartSRT is a command-line tool for generating accurate subtitles with per-word timestamps. It uses WhisperAI for speech transcription, NVIDIA NeMo for diarization, and OpenCV for face recognition. The program is good at creating high accuracy subtitles. ๐ŸŽง๐Ÿ’ปโš™๏ธ

License

Notifications You must be signed in to change notification settings

KevinGeLe/SmartSRT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

4 Commits
ย 
ย 
ย 
ย 

Repository files navigation

SmartSRT

SmartSRT is my attempt, at an community run end-to-end solution for automatically generating subtitles for videos with a maximum subtitle length constraint, along with speaker diarization. It uses machine learning models for speech recognition, text summarization, face regocnition, and diarization, and will run on a CUDA GPU for faster performance.

Your image

# Getting Started You need to install the required dependencies and download the necessary models.\ โš ๏ธRemember that you have to use a CUDA supported GPU!โš ๏ธ

Clone the repository:
git clone https://github.com/KevinGeLe/SmartSRT.git
Install the required dependencies:
cd SmartSRT
pip install -r requirements.txt
The necessary models will be downloaded, by specifying the models that you want to use.

TODO:

  • Add Face recognition (ยฑ1 second)๐Ÿ‘ฑ
  • Add Max-Subtitle-Lenght with Per-Word-Timestamps:๐Ÿ•ฐ๏ธ
  • Add new parsers for Max-Lenght, Output, Input and Model๐Ÿ”
  • Refactor code to improve readability๐ŸŠ
  • Work on improving performance๐Ÿฆˆ
  • Update README.md๐Ÿ“‘

License

SmartSRT is released under the MIT license. See the LICENSE file for more information.

About

๐Ÿ“„ SmartSRT is a command-line tool for generating accurate subtitles with per-word timestamps. It uses WhisperAI for speech transcription, NVIDIA NeMo for diarization, and OpenCV for face recognition. The program is good at creating high accuracy subtitles. ๐ŸŽง๐Ÿ’ปโš™๏ธ

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published