Open Translator: Speech-to-Speech and Speech-to-Text Translator with voice cloning and other features

It can run in Online mode (connected to the internet, making API calls to Google Translate; no API keys are needed),

or in Fully Local mode (using local LLMs),

or in Hybrid mode (a mix of Online mode and a single local LLM).

Features

  • Translate to and from 17 languages:

    • Supported languages: English, Spanish, French, German, Dutch, Japanese, Korean, Turkish, Arabic, Russian, Hebrew, Hindi, Italian, Portuguese, Chinese, Czech and Hungarian.

Options

  • File menu options:

    • Convert Audio file to MP3

    • Extract audio from Video

    • YouTube Downloader

    • Replace Audio in Video

    • Video Text Adder

    • Voice Recorder

    • PyTranscriber (shortcut)

    • Exit

Requirements

Make sure you have the following dependencies installed:

  • Python >= 3.10
  • Pip (Python package installer)
  • FFmpeg (must be installed manually and added to the system PATH environment variable)
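
The requirements above can be checked before installing anything else. The following is a minimal sketch and not part of the project: the helper name `check_requirements` is hypothetical, and it only tests the interpreter version and whether `pip` and an `ffmpeg` executable can be found.

```python
import importlib.util
import shutil
import sys

MIN_PYTHON = (3, 10)  # minimum version stated in the requirements list


def check_requirements(min_python=MIN_PYTHON):
    """Return a dict mapping each requirement to True/False (hypothetical helper)."""
    return {
        "python": sys.version_info[:2] >= min_python,
        # pip is importable as a module when it is installed for this interpreter
        "pip": importlib.util.find_spec("pip") is not None,
        # shutil.which returns None when no ffmpeg executable is found on PATH
        "ffmpeg": shutil.which("ffmpeg") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_requirements().items():
        print(f"{name}: {'OK' if ok else 'MISSING'}")
```

If `ffmpeg` is reported as missing even though it is installed, the folder containing the executable is probably not on PATH yet; opening a new terminal after editing PATH is usually needed.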

Usage

1- Clone the repository:

git clone https://github.com/overcrash66/OpenTranslator.git

2- Navigate to the project folder:

cd OpenTranslator

3- Create a virtual environment:

py -3.10 -m venv venv
venv\Scripts\activate
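
The two commands above are Windows-specific: `py` is the Windows Python launcher, and `venv\Scripts\activate` is the Windows location of the activation script. On Linux and macOS the standard `venv` layout puts the script under `bin` instead. The sketch below, which uses a hypothetical helper named `activate_command`, prints the activation command for the platform it runs on; it assumes the environment folder is called `venv`, as in the step above.

```python
import os
from pathlib import PurePosixPath, PureWindowsPath


def activate_command(env_dir="venv", os_name=os.name):
    """Return the shell command that activates a virtual environment."""
    if os_name == "nt":
        # Windows layout: <env>\Scripts\activate
        return str(PureWindowsPath(env_dir) / "Scripts" / "activate")
    # POSIX layout (Linux, macOS): the script is sourced from <env>/bin
    return "source " + str(PurePosixPath(env_dir) / "bin" / "activate")


print(activate_command())
```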

4- Install the required Python packages:

To use a GPU with CUDA 11.8:

For Python 3.10:

pip install torch==2.1.2+cu118 torchaudio==2.1.2+cu118 --index-url https://download.pytorch.org/whl/cu118

For Python 3.12:

pip install torch==2.2.1+cu118 torchaudio==2.2.1+cu118 --index-url https://download.pytorch.org/whl/cu118

Install MeCab from https://github.com/ikegami-yukino/mecab/releases

pip install -r requirements_Py312.txt

Or, for the default CPU-only setup:

pip install -r requirements.txt
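
After installation, it can help to confirm which PyTorch build ended up in the environment. This is a minimal sketch, not part of the project; `describe_torch_backend` is a hypothetical helper, and it handles the case where PyTorch is not installed at all.

```python
def describe_torch_backend():
    """Report which PyTorch build, if any, is importable and whether CUDA is usable."""
    try:
        import torch
    except ImportError:
        return "PyTorch is not installed in this environment"
    if torch.cuda.is_available():
        # Device 0 is the first visible GPU
        return f"PyTorch {torch.__version__} with CUDA (GPU: {torch.cuda.get_device_name(0)})"
    return f"PyTorch {torch.__version__} running on CPU only"


print(describe_torch_backend())
```

A version string ending in `+cu118` together with the CUDA message suggests the GPU wheels were picked up; the CPU-only message after a GPU install usually points to a driver problem or a CPU wheel having replaced the CUDA one.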

5- Run the script:

python OpenTranslator.py

Or run in Local mode (using a set of LLMs), which supports audio file translation only, through a web UI (Gradio):

python WebUI.py

GUI Preview

Redesigned (Custom)

Web

Configuration

  • You can customize the translation models and other settings by modifying the script.

License

This project is licensed under the GPL License - see the LICENSE file for details.

Acknowledgements

Special thanks to:

  • XTTS_V2
  • whisper v3 Large
  • Llama2-13b-Language-translate
  • autosub
  • gTTS