Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features

Can be run as Online mode ( connected to internet and make api calls using Google translate - No api keys are needed )

Or Fully Locally (using local LLM's)

Or as a Hybrid mode (mix of Online and one single Local LLM )

Features

Translate from and to 17 Languages :
- The translator supports various languages, including English, Spanish, French, German, Dutch , Japanese, Korean, Turkish, Arabic, Russian, Hebrew, Hindi, Italian, Portuguese, Chinese, Czech and Hungarian.

Options

File Menu available options:
Convert Audio file to MP3
Extract audio from Video
YouTube Downloader
Replace Audio in Video
Video Text Adder
Voice Recorder
PyTranscriber (shortcut)
Exit

Requirements

Make sure you have the following dependencies installed:

Python >= 3.10
Pip (Python package installer)
FFmpeg #Should be installed manually and added to sys env path

Usage

1- Clone the repository:

git clone https://github.com/overcrash66/OpenTranslator.git

2- Navigate to folder:

cd OpenTranslator

3- Create a vitrual env:

py -3.10 -m venv venv

venv\Scripts\activate

4- Install the required Python packages using:

If you would like to use CUDA 118 - GPU:

PY -3.10

pip install torch==2.1.2+cu118 torchaudio==2.1.2+cu118 --index-url https://download.pytorch.org/whl/cu118

PY -3.12

pip install torch==2.2.1+cu118 torchaudio==2.2.1+cu118 --index-url https://download.pytorch.org/whl/cu118

Install mecab https://github.com/ikegami-yukino/mecab/releases

pip install -r requirements_Py312.txt

OR by default you use CPU only:

pip install -r requirements.txt

5- Run the Script:

python OpenTranslator.py

Or Local mode (using a set of LLM's) for audio file translation only, using a WEB UI (Gradio)

python WebUI.py

GUI Preview

Configuration

You can customize the translation models and other settings by modifying the script.

License

This project is licensed under the GPL License - see the LICENSE file for details.

Acknowledgements

Special thanks to: XTTS_V2 whisper v3 Large Llama2-13b-Language-translate autosub gTTS

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

readme.md

Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features

Can be run as Online mode ( connected to internet and make api calls using Google translate - No api keys are needed )

Or Fully Locally (using local LLM's)

Or as a Hybrid mode (mix of Online and one single Local LLM )

Features

Options

Requirements

Usage

GUI Preview

Configuration

License

Acknowledgements

Files

readme.md

Latest commit

History

readme.md

File metadata and controls

Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features

Can be run as Online mode ( connected to internet and make api calls using Google translate - No api keys are needed )

Or Fully Locally (using local LLM's)

Or as a Hybrid mode (mix of Online and one single Local LLM )

Features

Options

Requirements

Usage

GUI Preview

Configuration

License

Acknowledgements