Any2English - Multi-Lingual Video to English Dub with Lip Sync

This project is made in collaboration with @hamzakhalil798

Overview

Any2English, the Multi-Lingual Video Dubber, dubs videos from other languages into English and lip-syncs the result. This repository contains the code and resources for the pipeline. It combines three components: Whisper for transcribing speech and translating it into English text, Tortoise TTS for text-to-speech with selectable voices, and Wav2Lip for lip synchronization. Because these models are resource-intensive, the current version only supports short videos.

How it Works

  1. Audio Extraction: The audio track is extracted from the input video file.

  2. Language Translation: The extracted audio is passed through Whisper, a speech recognition model that can also translate speech, to produce an English text transcript.

  3. Text-to-Speech Conversion: Tortoise TTS is employed to convert the English text into speech. By default, the Tom voice is used, but users have the option to choose from a selection of other voices.

  4. Lip Synchronization: Wav2Lip, a lip synchronization model, is used to sync the generated English speech with the input video, ensuring that the dubbed video's lip movements match the spoken words.
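
The four steps above can be sketched as a small Python pipeline. This is a minimal illustration, not the notebook's actual code: the helper names (`ffmpeg_extract_cmd`, `translate_to_english`, `synthesize`, `wav2lip_cmd`, `dub`), the working directory, the Whisper model size, the Tortoise preset, and the checkpoint path are all assumptions. It also assumes `ffmpeg`, the `openai-whisper` package, Tortoise TTS, and a local Wav2Lip checkout are installed.

```python
from __future__ import annotations

import subprocess
from pathlib import Path


def ffmpeg_extract_cmd(video: str, wav: str) -> list[str]:
    """Step 1: ffmpeg command that drops the video stream and writes mono 16 kHz WAV."""
    return ["ffmpeg", "-y", "-i", video, "-vn",
            "-acodec", "pcm_s16le", "-ar", "16000", "-ac", "1", wav]


def translate_to_english(wav: str, model_size: str = "small") -> str:
    """Step 2: Whisper's "translate" task returns English text for the speech."""
    import whisper  # heavy dependency, imported only when needed

    model = whisper.load_model(model_size)  # model size is an assumption
    return model.transcribe(wav, task="translate")["text"].strip()


def synthesize(text: str, wav_out: str, voice: str = "tom") -> None:
    """Step 3: Tortoise TTS speaks the English text in the chosen voice."""
    import torchaudio
    from tortoise.api import TextToSpeech
    from tortoise.utils.audio import load_voice

    voice_samples, conditioning_latents = load_voice(voice)
    speech = TextToSpeech().tts_with_preset(
        text,
        voice_samples=voice_samples,
        conditioning_latents=conditioning_latents,
        preset="fast",  # assumed preset, trades quality for speed
    )
    torchaudio.save(wav_out, speech.squeeze(0).cpu(), 24000)  # Tortoise emits 24 kHz audio


def wav2lip_cmd(video: str, speech_wav: str, out_video: str, checkpoint: str) -> list[str]:
    """Step 4: command line for Wav2Lip's inference.py script."""
    return ["python", "inference.py", "--checkpoint_path", checkpoint,
            "--face", video, "--audio", speech_wav, "--outfile", out_video]


def dub(video: str, workdir: str = "work", wav2lip_dir: str = "Wav2Lip",
        checkpoint: str = "checkpoints/wav2lip_gan.pth") -> Path:
    """Run all four steps; every path default here is hypothetical."""
    work = Path(workdir).resolve()
    work.mkdir(parents=True, exist_ok=True)
    source_wav = str(work / "source.wav")
    english_wav = str(work / "english.wav")
    out_video = work / "dubbed.mp4"

    subprocess.run(ffmpeg_extract_cmd(video, source_wav), check=True)
    synthesize(translate_to_english(source_wav), english_wav)
    # inference.py runs from inside the Wav2Lip checkout, so pass absolute paths;
    # the checkpoint path stays relative to that checkout.
    subprocess.run(
        wav2lip_cmd(str(Path(video).resolve()), english_wav, str(out_video), checkpoint),
        check=True, cwd=wav2lip_dir,
    )
    return out_video
```

Calling `dub("input.mp4")` would run the whole chain. Long transcripts may need to be split into sentences before synthesis, which this sketch does not attempt.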

Usage

Because the pipeline is resource-intensive, the current version supports only short videos. To use the tool, follow these steps:

  1. Upload the repo to your Google Drive.
  2. Upload your input video files to the drive.
  3. Open the provided notebook (any2English.ipynb) in Google Colab and follow the instructions outlined there to dub your video.
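
Inside Colab, the notebook needs access to the files uploaded in steps 1 and 2. A minimal sketch of how that might look, assuming the default mount point `/content/drive`: `drive.mount` is Colab's own API, while the helper names `mount_drive` and `list_videos` and the folder `Any2English/videos` are hypothetical and should be adjusted to wherever the repo and videos were actually uploaded.

```python
from __future__ import annotations

from pathlib import Path

# Where "My Drive" appears after mounting at the default mount point.
DRIVE_ROOT = Path("/content/drive/MyDrive")
# Hypothetical upload location; change it to match your own Drive layout.
VIDEO_DIR = DRIVE_ROOT / "Any2English" / "videos"


def mount_drive(mount_point: str = "/content/drive") -> None:
    """Mount Google Drive; only works inside a Colab runtime."""
    from google.colab import drive  # not importable outside Colab

    drive.mount(mount_point)


def list_videos(folder: str | Path,
                suffixes: tuple[str, ...] = (".mp4", ".mov", ".mkv")) -> list[Path]:
    """Return the video files in `folder`, sorted by path."""
    return sorted(p for p in Path(folder).iterdir()
                  if p.is_file() and p.suffix.lower() in suffixes)
```

In a notebook cell, `mount_drive()` followed by `list_videos(VIDEO_DIR)` would show which input videos are visible to the runtime.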

Results

Input

video_1.mp4
video_2.mp4

Output

video1_result.mp4
video2_output.mp4

Acknowledgements

  • Whisper - The speech recognition and translation model used to translate the audio into English text.
  • Tortoise TTS - The text-to-speech library used to generate the English audio.
  • Wav2Lip - The lip-sync model used to synchronize the generated voice with the input video.

Contributing

Contributions to this project are welcome. If you find a problem or want to add a feature, feel free to open an issue or a pull request.

TODOs

  • Allow bigger video files to be uploaded
  • Dub into other languages