update pre-trained model with audio demos

keonlee9420 · Jul 27, 2021 · commit e38a0c4
1 parent e24cdde

Showing 35 changed files with 7 additions and 7 deletions.
diff --git a/.gitignore b/.gitignore
@@ -114,7 +114,7 @@ montreal-forced-aligner/
 raw_data/
 output/
 *.npy
-*.wav
+preprocessed_data**/*.wav
 TextGrid/
 hifigan/*.pth.tar
 *.out
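
The `.gitignore` hunk above is what lets the demo `.wav` files in this commit be tracked. A minimal Python sketch of why, using a simplified reading of the gitignore pattern rules (a pattern without a slash matches a file name at any depth, a pattern with a slash is anchored to the `.gitignore` directory, and asterisks not bounded by slashes do not cross `/`). `gitignore_like_match` is a hypothetical helper, not git's matcher; it ignores negation, directory patterns, and the slash-bounded `**` forms.

```python
import re


def gitignore_like_match(pattern: str, path: str) -> bool:
    """Approximate gitignore matching for simple file patterns (illustrative only)."""
    # Runs of asterisks that are not slash-bounded behave like a single '*'.
    collapsed = re.sub(r"\*+", "*", pattern)
    regex = "".join(
        "[^/]*" if ch == "*" else "[^/]" if ch == "?" else re.escape(ch)
        for ch in collapsed
    )
    if "/" in pattern.rstrip("/"):
        # A slash in the pattern anchors it to the directory holding .gitignore.
        return re.fullmatch(regex, path) is not None
    # No slash: compare against the file name only, at any depth.
    return re.fullmatch(regex, path.rsplit("/", 1)[-1]) is not None


demo_wav = "demo/LJSpeech/250000/In some yards.wav"
print(gitignore_like_match("*.wav", demo_wav))                      # old rule
print(gitignore_like_match("preprocessed_data**/*.wav", demo_wav))  # new rule
```

Under this reading the old `*.wav` rule ignores the demo audio and the new rule does not. The same reading suggests the new pattern covers only `.wav` files directly inside a `preprocessed_data*` directory, so nested paths are worth confirming with `git check-ignore -v <path>`.
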
diff --git a/README.md b/README.md
@@ -20,11 +20,11 @@ pip3 install -r requirements.txt
 
 ## Inference
 
-You have to download the [pretrained models]() and put them in ``output/ckpt/LJSpeech/``.
+You have to download the [pretrained models](https://drive.google.com/drive/folders/1Kzh3AxVl5cpVixs18-eDDPnKOsdp8Ep9?usp=sharing) and put them in ``output/ckpt/LJSpeech/``.
 
 For English single-speaker TTS, run
 ```
-python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step 900000 --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
+python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step RESTORE_STEP --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
 ```
 The generated utterances will be put in ``output/result/``.
 
@@ -33,7 +33,7 @@ The generated utterances will be put in ``output/result/``.
 Batch inference is also supported, try
 
 ```
-python3 synthesize.py --source preprocessed_data/LJSpeech/val.txt --restore_step 900000 --mode batch -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
+python3 synthesize.py --source preprocessed_data/LJSpeech/val.txt --restore_step RESTORE_STEP --mode batch -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml
 ```
 to synthesize all utterances in ``preprocessed_data/LJSpeech/val.txt``
 
@@ -42,7 +42,7 @@ The speaking rate of the synthesized utterances can be controlled by specifying
 For example, one can increase the speaking rate by 20 % by
 
 ```
-python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step 900000 --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml --duration_control 0.8
+python3 synthesize.py --text "YOUR_DESIRED_TEXT" --restore_step RESTORE_STEP --mode single -p config/LJSpeech/preprocess.yaml -m config/LJSpeech/model.yaml -t config/LJSpeech/train.yaml --duration_control 0.8
 ```
 
 # Training
@@ -100,11 +100,11 @@ tensorboard --logdir output/log/LJSpeech
 ```
 
 to serve TensorBoard on your localhost.
-<!-- The loss curves, synthesized mel-spectrograms, and audios are shown.
+The loss curves, synthesized mel-spectrograms, and audios are shown.
 
 ![](./img/tensorboard_loss.png)
 ![](./img/tensorboard_spec.png)
-![](./img/tensorboard_audio.png) -->
+![](./img/tensorboard_audio.png)
 
 # Implementation Issues
 

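The `--duration_control 0.8` example in the README hunk above can be checked with a little arithmetic. The sketch below assumes, without confirmation from this diff, that the flag multiplies each predicted phoneme duration (counted in mel frames). `scale_durations` and the frame counts are hypothetical.

```python
def scale_durations(durations, duration_control):
    """Scale per-phoneme frame counts by a control factor (assumed behavior)."""
    return [max(0, round(d * duration_control)) for d in durations]


frames = [5, 10, 20, 15]               # hypothetical predicted durations
scaled = scale_durations(frames, 0.8)  # same factor as the README example

length_ratio = sum(scaled) / sum(frames)  # fraction of the original length
rate_ratio = sum(frames) / sum(scaled)    # relative speaking rate
print(scaled, length_ratio, rate_ratio)
```

Under that assumption a factor of 0.8 makes the utterance 20 % shorter, which corresponds to a speaking rate of 1.25 times the original.
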
diff --git a/.../But there were few cases so remarkable as the great ones already recorded..png b/.../But there were few cases so remarkable as the great ones already recorded..png
diff --git a/...ech/250000/But there were few cases so remarkable as the great ones already recorded..wav b/...ech/250000/But there were few cases so remarkable as the great ones already recorded..wav
diff --git a/demo/LJSpeech/250000/Here are the match lineups for the Colombia Haiti match..png b/demo/LJSpeech/250000/Here are the match lineups for the Colombia Haiti match..png
diff --git a/demo/LJSpeech/250000/Here are the match lineups for the Colombia Haiti match..wav b/demo/LJSpeech/250000/Here are the match lineups for the Colombia Haiti match..wav
diff --git a/demo/LJSpeech/250000/In some yards.png b/demo/LJSpeech/250000/In some yards.png
diff --git a/demo/LJSpeech/250000/In some yards.wav b/demo/LJSpeech/250000/In some yards.wav
diff --git a/... night in Bridgeport expect a temperature of minus four degrees Fahrenheit..png b/... night in Bridgeport expect a temperature of minus four degrees Fahrenheit..png
diff --git a/.../On Friday night in Bridgeport expect a temperature of minus four degrees Fahrenheit..wav b/.../On Friday night in Bridgeport expect a temperature of minus four degrees Fahrenheit..wav
diff --git a/demo/LJSpeech/250000/The central criminal court, when the trial came on,.png b/demo/LJSpeech/250000/The central criminal court, when the trial came on,.png
diff --git a/demo/LJSpeech/250000/The central criminal court, when the trial came on,.wav b/demo/LJSpeech/250000/The central criminal court, when the trial came on,.wav
diff --git a/demo/LJSpeech/250000/Weekends at twenty three fifty..png b/demo/LJSpeech/250000/Weekends at twenty three fifty..png
diff --git a/demo/LJSpeech/250000/Weekends at twenty three fifty..wav b/demo/LJSpeech/250000/Weekends at twenty three fifty..wav
diff --git a/...ard the sirens was johnny calvin brewer, manager of hardy's shoestore, a fe.png b/...ard the sirens was johnny calvin brewer, manager of hardy's shoestore, a fe.png
diff --git a/...ons who heard the sirens was johnny calvin brewer, manager of hardy's shoestore, a fe.wav b/...ons who heard the sirens was johnny calvin brewer, manager of hardy's shoestore, a fe.wav
diff --git a/demo/LJSpeech/250000/testing testing testing!.png b/demo/LJSpeech/250000/testing testing testing!.png
diff --git a/demo/LJSpeech/250000/testing testing testing!.wav b/demo/LJSpeech/250000/testing testing testing!.wav
diff --git a/demo/LJSpeech/250000/testing testing testing.png b/demo/LJSpeech/250000/testing testing testing.png
diff --git a/demo/LJSpeech/250000/testing testing testing.wav b/demo/LJSpeech/250000/testing testing testing.wav
diff --git a/.../But there were few cases so remarkable as the great ones already recorded..png b/.../But there were few cases so remarkable as the great ones already recorded..png
diff --git a/...ech/500000/But there were few cases so remarkable as the great ones already recorded..wav b/...ech/500000/But there were few cases so remarkable as the great ones already recorded..wav
diff --git a/demo/LJSpeech/500000/Here are the match lineups for the Colombia Haiti match..png b/demo/LJSpeech/500000/Here are the match lineups for the Colombia Haiti match..png
diff --git a/demo/LJSpeech/500000/Here are the match lineups for the Colombia Haiti match..wav b/demo/LJSpeech/500000/Here are the match lineups for the Colombia Haiti match..wav
diff --git a/demo/LJSpeech/500000/In some yards.png b/demo/LJSpeech/500000/In some yards.png
diff --git a/demo/LJSpeech/500000/In some yards.wav b/demo/LJSpeech/500000/In some yards.wav
diff --git a/... night in Bridgeport expect a temperature of minus four degrees Fahrenheit..png b/... night in Bridgeport expect a temperature of minus four degrees Fahrenheit..png
diff --git a/.../On Friday night in Bridgeport expect a temperature of minus four degrees Fahrenheit..wav b/.../On Friday night in Bridgeport expect a temperature of minus four degrees Fahrenheit..wav
diff --git a/demo/LJSpeech/500000/The central criminal court, when the trial came on,.png b/demo/LJSpeech/500000/The central criminal court, when the trial came on,.png
diff --git a/demo/LJSpeech/500000/The central criminal court, when the trial came on,.wav b/demo/LJSpeech/500000/The central criminal court, when the trial came on,.wav
diff --git a/demo/LJSpeech/500000/Weekends at twenty three fifty..png b/demo/LJSpeech/500000/Weekends at twenty three fifty..png
diff --git a/demo/LJSpeech/500000/Weekends at twenty three fifty..wav b/demo/LJSpeech/500000/Weekends at twenty three fifty..wav
diff --git a/img/tensorboard_audio.png b/img/tensorboard_audio.png
diff --git a/img/tensorboard_loss.png b/img/tensorboard_loss.png
diff --git a/img/tensorboard_spec.png b/img/tensorboard_spec.png
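
The demo files listed above follow a `demo/LJSpeech/<step>/<text>.<ext>` layout, with a `.wav` and a `.png` per utterance. A small sketch for grouping such paths by training step, for example to compare the 250000 and 500000 outputs side by side. `group_demo_files` is a hypothetical helper, and reading the numeric folder name as a training step is an assumption based on the paths in this commit.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def group_demo_files(paths):
    """Map step -> utterance text -> set of file extensions found for it."""
    grouped = defaultdict(lambda: defaultdict(set))
    for raw in paths:
        path = PurePosixPath(raw)
        step = int(path.parent.name)  # folder name assumed to be the step
        # Only the last suffix is split off, so a sentence ending in '.'
        # keeps its final period in the stem.
        grouped[step][path.stem].add(path.suffix)
    return grouped


paths = [
    "demo/LJSpeech/250000/In some yards.wav",
    "demo/LJSpeech/250000/In some yards.png",
    "demo/LJSpeech/500000/Weekends at twenty three fifty..wav",
]
for step, utterances in sorted(group_demo_files(paths).items()):
    print(step, {text: sorted(exts) for text, exts in utterances.items()})
```
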