README

Paule Benchmarks

This will grow to a collection of benchmarks to compare the different flavours of the Predcitive Articulatory speech synthesis model Utelising Lexical Embeddings (Paule) package and to show its limitations and strength.

Human recordings benchmark

The human recordings benchmark compares the resynthesis quality for a small set of words (Lehrer, Wissenschaft, Liebe for now) between the recovering of the segment based synthesis and the resynthesis of the human recording.

Run

To run the benchmark install the paule package and execute:

python benchmark_human_recordings.py

The benchmark should finish in around ???? hours depending on your hardware specifications.

Results

Results of the benchmark can be found below results/benchmark_human_recordings/.

TODO

Benchmark: "minimal pair" / local contrast
(Benchmark: aber, also, oder)
Benchmark: babi, babu, baba

Questions

Predictive/Forward model comparison ("small" models -> bad gradients?)

Metrics

loss improvement production real (mse acoustics, mse semantics, cross corr / rank)
loss improvement prediction imagined
final production loss
final prediction loss
babi, babu, baba: formant transitions in first /a/; tongue raising
number of trainable parameters
execution time
evtl. training time

"Experiments"

segment based resynthesis vs. human recording
formant transitions in babi, babu, baba
semantically driven synthesis "Miete" -> "Miete", "Miete" -> "mitte", "mitte" -> "Miete" and, "mitte" -> "mitte"; Übergang visualisieren
semantically driven synthesis "aber" -> "oder" and "oder" -> "aber"??
cross-correlation loss vs. MSE loss
initilize "mitte" -> target 0% "mitte"-"Miete"
initilize "mitte" -> target 10% "mitte"-"Miete"
initilize "mitte" -> target 50% "mitte"-"Miete"
initilize "mitte" -> target 90% "mitte"-"Miete"
initilize "mitte" -> target 100% "mitte"-"Miete"

Hypothesis: duration of /i/ increases monotonically.

Story-Telling objective

possible to start from semvec (witout acoustics)
enhancement through embedder
https://github.com/lochenchou/MOSNet

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE		LICENSE
README.rst		README.rst
analyse_exp_seg_vs_record_embedder.py		analyse_exp_seg_vs_record_embedder.py
analyse_human_recordings.py		analyse_human_recordings.py
analyse_masterarbeit_paul.py		analyse_masterarbeit_paul.py
benchmark_human_recordings.py		benchmark_human_recordings.py
create_benchmark.py		create_benchmark.py
exp_masterarbeit_paul.py		exp_masterarbeit_paul.py
exp_seg_vs_record_embedder.py		exp_seg_vs_record_embedder.py
run_three.py		run_three.py
visualize_human_recordings.py		visualize_human_recordings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

Paule Benchmarks

Human recordings benchmark

Run

Results

TODO

Questions

Metrics

"Experiments"

Story-Telling objective

About

Releases

Packages

Languages

License

quantling/paule_benchmark

Folders and files

Latest commit

History

Repository files navigation

README

Paule Benchmarks

Human recordings benchmark

Run

Results

TODO

Questions

Metrics

"Experiments"

Story-Telling objective

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages