Array2Binaural

This repository contains the code to reproduce the figures (instrumental evaluation and experiment results) in the following paper:

@inproceedings{stahl2023perceptual2,
    title = {Perceptual Comparison of Dynamic Binaural Reproduction Methods for Head-Mounted Microphone Arrays},
    author = {Stahl, Benjamin and Riedel, Stefan},
    booktitle = {Proc. 155th Audio Engineering Society Convention},
    year = {2023}}

A demo of the listening experiment stimuli is available here: https://array2binaural_demo.iem.sh/.

Setup

Environment

Create a Python 3.11 environment and make sure pip is available in it.
Install the packages listed in requirements.txt using pip.

Download third-party data

Download the Easycom array impulse response data from https://spear2022data.blob.core.windows.net/spear-data/Device_ATFs.h5 into origin_array_tf_data/.
Download the boundary element method (BEM)-simulated array transfer functions by McCormack et al. from https://zenodo.org/records/6401603/files/HMD_SensorArrayResponses.mat into origin_array_tf_data/.
Download the EBU-SQAM snippets by running simulate_scenarios_and_mic_signals/utils/download_and_cut_ebu_sqam.py.
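The two file downloads listed above could also be scripted. The following is a minimal sketch, not part of the repository: the helper names target_path and download_if_missing are hypothetical, while the URLs and the target folder are taken from the list above.

```python
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlretrieve

# URLs and target folder as given in the download list above.
DATA_URLS = [
    "https://spear2022data.blob.core.windows.net/spear-data/Device_ATFs.h5",
    "https://zenodo.org/records/6401603/files/HMD_SensorArrayResponses.mat",
]
TARGET_DIR = Path("origin_array_tf_data")


def target_path(url, target_dir=TARGET_DIR):
    """Hypothetical helper: local path for a URL, keeping the remote file name."""
    return Path(target_dir) / Path(urlparse(url).path).name


def download_if_missing(url, target_dir=TARGET_DIR):
    """Hypothetical helper: download url into target_dir unless it already exists."""
    destination = target_path(url, target_dir)
    if not destination.exists():
        destination.parent.mkdir(parents=True, exist_ok=True)
        urlretrieve(url, str(destination))
    return destination


# Show where each file would be stored (no network access happens here).
for url in DATA_URLS:
    print(url, "->", target_path(url))
```

Calling download_if_missing(url) for each entry of DATA_URLS then fetches the files that are not yet present.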

Preprocessing

Convert the array transfer functions to the SH domain

Run the script encode_array_into_sh.py to encode the Easycom array transfer functions into the spherical harmonic (SH) domain. This creates the file Easycom_array_32000Hz_o25_22samps_delay.npy.
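To illustrate what encoding transfer functions into the SH domain means, here is a small sketch of one common approach, a least-squares fit of SH coefficients to direction-dependent responses. It is not necessarily what encode_array_into_sh.py does: the sketch is limited to first order, works on synthetic data for a single frequency bin, and all function names, the ACN/N3D convention, and the array shapes are assumptions made for this example.

```python
import numpy as np


def real_sh_first_order(azimuth, elevation):
    """Real SH up to order 1 (ACN channel order, N3D normalization), angles in radians."""
    x = np.cos(elevation) * np.cos(azimuth)
    y = np.cos(elevation) * np.sin(azimuth)
    z = np.sin(elevation)
    return np.stack(
        [np.ones_like(x), np.sqrt(3) * y, np.sqrt(3) * z, np.sqrt(3) * x], axis=-1
    )


def encode_into_sh(transfer_functions, azimuth, elevation):
    """Least-squares fit: find C such that Y @ C approximates the transfer functions.

    transfer_functions has the assumed shape (num_directions, num_mics)
    and holds the responses of one frequency bin.
    """
    sh_matrix = real_sh_first_order(azimuth, elevation)  # (num_directions, 4)
    return np.linalg.pinv(sh_matrix) @ transfer_functions  # (4, num_mics)


# Synthetic check: build responses from known coefficients and recover them.
rng = np.random.default_rng(0)
num_directions, num_mics = 50, 4  # arbitrary example values
azimuth = rng.uniform(-np.pi, np.pi, num_directions)
elevation = np.arcsin(rng.uniform(-1.0, 1.0, num_directions))
true_coeffs = rng.standard_normal((4, num_mics)) + 1j * rng.standard_normal((4, num_mics))
transfer_functions = real_sh_first_order(azimuth, elevation) @ true_coeffs

estimated_coeffs = encode_into_sh(transfer_functions, azimuth, elevation)
print(np.allclose(estimated_coeffs, true_coeffs))
```

A real array response is not order-limited, so a fit at a higher order (such as the order 25 mentioned in this README) leaves a residual error and may need regularization.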

Instrumental Evaluation

To compute the resulting ILDs and ITDs and render the corresponding figures into figures/, run the scripts ild_itd_analysis/compute_filters.py and ild_itd_analysis/evaluate_ilds_itds.py.
To carry out the MUSIC simulation study, which investigates the inherent localization robustness of different arrays, run the script simulation_study_music.py. This also creates a figure in figures/.
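As background for the ILD and ITD analysis mentioned above, the following sketch shows one simple way to estimate a broadband interaural level difference and an interaural time difference from a pair of ear impulse responses. It is an illustration under stated assumptions, not the method implemented in ild_itd_analysis/: the function names are hypothetical, the ITD is taken from the cross-correlation peak, and the sample rate is only an example value.

```python
import numpy as np


def broadband_ild_db(left, right):
    """ILD in dB: energy of the left-ear response relative to the right-ear response."""
    return 10.0 * np.log10(np.sum(left**2) / np.sum(right**2))


def cross_correlation_itd(left, right, sample_rate):
    """ITD in seconds, taken from the peak of the interaural cross-correlation.

    Negative values mean that the left-ear signal arrives first.
    """
    correlation = np.correlate(left, right, mode="full")
    lags = np.arange(-(len(right) - 1), len(left))
    return lags[np.argmax(np.abs(correlation))] / sample_rate


# Toy impulse responses: the right ear is 5 samples later and half as loud.
sample_rate = 32000  # example value
left = np.zeros(64)
left[10] = 1.0
right = np.zeros(64)
right[15] = 0.5

ild = broadband_ild_db(left, right)
itd = cross_correlation_itd(left, right, sample_rate)
print(f"ILD: {ild:.2f} dB, ITD: {itd * 1e6:.1f} microseconds")
```

In practice, ILDs are often evaluated per frequency band and ITDs on low-pass filtered responses; this sketch skips both steps.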

Evaluation of the listening experiment results

Run evaluate_raw_experiment_results.py and evaluate_difference_experiment_results.py to produce the visualizations of the listening experiment response data shown in the paper.

Instructions for computing a set of end-to-end magnitude-least-squares filters

Run compute_emagls_filters/compute_emagls2_for_rotations.py to compute magnitude-least-squares (MLS) filters for a fine grid of rotations. This can take a while. Note that we do not include a real-time magnitude-least-squares rendering VST application. However, you could build your own using the computed filters.
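If you want to build your own renderer from the computed filters, the core operations are selecting the filter set for the rotation closest to the current head orientation and applying it to the microphone signals. The sketch below shows this for a static orientation. Everything in it is an assumption for illustration: the function names, the yaw-only rotation grid, and the time-domain filter layout (num_rotations, num_mics, 2, filter_length) are hypothetical and may not match the files written by the script.

```python
import numpy as np


def nearest_rotation_index(yaw_grid_deg, yaw_deg):
    """Index of the grid rotation closest to yaw_deg, with wrap-around at 360 degrees."""
    difference = (np.asarray(yaw_grid_deg) - yaw_deg + 180.0) % 360.0 - 180.0
    return int(np.argmin(np.abs(difference)))


def render_binaural(mic_signals, filters):
    """Filter-and-sum rendering for one fixed rotation.

    mic_signals: (num_mics, num_samples)
    filters:     (num_mics, 2, filter_length), one FIR filter per microphone and ear
    returns:     (2, num_samples + filter_length - 1)
    """
    num_mics, num_samples = mic_signals.shape
    filter_length = filters.shape[-1]
    output = np.zeros((2, num_samples + filter_length - 1))
    for mic in range(num_mics):
        for ear in range(2):
            output[ear] += np.convolve(mic_signals[mic], filters[mic, ear])
    return output


# Toy data: 4 rotations, 2 microphones, unit-impulse filters,
# so each ear signal is simply the sum of the microphone signals.
yaw_grid_deg = np.array([0.0, 90.0, 180.0, 270.0])
filter_bank = np.zeros((4, 2, 2, 8))  # (num_rotations, num_mics, ears, taps)
filter_bank[:, :, :, 0] = 1.0
mic_signals = np.array([[1.0, 2.0, 3.0], [0.5, 0.5, 0.5]])

index = nearest_rotation_index(yaw_grid_deg, 350.0)
binaural = render_binaural(mic_signals, filter_bank[index])
print(index, binaural.shape)
```

A real-time processor would additionally have to process audio block by block and crossfade between filter sets when the head orientation changes.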

Instructions for Audio Stimulus Generation and

This step creates Ambisonic audio stimuli for 3DoF rendering. Additionally, the residual microphone signals are saved; they can be used for end-to-end magnitude-least-squares residual rendering.

Simulate the scenarios and microphone signals

Run the script simulate_scenarios_and_mic_signals/generate_stimuli.py. This will create six 25th-order Ambisonic reference WAV files in simulate_scenarios_and_mic_signals/audio_o25/.
Run the script simulate_scenarios_and_mic_signals/create_mic_signals.py. This will create simulated microphone signals in the simulate_scenarios_and_mic_signals/rendered_mic folder.
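As a quick sanity check on the generated files: a full three-dimensional Ambisonic signal of order N has (N + 1)^2 channels. The short sketch below evaluates this for the orders that appear in this README; the function name is hypothetical.

```python
def num_ambisonic_channels(order):
    """Channel count of a full 3D Ambisonic signal of the given order."""
    return (order + 1) ** 2


# Orders that appear in this README: 1st (FOA), 5th, and 25th order.
for order in (1, 5, 25):
    print(f"order {order}: {num_ambisonic_channels(order)} channels")
```

Assuming the reference files contain all channels of a 25th-order signal, each of them should therefore have 676 channels.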

Compute 1st/5th-order Ambisonic signals for FOA encoding/decoding, BFBR, and DOA-informed BF+residual rendering

Run the script beamform_array2amb.py to apply FOA encoding as well as beamforming followed by 5th-order encoding, and to write the output files into beamformed_amb/. These files can be rendered binaurally using Ambisonic binaural decoders. For BFBR and DOA-informed BF+residual rendering, this is a convenience step that we used in our experiment: we rendered with the SceneRotator and the 5th-order BinauralDecoder of the IEM-Plugin Suite, so we did not have to write a real-time processor that selects and convolves HRIRs according to the listener orientation.
