Query generation questions #14

stdio2016 · 2021-07-13T09:12:20Z

I tried to reproduce your paper, and my code is https://github.com/stdio2016/pfann .
In my code, I generate queries by:

Randomly slice a x second segment from test music, x is query length
Add one noise file to this segment
Add 2 IRs to this segment, one is for room reverb, and the other is for microphone IR
Save this segment as query file

It seems that your code does these:

Split test music into 1 second segments
For each segment:
Randomly time shift the segment within +/-0.2s
Add one noise segment to the segment. The added noises of each segment seem to maintain time order.
Add one IR file to the segment.
Concatenate these segments and save as query file

My question:

Why do you add different IR to different 1s segments of the same query file? I do not think that the reverb environment would change every 1s.
I use random slicing to simulate query start time, while you randomly shift each 1s segment independently. Isn't uniform time shifting enough?
In your paper, you said "microphone and room impulse response (IR) are sequentially applied by convolution operation." However, I can only find one convolution operation per segment. How do you apply 2 IRs (microphone and room IRs) in one convolution? Do you merge these two datasets, or preprocess so that every new IR is a combination of one microphone and one room IR?

mimbres · 2021-07-13T14:12:55Z

@stdio2016 Hi, Yi-feng. Thank you for questions and reviewing my code. Your implementation looks awesome! I was really waiting for someone else to implement it with PyTorch. I'm on my way to fork it.

I completely agree with your idea that changing IR every 1s cannot be a realistic test for the sequence (2~10s) search task . Perhaps the test from my implementation can be a slightly more difficult task than actual. But no guarantees. There is no reason other than my oversimplifying mistake in this implementation. Although I focused more on 1s segment-level search in the work, my implementation for evaluating sequence search task needs improvement on the points you made.
Yes, uniform-random start time is enough and the most desired one. In my implementation, I first generate a list of 1s-segments using 0.5s overlapping windows. Then applying +/-0.2s time offset as in the training. The resulting queries will cover only 80% of all possible start times. If I applied +/- 0.25s time offset, it could be much more like uniform random start time. However, due to my laziness, I reused the training data pipeline in the test again. As you mentioned, generating sequence queries with independent sampling of start time is the best way. So I feel your implementation (I didn't review yet) must be more correct.
I merged 2 IR filters for simplicity. I think it was just fine for the test. I believe the data pipeline in this repo is exactly reproducing the experiment. However, Its known drawback is that the pre-processed 300+ IRs would not provide enough randomness for training. I have a plan to improve it in the upcoming data pipeline.

I hope that my answer is a supplement to the points that were not clear in the paper and this repo. Thanks for reminding me of the existing problem of unrealistic test set generation. I have a plan to revise the code reflecting your opinion. I will update answer if I missed any points.

mimbres self-assigned this Aug 28, 2021

mimbres added the question Further information is requested label Aug 28, 2021

stdio2016 mentioned this issue Dec 26, 2021

请问论文当中地标法有开源吗？ stdio2016/pfann#1

Open

mimbres mentioned this issue May 2, 2022

Questions about inquiries #24

Open

mimbres added the test setup about test setup label May 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query generation questions #14

Query generation questions #14

stdio2016 commented Jul 13, 2021

mimbres commented Jul 13, 2021 •

edited

Loading

Query generation questions #14

Query generation questions #14

Comments

stdio2016 commented Jul 13, 2021

mimbres commented Jul 13, 2021 • edited Loading

mimbres commented Jul 13, 2021 •

edited

Loading