Add a feature to generate frequency domain waveforms at reduced frequencies #4948

Kanchan-05 · 2024-11-19T23:34:20Z

Standard information about the request

This is a: new feature

This change affects: pycbc_brute_bank script

This change: has appropriate unit tests, follows style guidelines (See e.g. PEP8), has been proposed using the contribution guidelines

Motivation

Template bank generation using pycbc_brute_bank is computationally intensive for very long-duration signals, primarily due to the match calculations. The computation time can be reduced by generating frequency-domain signals at lower frequencies specifically for the match calculations.

Links to any issues or associated PRs

N/A

Testing performed

N/A

Additional notes

The author of this pull request confirms they will adhere to the code of conduct

…quencies

ahnitz · 2024-11-19T23:37:11Z

The main purpose of this is for time domain signals since they must be generated at full resolution first before conversion to the frequency-domain. This adds the ability to then decimate their frequency-domain representation and use that for match calculations.

We do this already for frequency-domain waveform by just setting the buffer length to something shorter than the actual duration. However, that will now work for time domain signals.

ahnitz · 2024-11-19T23:38:32Z

bin/bank/pycbc_brute_bank

@@ -53,6 +53,8 @@ parser.add_argument('--approximant',  required=False,
 parser.add_argument('--minimal-match', default=0.97, type=float)
 parser.add_argument('--buffer-length', default=2, type=float,
    help='size of waveform buffer in seconds')
+parser.add_argument('--buffer-high-pass-length', default=None, type=float,


The name / help might be clearer. Maybe --full-resolution-buffer-length or maybe you have a better idea? Suggestions?

--full-resolution-buffer-length sounds good! If it feels too lengthy, perhaps --full-res-buffer-length could be a more concise alternative.

spxiwh · 2024-11-20T08:44:31Z

What does this gain you, in terms of computational efficiency? I would expect that the cost of generating TD waveforms (which I think is still done here at the full sample rate/length) dominates matched-filtering calculations in this application?

Kanchan-05 · 2024-11-20T15:49:06Z

@spxiwh I looked at the cProfile output (see attachment) for low-mass template bank generation, and it turns out the FFT operations for match calculations are more time-consuming than generating waveforms in the time domain.

If we go with the reduced frequency approach, we can reduce the cost of match calculations quite a bit. After that, waveform generation becomes the main bottleneck, but I think we can handle that by parallelizing it across multiple cores.

ahnitz

Can you double check the code style? Codeclimate doesn't appear to be automatically running. Make sure that your code is following PEP8, etc.

… some bugs

ahnitz

@Kanchan-05 Address the one remaining comment and then feel free to merge.

ahnitz · 2024-11-22T18:00:51Z

bin/bank/pycbc_brute_bank

@@ -53,6 +53,8 @@ parser.add_argument('--approximant',  required=False,
 parser.add_argument('--minimal-match', default=0.97, type=float)
 parser.add_argument('--buffer-length', default=2, type=float,
    help='size of waveform buffer in seconds')
+parser.add_argument('--full-resolution-buffer-length', default=None, type=float,
+    help='size of waveform buffer in seconds')


I think this help message needs to be clear, e.g. how is this different from buffer-length? Otherwise the PR looks fine to me.

spxiwh · 2024-11-22T20:55:25Z

Okay, so 3600 waveform generations and 160000 FFTs … Makes sense then!

I do note that you are using the numpy backend for FFTs, which is slow, and not using the faster Class based FFTs (there is a “do cached FFTs” function that I wrote to leverage the class based stuff in an easy way, which would probably be useful here) … This code would be an interesting test case for the CUPY backend I am playing with as well (especially if using a waveform that can be generated on a GPU).

Kanchan-05 · 2024-11-22T21:34:56Z

@spxiwh That's interesting. Could you please point me to the class that you mentioned?

spxiwh · 2024-11-24T20:26:02Z

pycbc/pycbc/strain/strain.py

Line 1330 in b7cb8bc

def execute_cached_fft(invec_data, normalize_by_rate=True, ifft=False,

is the function I mean. This won't do anything with the numpy FFT backend (which is always slow), but if you install+use MKL or FFTW it will help quite a bit.

added a feature to generate frequency domain waveforms at reduced fre…

003bb36

…quencies

ahnitz reviewed Nov 19, 2024

View reviewed changes

ahnitz reviewed Nov 20, 2024

View reviewed changes

Kanchan Soni added 3 commits November 20, 2024 12:13

Mofified the input argument --full-resolution-buffer-length and fixed…

2fbc5ee

… some bugs

Minor change

ccc8068

Minor change

eae8e88

ahnitz approved these changes Nov 22, 2024

View reviewed changes

updated the help message for --buffer-high-pass-length

7e08bea

ahnitz approved these changes Nov 22, 2024

View reviewed changes

ahnitz enabled auto-merge (squash) November 22, 2024 18:14

ahnitz merged commit 47d4d8d into gwastro:master Nov 22, 2024
29 checks passed

Kanchan-05 deleted the add_reduced_freq_match_func branch November 22, 2024 20:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a feature to generate frequency domain waveforms at reduced frequencies #4948

Add a feature to generate frequency domain waveforms at reduced frequencies #4948

Kanchan-05 commented Nov 19, 2024

ahnitz commented Nov 19, 2024

ahnitz Nov 19, 2024

Kanchan-05 Nov 19, 2024

spxiwh commented Nov 20, 2024

Kanchan-05 commented Nov 20, 2024

ahnitz left a comment

ahnitz left a comment

ahnitz Nov 22, 2024

Kanchan-05 Nov 22, 2024

spxiwh commented Nov 22, 2024

Kanchan-05 commented Nov 22, 2024

spxiwh commented Nov 24, 2024

Add a feature to generate frequency domain waveforms at reduced frequencies #4948

Add a feature to generate frequency domain waveforms at reduced frequencies #4948

Conversation

Kanchan-05 commented Nov 19, 2024

Standard information about the request

Motivation

Contents

Links to any issues or associated PRs

Testing performed

Additional notes

ahnitz commented Nov 19, 2024

ahnitz Nov 19, 2024

Choose a reason for hiding this comment

Kanchan-05 Nov 19, 2024

Choose a reason for hiding this comment

spxiwh commented Nov 20, 2024

Kanchan-05 commented Nov 20, 2024

ahnitz left a comment

Choose a reason for hiding this comment

ahnitz left a comment

Choose a reason for hiding this comment

ahnitz Nov 22, 2024

Choose a reason for hiding this comment

Kanchan-05 Nov 22, 2024

Choose a reason for hiding this comment

spxiwh commented Nov 22, 2024

Kanchan-05 commented Nov 22, 2024

spxiwh commented Nov 24, 2024