How to use timesteps? #203

blankspark · 2022-03-28T13:52:11Z

I have noticed the output of ctcdecode includes timesteps, which the description says it can be used as alignment.
But I just get shape (Batchsize，N_beams，N_timesteps). I don't know how to use it.

timesteps - Shape: BATCHSIZE x N_BEAMS

The timestep at which the nth output character has peak probability. Can be used as alignment between the audio and the transcript.

Thanks in advance.

abarcovschi · 2023-12-07T11:49:26Z

@blankspark have you ever figured out how to use them? I am looking to get word-level time alignments, but I don't know how to calculate this information from the timesteps returned by ctcdecode.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to use timesteps? #203

How to use timesteps? #203

blankspark commented Mar 28, 2022 •

edited

Loading

abarcovschi commented Dec 7, 2023

How to use timesteps? #203

How to use timesteps? #203

Comments

blankspark commented Mar 28, 2022 • edited Loading

timesteps - Shape: BATCHSIZE x N_BEAMS

The timestep at which the nth output character has peak probability. Can be used as alignment between the audio and the transcript.

abarcovschi commented Dec 7, 2023

blankspark commented Mar 28, 2022 •

edited

Loading