
multi-level alignment with "task_adjust_boundary_nonspeech_min" #237

Open
bwang482 opened this issue Oct 20, 2019 · 2 comments

bwang482 commented Oct 20, 2019

I want to align my audio recordings with their corresponding transcripts. The audio contains many pauses and stretches of silence. I need multi-level alignment (mainly word level and segment/paragraph level) as well as alignment of the pauses and silences, because it is important for me to know how long the inter-segment pauses are. However, when I use the command below, no pauses/silences are detected between segments, whereas if I use is_text_type=plain instead of mplain, I do get alignments for the inter-segment pauses (as well as for the segments).

python -m aeneas.tools.execute_task sample_audio.mp3 sample_audio_transcript.txt "task_language=eng|os_task_file_format=json|is_text_type=mplain|task_adjust_boundary_nonspeech_min=0.0100|task_adjust_boundary_nonspeech_string=(sil)|task_adjust_boundary_algorithm=auto" sample_audio_output.multilevel.json

Why?
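Until the multilevel case is sorted out, one workaround is to run a second pass with is_text_type=plain and measure the inter-segment gaps yourself from the JSON sync map. A minimal sketch, assuming the usual shape of aeneas JSON output (a top-level "fragments" list whose entries carry "id", "begin", and "end", with times as strings of seconds); the helper name inter_fragment_gaps is mine, not part of aeneas:

```python
import json

def inter_fragment_gaps(sync_map_path):
    """Return (fragment_id, gap_seconds) pairs, one per silent gap
    between consecutive fragments in an aeneas JSON sync map.

    The id reported is that of the fragment *preceding* the gap.
    """
    with open(sync_map_path) as f:
        fragments = json.load(f)["fragments"]
    gaps = []
    for prev, cur in zip(fragments, fragments[1:]):
        # begin/end are strings like "3.100"; convert before subtracting
        gap = float(cur["begin"]) - float(prev["end"])
        if gap > 0:
            gaps.append((prev.get("id"), gap))
    return gaps
```

Run it on the output of the plain-mode command (e.g. `inter_fragment_gaps("sample_audio_output.json")`) to get the pause durations the mplain run is not reporting.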

@pettarin

To be honest, off the top of my head I cannot answer. It might be a limitation of the current multilevel implementation; I would need to check the code.

@readbeyond readbeyond added the bug label Jan 21, 2021
@readbeyond readbeyond added this to the 2.0.0 milestone Jan 21, 2021
@lokesh1199


What is the structure of your transcriptions?

Is it

Lorem Ipsum is simply dummy text of the printing and typesetting industry

or

Lorem 
Ipsum 
is 
simply
dummy 
text 
of 
the 
...
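For context, as I understand the aeneas documentation, the mplain (multilevel plain) input format expects paragraphs separated by blank lines, with one sentence per line inside each paragraph; the word level is split automatically from the sentences. A transcript in that layout would look something like this (placeholder text, not from the original report):

```text
Lorem ipsum dolor sit amet.
Consectetur adipiscing elit.

Sed do eiusmod tempor incididunt.
Ut labore et dolore magna aliqua.
```

So the answer to the question above determines whether the file is being parsed as intended at the paragraph and sentence levels.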
