Add duration of audio and VAD removed duration to BatchedInferencePipeline #1186

Open · wants to merge 12 commits into master

Conversation

greenw0lf

With the non-batched version of the WhisperModel, you would get logging output like:

Processing audio with duration 01:33:59.990
VAD filter removed 06:55.648 of audio

When calling the BatchedInferencePipeline's transcribe() method, however, this output is no longer produced.

This PR brings that logging back. I believe it adds no extra overhead, and it is quite useful for developers who want to know how much audio actually gets processed in the end.

If that is not the case and this turns out to be an issue with how I am using the model, I apologize in advance!
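For context, here is a self-contained sketch of the kind of logging being restored. The helper names (`format_duration`, `log_audio_durations`) are illustrative, not the actual implementation, and the `"faster_whisper"` logger name is an assumption about what `get_logger()` returns; a plain stdlib logger is used to keep the sketch runnable on its own.

```python
import logging

logging.basicConfig(level=logging.INFO)
# Assumption: the library's get_logger() returns a logger with this name;
# a plain stdlib logger keeps the sketch self-contained.
logger = logging.getLogger("faster_whisper")


def format_duration(seconds: float) -> str:
    """Render seconds as HH:MM:SS.mmm, e.g. 5639.990 -> '01:33:59.990'."""
    milliseconds = round(seconds * 1000)
    hours, milliseconds = divmod(milliseconds, 3_600_000)
    minutes, milliseconds = divmod(milliseconds, 60_000)
    secs, milliseconds = divmod(milliseconds, 1_000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{milliseconds:03d}"


def log_audio_durations(duration: float, duration_after_vad: float) -> None:
    """Emit the two messages the non-batched WhisperModel.transcribe() logs."""
    logger.info("Processing audio with duration %s", format_duration(duration))
    if duration_after_vad < duration:
        logger.info(
            "VAD filter removed %s of audio",
            format_duration(duration - duration_after_vad),
        )


# Example: a 01:33:59.990 file from which VAD trims roughly 06:55.648 of silence.
log_audio_durations(duration=5639.990, duration_after_vad=5639.990 - 415.648)
```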

```diff
@@ -114,6 +114,7 @@ def __init__(
         self,
         model,
     ):
+        self.logger = get_logger()
```
Collaborator

self.model already has a logger, so I'd rather we use the same logger instead of having duplicate loggers in both the pipeline and the model
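For illustration, a minimal sketch of that suggestion using stand-in classes (the real WhisperModel and BatchedInferencePipeline are not imported here), assuming the model exposes its logger as `model.logger` as the review comment implies. The point is simply that the pipeline borrows the model's existing logger instead of calling `get_logger()` a second time.

```python
import logging


class TinyModel:
    """Stand-in for WhisperModel: it already owns a logger, like the real class."""

    def __init__(self) -> None:
        self.logger = logging.getLogger("faster_whisper")


class TinyPipeline:
    """Stand-in for BatchedInferencePipeline following the reviewer's suggestion."""

    def __init__(self, model: TinyModel) -> None:
        self.model = model
        # No separate get_logger() call: reuse the logger the model already has,
        # so pipeline and model messages go through the same logger instance.

    def transcribe(self, duration: float) -> None:
        self.model.logger.info("Processing audio with duration %.3f s", duration)


logging.basicConfig(level=logging.INFO)
TinyPipeline(TinyModel()).transcribe(duration=5639.990)
```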
