Hubert Denoiser - distorted output audio #131

Open
A-2-H opened this issue Oct 15, 2024 · 0 comments

A-2-H commented Oct 15, 2024

As in the title: the denoiser script produces output audio that is heavily distorted, too loud, and clipping.
Following the guide, I used this command:

python tools/diffusion/inference.py --config configs/denoiser_cn_hubert.py \
    --checkpoint checkpoints/denoiser/denoiser-cn-hubert-large-v1.ckpt \
    --input "input.wav" \
    --output "output.wav" \
    --sampler_interval 5 \
    --skip_steps 970

I tried changing --skip_steps to different values, but as I understand it, the lower the value, the more steps are run, so the audio changes completely. With a very low number like 30 the output doesn't clip, but the quality is bad and it doesn't sound like the original. According to the guide, 970 should be fine because it leaves only 30 steps, yet it distorts the audio.
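To make the step arithmetic explicit, here is a minimal sketch of how I am reading --skip_steps. It assumes a 1000-step schedule (implied by the 0-1000 range I tested), and it ignores --sampler_interval, whose exact effect on the step count I have not verified. `remaining_steps` is a hypothetical helper, not a function from the repository.

```python
# Hypothetical helper: restates the arithmetic described above.
# TOTAL_STEPS = 1000 is an assumption, not a value read from the config.
TOTAL_STEPS = 1000


def remaining_steps(skip_steps: int, total_steps: int = TOTAL_STEPS) -> int:
    """Number of diffusion steps left after skipping the first `skip_steps`."""
    if not 0 <= skip_steps <= total_steps:
        raise ValueError("skip_steps must be between 0 and total_steps")
    return total_steps - skip_steps


# Print the remaining step count for a few of the values I tried.
for skip in (30, 500, 970):
    print(f"--skip_steps {skip}: {remaining_steps(skip)} steps remain")
```

Under this reading, a low --skip_steps runs almost the whole schedule, which would explain why the result drifts far from the original.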

Any value between 0 and 1000 gives bad results in some way: the lower the number, the poorer the quality; the higher the number, the more distorted and clipped the output becomes.
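One way to put numbers on "too loud and clipping" is to compare peak level, RMS level, and the share of samples sitting at full scale for the input and the output. The sketch below is only an illustration on synthetic data: `clipping_report` and the 0.999 threshold are my own assumptions, and the samples are expected to be floats already normalized to [-1.0, 1.0].

```python
import math


def clipping_report(samples, clip_level=0.999):
    """Return peak, RMS, and the fraction of samples at or above clip_level.

    `samples` is a non-empty sequence of floats normalized to [-1.0, 1.0].
    `clip_level` is an assumed threshold for "at full scale".
    """
    if not samples:
        raise ValueError("samples must not be empty")
    peak = max(abs(s) for s in samples)
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    clipped = sum(1 for s in samples if abs(s) >= clip_level) / len(samples)
    return {"peak": peak, "rms": rms, "clipped_fraction": clipped}


# Synthetic stand-ins for real audio: one second of a quiet 440 Hz sine,
# and the same sine amplified 6x and hard-limited to [-1.0, 1.0].
sample_rate = 16000
quiet = [0.25 * math.sin(2 * math.pi * 440 * n / sample_rate)
         for n in range(sample_rate)]
loud = [max(-1.0, min(1.0, 6.0 * s)) for s in quiet]

print("quiet:", clipping_report(quiet))
print("loud: ", clipping_report(loud))
```

Running the same report on the real input.wav and output.wav (after decoding them to normalized floats) would show whether the denoiser raises the overall level or only pushes peaks into clipping.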

I tried it on different samples rendered by different models, and also on a custom audio recording, and the results are the same.
