
Extremely slow processing on CPU #98

Open
JPery opened this issue Sep 14, 2020 · 4 comments

JPery commented Sep 14, 2020

I think this is an issue related to the tf.signal FFT implementation. It seems like it's using only a CPU core and it's extremely slow. Can we do anything to improve it?

PS: Thank you for your awesome work!

@JPery JPery changed the title Slow processing on CPU Extremely slow processing on CPU Sep 14, 2020
@keunwoochoi
Owner

Hi 😊 thanks for the issue and the context. Maybe one thing we could do is wrap the numpy FFT function and selectively use it based on the hardware, but that doesn't seem very simple.
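For reference, a minimal NumPy STFT sketch (frame, window, real FFT per frame) of the kind such a wrapper could call into on CPU. The function name and parameter defaults here are illustrative, not kapre's API, and np.hanning is symmetric where tf.signal.hann_window defaults to periodic:

```python
import numpy as np

def numpy_stft(signal, frame_length=256, frame_step=128):
    """Frame the signal, apply a Hann window, and take the real FFT per frame."""
    n_frames = 1 + (len(signal) - frame_length) // frame_step
    # Build an (n_frames, frame_length) matrix of overlapping frames.
    frames = np.stack([signal[i * frame_step : i * frame_step + frame_length]
                       for i in range(n_frames)])
    window = np.hanning(frame_length)
    return np.fft.rfft(frames * window, axis=-1)

spec = numpy_stft(np.random.randn(4096).astype(np.float32))
print(spec.shape)  # (31, 129): n_frames x (frame_length // 2 + 1)
```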

@Toku11

Toku11 commented Sep 15, 2020

That's why I closed pull request #72 at the time. I would recommend using larger batches.


JPery commented Nov 17, 2020

Hi there! If you want, we can use the workaround that @zaccharieramzi found, as it improves on the current implementation by up to 10x, at least for my use case, until TensorFlow gives us a better approach!

To do so, we have to include its implementation somewhere in the code:

import multiprocessing
from functools import partial

import tensorflow as tf
from tensorflow.python.framework import ops
from tensorflow.python.ops.signal import shape_ops, fft_ops, window_ops, spectral_ops

def parallel_stft(signals, frame_length, frame_step, fft_length=None,
                  window_fn=window_ops.hann_window,
                  pad_end=False, name=None):
  with ops.name_scope(name, 'stft', [signals, frame_length, frame_step]):
    signals = ops.convert_to_tensor(signals, name='signals')
    signals.shape.with_rank_at_least(1)
    frame_length = ops.convert_to_tensor(frame_length, name='frame_length')
    frame_length.shape.assert_has_rank(0)
    frame_step = ops.convert_to_tensor(frame_step, name='frame_step')
    frame_step.shape.assert_has_rank(0)
    if fft_length is None:
      fft_length = spectral_ops._enclosing_power_of_two(frame_length)
    else:
      fft_length = ops.convert_to_tensor(fft_length, name='fft_length')
    framed_signals = shape_ops.frame(
        signals, frame_length, frame_step, pad_end=pad_end)
    if window_fn is not None:
      window = window_fn(frame_length, dtype=framed_signals.dtype)
      framed_signals *= window
    # Map the FFT over the frames so iterations can run in parallel on CPU,
    # instead of a single sequential rfft call.
    return tf.map_fn(
        partial(fft_ops.rfft, fft_length=[fft_length]),
        framed_signals,
        fn_output_signature=tf.complex64,
        parallel_iterations=multiprocessing.cpu_count(),  # or however many parallel ops you see fit
    )

After that, change the call in the STFT layer from tf.signal.stft to parallel_stft.
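The same frame-level parallelism can be sketched outside TensorFlow with only the standard library and NumPy; here a thread pool plays the role that parallel_iterations plays in tf.map_fn above (the function name and sizes are illustrative):

```python
import multiprocessing
from concurrent.futures import ThreadPoolExecutor
import numpy as np

def parallel_rfft(framed_signals, fft_length=None):
    """Run np.fft.rfft over each frame in a thread pool, one frame per task."""
    workers = multiprocessing.cpu_count()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        rows = list(pool.map(lambda f: np.fft.rfft(f, n=fft_length),
                             framed_signals))
    return np.stack(rows)

frames = np.random.randn(31, 256)
spec = parallel_rfft(frames, fft_length=256)
print(spec.shape)  # (31, 129)
```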

@keunwoochoi
Owner

@JPery Sounds like not a bad idea. To maximize its utility, we'd want this to work (i) automatically when it's on CPU, (ii) without any complex configuration, and (iii) when there is more than one item in the batch. Maybe in __init__ of STFT, we pass an argument that specifies whether this behavior is turned on in call(). In call(), the layer would detect the device (https://www.tensorflow.org/api_docs/python/tf/config/list_physical_devices) and behave accordingly. Of course, this should be tested.
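A TF-free sketch of that control flow, just to show the shape of the dispatch: a constructor flag enables the behavior, and call() branches on a device check. Here gpu_available stands in for the real check via tf.config.list_physical_devices('GPU'), and the class and method names are hypothetical, not kapre's:

```python
class STFTSketch:
    """Illustrative only: mirrors how kapre.STFT's __init__/call() could branch."""

    def __init__(self, parallel_on_cpu=True):
        # Whether to switch to the parallel path when no GPU is present.
        self.parallel_on_cpu = parallel_on_cpu

    def call(self, x, gpu_available):
        # gpu_available stands in for tf.config.list_physical_devices('GPU').
        if self.parallel_on_cpu and not gpu_available:
            return self._parallel_stft(x)   # e.g. the tf.map_fn workaround
        return self._default_stft(x)        # e.g. plain tf.signal.stft

    def _parallel_stft(self, x):
        return ('parallel', x)

    def _default_stft(self, x):
        return ('default', x)

layer = STFTSketch(parallel_on_cpu=True)
print(layer.call('audio', gpu_available=False)[0])  # parallel
print(layer.call('audio', gpu_available=True)[0])   # default
```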

Or, a conservative approach is to create another layer, maybe one inherited from kapre.STFT, which works as described above. We could just keep a gist of it somewhere.

Either way, we need carefully tested code, but I don't think I'll have any time to work on this for at least 1-2 months. I'd love to review a PR :)
