- sample_rate (int, optional): Sample rate of the audio signal. (Default: 16000)
- n_fft (int, optional): Size of the FFT; produces n_fft // 2 + 1 frequency bins. (Default: 400)
- win_length (int or NULL, optional): Window size. (Default: n_fft)
- hop_length (int or NULL, optional): Length of hop between STFT windows. (Default: win_length // 2)
- f_min (float, optional): Minimum frequency. (Default: 0.)
- f_max (float or NULL, optional): Maximum frequency. (Default: NULL)
- pad (int, optional): Two-sided padding of the signal. (Default: 0)
- n_mels (int, optional): Number of mel filterbanks. (Default: 128)
- window_fn (function, optional): A function that creates the window tensor applied to (multiplied with) each frame. (Default: torch_hann_window)
- power (float, optional): Power of the norm. (Default: 2.0)
- normalized (logical, optional): Whether to normalize by magnitude after the STFT. (Default: FALSE)
- ... (optional): Additional arguments passed to the window function.
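
Several of the defaults above depend on other arguments (`win_length` on `n_fft`, `hop_length` on `win_length`). The following is a minimal base-R sketch of how those documented defaults resolve. It does not call the library itself; the helper name `resolve_stft_defaults` and its return structure are hypothetical and exist only for illustration.

```r
# Hypothetical helper (not part of any package): mirrors the defaults
# documented in the argument list, using integer division (%/%) for "//".
resolve_stft_defaults <- function(n_fft = 400L,
                                  win_length = NULL,
                                  hop_length = NULL) {
  # win_length defaults to n_fft
  if (is.null(win_length)) win_length <- n_fft
  # hop_length defaults to win_length // 2
  if (is.null(hop_length)) hop_length <- win_length %/% 2L
  list(
    win_length  = win_length,
    hop_length  = hop_length,
    # an FFT of size n_fft yields n_fft // 2 + 1 frequency bins
    n_freq_bins = n_fft %/% 2L + 1L
  )
}

defaults <- resolve_stft_defaults()
custom   <- resolve_stft_defaults(n_fft = 1024L, win_length = 512L)
str(defaults)
str(custom)
```

With all arguments left at their defaults, this gives a window of 400 samples, a hop of 200 samples, and 201 frequency bins. Setting `win_length` explicitly changes the default `hop_length` but not the number of bins, which depends only on `n_fft`.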