Convenience Functions
High-level functions for one-shot spectrogram computation. For batch processing, use the Planner API instead.
Linear Spectrograms
- spectrograms.compute_linear_power_spectrogram()
Compute a linear power spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
Returns
- Spectrogram
Spectrogram with linear frequency scale and power amplitude scale
- spectrograms.compute_linear_magnitude_spectrogram()
Compute a linear magnitude spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
Returns
- Spectrogram
Spectrogram with linear frequency scale and magnitude amplitude scale
- spectrograms.compute_linear_db_spectrogram()
Compute a linear decibel spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
Returns
- Spectrogram
Spectrogram with linear frequency scale and decibel amplitude scale
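The three linear variants differ only in amplitude scale. As a rough illustration of how those scales conventionally relate, the sketch below converts a single complex STFT bin to magnitude, power, and decibels using only the standard library. The `floor` argument is a hypothetical stand-in for whatever clamping the library applies; the actual reference level and floor used by `compute_linear_db_spectrogram()` are not documented in this section.

```python
import math


def amplitude_scales(stft_bin: complex, floor: float = 1e-10):
    """Return (magnitude, power, decibels) for one complex STFT bin.

    `floor` is an assumed clamp value, not a documented library default.
    """
    magnitude = abs(stft_bin)
    power = magnitude ** 2
    # Clamp before the logarithm so silent bins do not produce -inf.
    decibels = 10.0 * math.log10(max(power, floor))
    return magnitude, power, decibels


mag, pwr, db = amplitude_scales(3 + 4j)
print(mag, pwr, round(db, 2))
```

Under these definitions, doubling the magnitude quadruples the power and adds roughly 6 dB.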
Mel Spectrograms
- spectrograms.compute_mel_power_spectrogram()
Compute a mel power spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
- filter_params : PyMelParams
Filterbank parameters
- db : typing.Optional[LogParams], optional
Optional decibel scaling parameters
Returns
- Spectrogram
Spectrogram with mel frequency scale and power amplitude scale
- spectrograms.compute_mel_magnitude_spectrogram()
Compute a mel magnitude spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
- filter_params : PyMelParams
Filterbank parameters
- db : typing.Optional[LogParams], optional
Optional decibel scaling parameters
Returns
- Spectrogram
Spectrogram with mel frequency scale and magnitude amplitude scale
- spectrograms.compute_mel_db_spectrogram()
Compute a mel decibel spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
- filter_params : PyMelParams
Filterbank parameters
- db : typing.Optional[LogParams], optional
Optional decibel scaling parameters
Returns
- Spectrogram
Spectrogram with mel frequency scale and decibel amplitude scale
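To make "mel frequency scale" concrete, here is a minimal sketch of the widely used HTK-style Hz/mel conversion and of band centres spaced evenly in mel. It is an assumption that this formula matches the filterbank built from `PyMelParams`; some implementations use the Slaney variant instead. The helper names and arguments (`n_mels`, `f_min`, `f_max`) are illustrative, not library API.

```python
import math


def hz_to_mel(hz: float) -> float:
    """HTK-style conversion from Hz to mel (assumed variant)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_mels: int, f_min: float, f_max: float) -> list[float]:
    """Centre frequencies (Hz) of n_mels bands spaced evenly on the mel scale."""
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_mels + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_mels)]


centers = mel_center_frequencies(n_mels=4, f_min=0.0, f_max=8000.0)
print([round(c, 1) for c in centers])
```

The centres are equidistant in mel but grow further apart in Hz, which is why mel spectrograms give more resolution to low frequencies.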
ERB Spectrograms
- spectrograms.compute_erb_power_spectrogram()
Compute an ERB/gammatone power spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
- filter_params : PyErbParams
Filterbank parameters
- db : typing.Optional[LogParams], optional
Optional decibel scaling parameters
Returns
- Spectrogram
Spectrogram with ERB/gammatone frequency scale and power amplitude scale
- spectrograms.compute_erb_magnitude_spectrogram()
Compute an ERB/gammatone magnitude spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
- filter_params : PyErbParams
Filterbank parameters
- db : typing.Optional[LogParams], optional
Optional decibel scaling parameters
Returns
- Spectrogram
Spectrogram with ERB/gammatone frequency scale and magnitude amplitude scale
- spectrograms.compute_erb_db_spectrogram()
Compute an ERB/gammatone decibel spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
- filter_params : PyErbParams
Filterbank parameters
- db : typing.Optional[LogParams], optional
Optional decibel scaling parameters
Returns
- Spectrogram
Spectrogram with ERB/gammatone frequency scale and decibel amplitude scale
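For readers unfamiliar with the ERB scale, the sketch below implements the Glasberg and Moore (1990) formulas for equivalent rectangular bandwidth and ERB-rate. Whether the library's gammatone filterbank uses exactly these constants is an assumption; the function names are illustrative only.

```python
import math


def erb_bandwidth(hz: float) -> float:
    """Equivalent rectangular bandwidth in Hz at centre frequency `hz`."""
    return 24.7 * (4.37 * hz / 1000.0 + 1.0)


def hz_to_erb_rate(hz: float) -> float:
    """Position on the ERB-rate scale (number of ERBs below `hz`)."""
    return 21.4 * math.log10(4.37 * hz / 1000.0 + 1.0)


def erb_rate_to_hz(erb: float) -> float:
    """Inverse of hz_to_erb_rate."""
    return (10.0 ** (erb / 21.4) - 1.0) * 1000.0 / 4.37


print(round(erb_bandwidth(1000.0), 3), round(hz_to_erb_rate(1000.0), 2))
```

Filters spaced evenly in ERB-rate become wider with increasing centre frequency, approximating the frequency selectivity of the human auditory system.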
Audio Features
- spectrograms.compute_cqt()
Compute a Constant-Q Transform power spectrogram.
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- params : SpectrogramParams
Spectrogram parameters
- cqt : CqtParams
CQT parameters
- db : typing.Optional[LogParams], optional
Optional decibel scaling parameters
Returns
- Spectrogram
CQT spectrogram with power amplitude scale
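A Constant-Q Transform places its bins geometrically, so that every bin has the same ratio of centre frequency to bandwidth. The following sketch computes those centre frequencies and the resulting Q factor. The argument names (`f_min`, `n_bins`, `bins_per_octave`) are hypothetical and may not correspond to the fields of `CqtParams`.

```python
def cqt_center_frequencies(f_min: float, n_bins: int, bins_per_octave: int) -> list[float]:
    """Geometrically spaced centre frequencies, starting at f_min."""
    return [f_min * 2.0 ** (k / bins_per_octave) for k in range(n_bins)]


def q_factor(bins_per_octave: int) -> float:
    """Ratio of centre frequency to bandwidth, identical for every bin."""
    return 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)


# Two octaves above A1 (55 Hz) at semitone resolution.
freqs = cqt_center_frequencies(f_min=55.0, n_bins=25, bins_per_octave=12)
print(freqs[0], freqs[12], freqs[24], round(q_factor(12), 2))
```

With 12 bins per octave each bin corresponds to one semitone, which is what makes the CQT convenient for musical analysis.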
- spectrograms.compute_chromagram()
Compute a chromagram (pitch class profile).
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- stft_params : StftParams
STFT parameters
- sample_rate : float
Sample rate in Hz
- chroma_params : ChromaParams
Chromagram parameters
Returns
- numpy.ndarray
Chromagram as a 2D NumPy array (12 x n_frames)
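The 12 rows of a chromagram correspond to the 12 pitch classes. As an illustration of the idea, and not of the library's actual algorithm, the sketch below maps frequencies to pitch classes under 12-tone equal temperament and folds per-bin power into a 12-element chroma vector. The `tuning_hz` default of 440 Hz and the hard assignment of each bin to one class are assumptions; real implementations often use smoother weighting.

```python
import math


def pitch_class(hz: float, tuning_hz: float = 440.0) -> int:
    """Pitch class 0..11 (0 = C, 9 = A) of a frequency in equal temperament."""
    midi = 69.0 + 12.0 * math.log2(hz / tuning_hz)
    return int(round(midi)) % 12


def fold_to_chroma(bin_freqs, bin_power):
    """Sum the power of each frequency bin into its pitch class."""
    chroma = [0.0] * 12
    for hz, power in zip(bin_freqs, bin_power):
        if hz > 0.0:  # the DC bin has no pitch
            chroma[pitch_class(hz)] += power
    return chroma


chroma = fold_to_chroma([0.0, 220.0, 261.63, 440.0], [9.0, 1.0, 0.5, 2.0])
print(chroma)
```

Both 220 Hz and 440 Hz are the note A, so their power accumulates in the same row regardless of octave.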
- spectrograms.compute_mfcc()
Compute MFCCs (Mel-Frequency Cepstral Coefficients).
Parameters
- samples : numpy.typing.NDArray[numpy.float64]
Audio samples as a 1D NumPy array
- stft_params : StftParams
STFT parameters
- sample_rate : float
Sample rate in Hz
- n_mels : int
Number of mel bands
- mfcc_params : MfccParams
MFCC parameters
Returns
- numpy.ndarray
MFCCs as a 2D NumPy array (n_mfcc x n_frames)
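MFCCs are conventionally obtained by taking the logarithm of the mel band energies of each frame and applying a DCT-II, keeping the first `n_mfcc` coefficients. The sketch below shows that step for a single frame. It uses an unnormalised DCT-II and the natural logarithm; the normalisation, log base, flooring, and any liftering applied by `compute_mfcc()` are not stated in this section, so treat these choices as assumptions.

```python
import math


def dct_ii(values, n_out):
    """Unnormalised DCT-II, returning the first n_out coefficients."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * k * (i + 0.5) / n) for i, v in enumerate(values))
        for k in range(n_out)
    ]


def mfcc_frame(mel_energies, n_mfcc, floor=1e-10):
    """MFCCs for one frame of mel band energies (`floor` is an assumed clamp)."""
    log_mel = [math.log(max(e, floor)) for e in mel_energies]
    return dct_ii(log_mel, n_mfcc)


# A spectrally flat frame: all the information ends up in coefficient 0.
coeffs = mfcc_frame([math.e] * 8, n_mfcc=4)
print([round(c, 6) for c in coeffs])
```

Applying this to every column of an (n_mels x n_frames) mel spectrogram yields the (n_mfcc x n_frames) layout described above.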
Low-Level Functions
- spectrograms.compute_stft()
Compute the raw STFT (Short-Time Fourier Transform).
Returns the complex-valued STFT matrix before any frequency mapping or amplitude scaling.
Parameters
- samples
Audio samples as a 1D NumPy array
- params
Spectrogram parameters
Returns
Complex STFT as a 2D NumPy array of complex128 (n_fft/2+1 x n_frames)
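The (n_fft/2+1 x n_frames) shape follows from keeping only the non-negative frequencies of each real-valued frame. The dependency-free sketch below computes a naive Hann-windowed STFT to show where both dimensions come from. It assumes no padding or centring, so its frame count may differ from what `compute_stft()` returns; `naive_stft`, `n_fft`, and `hop` are illustrative names, and the O(n_fft^2) DFT is for demonstration only.

```python
import cmath
import math


def naive_stft(samples, n_fft, hop):
    """Hann-windowed STFT without padding; rows are bins, columns are frames.

    Requires len(samples) >= n_fft.
    """
    window = [0.5 - 0.5 * math.cos(2.0 * math.pi * n / n_fft) for n in range(n_fft)]
    n_bins = n_fft // 2 + 1
    n_frames = 1 + (len(samples) - n_fft) // hop
    stft = [[0j] * n_frames for _ in range(n_bins)]
    for t in range(n_frames):
        frame = [samples[t * hop + n] * window[n] for n in range(n_fft)]
        for k in range(n_bins):
            stft[k][t] = sum(
                x * cmath.exp(-2j * math.pi * k * n / n_fft)
                for n, x in enumerate(frame)
            )
    return stft


sr, n_fft, hop = 8000, 64, 32
tone = [math.sin(2.0 * math.pi * 1000.0 * n / sr) for n in range(256)]
spec = naive_stft(tone, n_fft, hop)
# Bin k corresponds to k * sr / n_fft Hz, so a 1000 Hz tone peaks at bin 8.
peak_bin = max(range(len(spec)), key=lambda k: abs(spec[k][0]))
print(len(spec), len(spec[0]), peak_bin)
```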