spectrograms

User Guide

  • Installation
    • Requirements
    • Install from PyPI
    • Install from Source
  • Quickstart
    • Basic Example
    • Understanding the Result
    • Mel Spectrogram
    • Next Steps
  • Choosing Parameters
    • STFT Parameters
      • FFT Size (n_fft)
      • Hop Size
      • Window Function
      • Centering
    • Default Configurations
      • Speech Processing
      • Music Processing
    • Mel Scale Parameters
      • Number of Mel Bands
      • Frequency Range
    • Decibel Conversion
    • ERB Scale
    • Performance Considerations
      • Memory Usage
      • Computation Time
  • Frequency Scales
    • Linear Scale
    • Log Frequency Scale
    • Mel Scale
    • ERB Scale
    • Constant-Q Transform
    • Scale Comparison
    • Choosing a Scale
  • Batch Processing
    • Why Use Plans?
    • Basic Usage
    • Creating Plans
    • Computing Spectrograms
      • Full Spectrogram
      • Single Frame
      • Output Shape Prediction
    • Performance Comparison
    • When to Use Plans
    • Memory Considerations
  • Audio Features
    • MFCC (Mel-Frequency Cepstral Coefficients)
      • Basic Usage
      • Standard Configuration
      • Custom Configuration
    • Chromagram
      • Basic Usage
      • Output
      • Standard Configuration
      • Normalization Options
    • Constant-Q Transform (CQT)
      • Basic Usage
      • Output
      • Configuration
    • Applications
      • Speech Recognition
      • Music Analysis
      • Audio Classification

API Reference

  • API Reference
    • Parameters
      • STFT Parameters
        • StftParams
      • Spectrogram Parameters
        • SpectrogramParams
      • Decibel Conversion
        • LogParams
      • Frequency Scales
        • Mel Scale
        • ERB Scale
        • Constant-Q Transform
      • Audio Features
        • Chromagram
        • MFCC
    • Convenience Functions
      • Linear Spectrograms
        • compute_linear_power_spectrogram()
        • compute_linear_magnitude_spectrogram()
        • compute_linear_db_spectrogram()
      • Mel Spectrograms
        • compute_mel_power_spectrogram()
        • compute_mel_magnitude_spectrogram()
        • compute_mel_db_spectrogram()
      • ERB Spectrograms
        • compute_erb_power_spectrogram()
        • compute_erb_magnitude_spectrogram()
        • compute_erb_db_spectrogram()
      • Audio Features
        • compute_cqt()
        • compute_chromagram()
        • compute_mfcc()
      • Low-Level Functions
        • compute_stft()
    • Planner API
      • Creating Plans
        • SpectrogramPlanner
      • Plan Classes
        • Linear Plans
        • Mel Plans
        • ERB Plans
    • Results
      • Spectrogram
        • Spectrogram.data
        • Spectrogram.db_range()
        • Spectrogram.duration()
        • Spectrogram.frequencies
        • Spectrogram.frequency_range()
        • Spectrogram.n_bins
        • Spectrogram.n_frames
        • Spectrogram.params
        • Spectrogram.shape
        • Spectrogram.times
    • Exceptions
      • SpectrogramError
      • InvalidInputError
      • DimensionMismatchError
      • FFTBackendError
      • InternalError
spectrograms
  • API Reference
  • View page source

API Reference

This section documents the complete Python API for the spectrograms library.

  • Parameters
    • STFT Parameters
    • Spectrogram Parameters
    • Decibel Conversion
    • Frequency Scales
    • Audio Features
  • Convenience Functions
    • Linear Spectrograms
    • Mel Spectrograms
    • ERB Spectrograms
    • Audio Features
    • Low-Level Functions
  • Planner API
    • Creating Plans
    • Plan Classes
  • Results
    • Spectrogram
  • Exceptions
    • SpectrogramError
    • InvalidInputError
    • DimensionMismatchError
    • FFTBackendError
    • InternalError
Previous Next

© Copyright 2026, Jack Geraghty.

Built with Sphinx using a theme provided by Read the Docs.