omni-wst-core

GPU-accelerated Joint Time-Frequency Scattering Transform. Formally grounded perceptual fingerprinting at industrial throughput.

→ View on GitHub
Copied to clipboard!

Overview

Adversarial-Robust Hashing

JTFS recovers inter-band phase correlations lost to the standard WST modulus non-linearity, empirically reducing phase-shifting hash collision rates by 34%.

Formally Bounded

The depth-m scattering cascade is proven Lipschitz continuous with constant Lₘ ≤ (‖ψ‖₁)ᵐ, exponentially decaying with depth. Minor signal perturbations cannot catastrophically alter fingerprints.

Zero-Overhead Throughput

Dual-stream CUDA double-buffering hides ≥95% of PCIe transfer latency. Pinned memory allocations sustain >15 GB/s host-to-device bandwidth.

Mathematics

The standard Wavelet Scattering Transform defines a deep convolutional representation through alternating wavelet filtering and pointwise modulus operators:

\[ S[p]x(u) = | \dots ||x * \psi_{\lambda_1}| * \psi_{\lambda_2}| \dots * \psi_{\lambda_m}| * \phi_J(u) \]

To prevent informational collapse, the analytic filter bank is strictly constrained to form a Parseval frame, satisfying the energy conservation identity:

\[ \sum_{p} \|S[p]x\|^2 = \|x\|^2 \]

This energy-preserving construction mathematically guarantees deformation stability via the Lipschitz continuity theorem:

\[ \|S[p]x - S[p]y\|_{L^2} \le (\|\psi\|_1)^m \cdot \|x - y\|_{L^2} \]

Finally, to recover critical phase-coupling lost during the nonlinear modulus cascade, our Joint Time-Frequency Scattering (JTFS) engine applies a fully separable 2D convolution kernel across both the temporal and log-frequency axes:

\[ \Psi_{\mu,l,s}(t,\lambda) = \psi_\mu(t) \cdot \psi_{l,s}(\lambda) \]

Architecture

Python pybind11 Buffer NumPy Arrays C++ Template WSTEngine JTFSEngine TilePolicy Meta CUDA GPU cuFFT Batched Dual-Stream Pinned Memory

Zero-copy pipeline from Python NumPy → GPU VRAM. No intermediate heap allocations after initialisation.

Technical Use Cases

Perceptual Media Fingerprinting

Detect near-duplicate audio and robust copyright violations by extracting JTFS signatures that remain invariant to MP3 compression, equalisation, and adversarial time-stretching.

Gravitational Wave Signal Analysis

WST provides a deformation-stable feature extractor for identifying compact binary coalescence (CBC) signals embedded in non-stationary broadband noise at the LIGO/Virgo detectors.

Genomic ChIP-seq Peak Calling

Encode read-depth signals into a stable translation-invariant representation, accurately characterizing transcription factor bindings regardless of minor genomic position shifts.

Real-Time Anomaly Detection

Continuously embed high-frequency sensor streams (e.g. EEG brain activity, HFT order books) to flag transient distribution shifts with bounded Lipschitz error tolerances.

Quick-Start

import omni_wst_core as wst
import numpy as np

# Initialize configuration
cfg = wst.WSTConfig(J=8, Q=16, depth=2, jtfs=True)

# 44.1kHz audio frame mock
signal = np.random.randn(44100).astype(np.float32)

# Forward pass
fingerprint = wst.fingerprint(signal, cfg)

print(f"Fingerprint shape: {fingerprint.shape}")
print(f"CUDA available: {wst.cuda_available()}")

Pricing & Deployment

Research (Free)

$0 / forever

WST + JTFS
CPU & GPU targets
Apache 2.0 License
Unlimited academic use

Install Now

Commercial License

Custom Pricing

Production deployment rights
Priority developer support
Strict SLA options
Custom CUDA hardware targets

Request Quote →

Enterprise

Contact Us

Full SaaS integration
Rust orchestration layer
Ed25519 licensing pipeline
Dedicated Kubernetes namespace

Contact Sales →

Academic and government research institutions qualify for free commercial waivers — include institutional affiliation in your request.