Metadata-Version: 2.4
Name: voco-fishspeech
Version: 0.0.4
Summary: Fish Speech 1.5 TTS plugin for VOCO audio inference runtime
License-Expression: MIT
Requires-Python: >=3.10
Description-Content-Type: text/markdown
Requires-Dist: voco>=0.0.1
Requires-Dist: torch>=2.5.1
Requires-Dist: torchaudio>=2.5.1
Requires-Dist: numpy<=1.26.4
Requires-Dist: scipy
Requires-Dist: loguru>=0.6.0
Requires-Dist: hydra-core>=1.3.2
Requires-Dist: click
Requires-Dist: tqdm
Requires-Dist: transformers>=4.45.2
Requires-Dist: huggingface-hub>=0.20.0
Requires-Dist: omegaconf
Requires-Dist: soundfile
Requires-Dist: einops>=0.7.0
Requires-Dist: vector_quantize_pytorch==1.14.24
Requires-Dist: librosa>=0.10.1
Requires-Dist: einx[torch]==0.2.2
Requires-Dist: natsort>=8.4.0
Requires-Dist: tiktoken>=0.8.0
Requires-Dist: loralib>=0.1.2
Requires-Dist: resampy>=0.4.3
Requires-Dist: pydantic>=2.0.0
Requires-Dist: cachetools

# voco-fishspeech

Fish Speech plugin for VOCO - multilingual text-to-speech with voice cloning.

## Install

```bash
pip install -e .
```

## Usage

```python
from voco.core import AudioRouter

router = AudioRouter()
router.load("fishspeech", alias="tts")

for result in router.infer(
    "tts",
    text="Hello world",
    ref_audio="reference.wav",
    ref_text="Reference transcript"
):
    print(result.audio.shape)
```

## Features

- Multilingual support
- Voice cloning capabilities
- High-quality speech synthesis
- Flexible reference encoding

## Requirements

- Python >=3.10
- PyTorch with CUDA support recommended
