Metadata-Version: 2.4
Name: speak-tts
Version: 0.1.0
Summary: Voice cloning CLI for Mac using F5-TTS
Project-URL: Homepage, https://github.com/danielsinai/speak-tts
Project-URL: Repository, https://github.com/danielsinai/speak-tts
Author: Daniel Sinai
License-Expression: MIT
License-File: LICENSE
Keywords: f5-tts,mlx,speech-synthesis,tts,voice-cloning
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Operating System :: MacOS
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Multimedia :: Sound/Audio :: Speech
Requires-Python: >=3.10
Requires-Dist: f5-tts-mlx
Requires-Dist: soundfile
Description-Content-Type: text/markdown

# speak-tts

Voice cloning CLI for Mac (Apple Silicon) using F5-TTS.

## Installation

### Via Homebrew

```bash
brew tap danielsinai/speak-tts
brew install speak-tts
```

### Via pipx

```bash
pipx install speak-tts
```

### Via pip

```bash
pip install speak-tts
```

## Requirements

- macOS with Apple Silicon (M1/M2/M3/M4)
- Python 3.10+

## Usage

### Add a Voice Profile

```bash
speak --add-profile daniel --voice ~/voice.wav --text-file ~/reference.txt
```

The reference audio should be a clear WAV recording (24kHz) of the voice you want to clone, and the text file should contain an exact transcript.

### Generate Speech

Plays audio directly (no file saved):

```bash
speak daniel "Hello, this is a test!"
```

Save to a file:

```bash
speak daniel "Hello world" --output ~/Desktop/hello.wav
```

### List Profiles

```bash
speak --list
```

### Play Audio

```bash
speak --play output.wav
```

## Profile Storage

Profiles stored in `~/.speak/profiles/<name>/` with:
- `reference.wav` - Voice sample
- `reference.txt` - Transcript

## Tips

- **Reference Audio**: 5-15 seconds, 24kHz, minimal background noise
- **Reference Text**: Exact transcript of the audio
- **First Run**: Downloads F5-TTS model (~1GB)

## Publishing

### 1. Publish to PyPI

```bash
# Install tools
pip install build twine

# Build and upload
./scripts/publish.sh
```

Or manually:

```bash
python -m build
twine upload dist/*
```

### 2. Set up Homebrew Tap

1. Create a new GitHub repo: `homebrew-speak-tts`

2. Copy the formula:
   ```bash
   cp Formula/speak-tts.rb ~/homebrew-speak-tts/
   ```

3. Get SHA256 after PyPI upload:
   ```bash
   shasum -a 256 dist/speak_tts-*.tar.gz
   ```

4. Update `sha256` in the formula, commit, and push

5. Users install with:
   ```bash
   brew tap YOUR_USERNAME/speak-tts
   brew install speak-tts
   ```

## Development

```bash
git clone https://github.com/danielsinai/speak-tts
cd speak-tts
python3 -m venv venv
source venv/bin/activate
pip install -e .
speak --help
```

## License

MIT
