Metadata-Version: 2.4
Name: ollama-vox
Version: 1.0.1
Summary: Local macOS menubar voice assistant powered by MLX Whisper, Ollama, and Kokoro TTS
Author: Mayur Dayal
License: MIT License
        
        Copyright (c) 2026 Mayur Dayal
        
        Permission is hereby granted, free of charge, to any person obtaining a copy
        of this software and associated documentation files (the "Software"), to deal
        in the Software without restriction, including without limitation the rights
        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
        copies of the Software, and to permit persons to whom the Software is
        furnished to do so, subject to the following conditions:
        
        The above copyright notice and this permission notice shall be included in all
        copies or substantial portions of the Software.
        
        THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
        SOFTWARE.
        
Project-URL: Homepage, https://github.com/maadhav-codes/ollama-vox
Project-URL: Repository, https://github.com/maadhav-codes/ollama-vox.git
Project-URL: Issues, https://github.com/maadhav-codes/ollama-vox/issues
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: MacOS
Requires-Python: >=3.12.0
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: huggingface-hub>=1.12.2
Requires-Dist: misaki[en]>=0.9.4
Requires-Dist: mlx-audio>=0.4.3
Requires-Dist: mlx-whisper>=0.4.3
Requires-Dist: numpy>=2.2.6
Requires-Dist: pyyaml>=6.0.3
Requires-Dist: requests>=2.33.1
Requires-Dist: pyside6>=6.8.0
Requires-Dist: sounddevice>=0.5.5
Requires-Dist: soundfile>=0.13.1
Requires-Dist: phonemizer>=3.3.0
Requires-Dist: twine>=6.2.0
Dynamic: license-file

# Ollama Vox

A local macOS menubar voice assistant that records speech, transcribes with MLX Whisper, gets responses from Ollama, and speaks back using Kokoro TTS.

## Features

- **Menubar app** for easy access
- **Local speech-to-text** with `mlx-whisper`
- **Local text generation** with Ollama
- **Local text-to-speech** with `mlx-audio` + Kokoro voices
- **Status panel** showing model info, rolling latency stats, and response history

## Requirements

- macOS (Apple Silicon recommended)
- Python `3.12+`
- [Ollama](https://ollama.com/) installed and running locally

## Installation

The easiest way to install is via `pip` or `uv`:

```bash
pip install ollama-vox
# or if using uv
uv tool install ollama-vox
```

If installing from source for development:

```bash
git clone https://github.com/maadhav-codes/ollama-vox.git
cd ollama-vox
uv sync
```

## First-Time Setup

Download recommended STT + TTS models:

```bash
uv run ollama-vox --setup
```

## Run

Start the menubar application:

```bash
uv run ollama-vox
```

## Usage

1. Click the microphone icon in your macOS menubar.
2. Select **Start Listening** to speak.
3. Select **Stop Listening** when you are done. The app will process your speech and respond with audio.
4. Click **Show Status** to view latency, recent responses, and active models.

## Configuration

Settings are managed in `config.yaml`.

- **`audio`**: Adjust Voice Activity Detection (VAD) and recording limits.
- **`stt.model`**: Path to the local Whisper model.
- **`ollama`**: Set the endpoint, model name, and temperature.
- **`tts`**: Configure the voice, speaking rate, and Kokoro model path.
