Metadata-Version: 2.4
Name: local-transcription-studio
Version: 0.1.0
Summary: Privacy-first desktop transcription app using local Whisper models
Author-email: Builder Agent <builder@zachos.dev>
License: MIT
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: fastapi>=0.109.0
Requires-Dist: uvicorn[standard]>=0.27.0
Requires-Dist: openai-whisper>=20231117
Requires-Dist: torch>=2.1.0
Requires-Dist: torchaudio>=2.1.0
Requires-Dist: pyannote.audio>=3.1.1
Requires-Dist: pydub>=0.25.1
Requires-Dist: python-multipart>=0.0.6
Requires-Dist: jinja2>=3.1.3
Requires-Dist: aiofiles>=23.2.1
Requires-Dist: stripe>=8.0.0
Requires-Dist: python-dotenv>=1.0.0
Provides-Extra: dev
Requires-Dist: pytest>=8.0.0; extra == "dev"
Requires-Dist: pytest-asyncio>=0.23.0; extra == "dev"
Requires-Dist: black>=24.0.0; extra == "dev"
Requires-Dist: ruff>=0.2.0; extra == "dev"
Dynamic: license-file
Dynamic: requires-python

# Local Transcription Studio

Privacy-first audio/video transcription powered by OpenAI's Whisper—100% local, no cloud uploads, no subscriptions.

## What is this?

Local Transcription Studio is a desktop application that transcribes audio and video files on your machine using OpenAI's Whisper model. Your files never leave your computer. The app handles speaker diarization, generates timestamps, and exports transcripts in multiple formats (SRT, VTT, TXT)—filling the gap between expensive cloud services and command-line tools.

## Features

- **100% Local Processing** – Whisper runs on your machine; no data uploaded to the cloud
- **Drag-and-Drop Interface** – Drop media files directly onto the app to transcribe
- **Speaker Diarization** – Identify and label different speakers in conversations
- **Multiple Export Formats** – Save transcripts as SRT, VTT, or TXT
- **Timestamps** – Accurate timing for every phrase in the transcript
- **No Subscriptions** – One-time setup, unlimited transcription
- **Privacy by Design** – Your audio stays private; no tracking or data collection
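
As an illustration of how speaker labels and timestamps can be combined, the sketch below assigns each transcript segment to the speaker whose turn overlaps it the most. This is a minimal sketch, not the app's actual implementation: the `Segment` and `SpeakerTurn` types and the `label_segments` function are hypothetical names introduced here, and the speaker turns are assumed to come from a diarization step.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcribed phrase with start/end times in seconds (hypothetical type)."""
    start: float
    end: float
    text: str


@dataclass
class SpeakerTurn:
    """A span of time attributed to one speaker (hypothetical type)."""
    start: float
    end: float
    speaker: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the length in seconds of the intersection of two intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(segments, turns, unknown="UNKNOWN"):
    """Pair each segment with the speaker whose turn overlaps it the most.

    Segments that overlap no turn at all are labeled with `unknown`.
    """
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = unknown, 0.0
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labeled.append((best_speaker, seg))
    return labeled
```

Choosing the speaker by maximum overlap is one simple heuristic; it keeps a segment that straddles a speaker change attached to whoever spoke for most of it.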

## Quick Start

### Installation

1. Clone the repository:
   ```bash
   git clone https://github.com/yourusername/local-transcription-studio.git
   cd local-transcription-studio
   ```

2. Install dependencies:
   ```bash
   pip install -r requirements.txt
   ```

3. Set up environment variables:
   ```bash
   cp .env.example .env
   ```

4. Start the application:
   ```bash
   python -m local_transcription_studio.main
   ```

The app opens in your default browser at `http://localhost:5000`.

## Usage

1. **Load Media** – Drag and drop an audio or video file (MP3, WAV, MP4, etc.) into the interface
2. **Configure** – (Optional) Adjust settings such as language and speaker detection sensitivity
3. **Transcribe** – Click "Transcribe" and wait for processing to complete
4. **Export** – Download your transcript in your preferred format (SRT, VTT, or TXT)
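
To make the export step concrete, here is a minimal sketch of how timestamped segments could be rendered as SRT and VTT. It assumes segments are `(start_seconds, end_seconds, text)` tuples; `format_timestamp`, `to_srt`, and `to_vtt` are hypothetical helper names for illustration, not the app's actual exporter.

```python
def format_timestamp(seconds: float, separator: str = ",") -> str:
    """Format seconds as HH:MM:SS<sep>mmm (SRT uses ',', WebVTT uses '.')."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{separator}{ms:03d}"


def to_srt(segments) -> str:
    """Render segments as SRT: numbered cues separated by blank lines."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        timing = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{timing}\n{text.strip()}")
    return "\n\n".join(blocks) + "\n"


def to_vtt(segments) -> str:
    """Render segments as WebVTT: a 'WEBVTT' header followed by cues."""
    cues = []
    for start, end, text in segments:
        timing = f"{format_timestamp(start, '.')} --> {format_timestamp(end, '.')}"
        cues.append(f"{timing}\n{text.strip()}")
    return "WEBVTT\n\n" + "\n\n".join(cues) + "\n"
```

The two formats differ mainly in the header line, the cue numbering, and the millisecond separator, which is why a single timestamp helper can serve both.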

### Example Workflow

```
1. Drop video.mp4 onto the app
2. Select "English" language
3. Enable speaker diarization
4. Click Transcribe (roughly 2-5 min; depends on file length, model size, and hardware)
5. Export as SRT for video editing, or TXT for sharing
```
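
For readers curious what the transcription step in this workflow might look like in code, the sketch below calls the `openai-whisper` package directly. It is an assumption about how such a step could be written, not the app's actual code: `transcribe_file` and `simplify_segments` are hypothetical names, and the `"base"` model size is chosen only for illustration. Whisper also needs `ffmpeg` available on the system to decode media files.

```python
from typing import List, Optional, Tuple


def simplify_segments(result: dict) -> List[Tuple[float, float, str]]:
    """Reduce a Whisper result dict to (start, end, text) tuples."""
    return [
        (float(seg["start"]), float(seg["end"]), seg["text"].strip())
        for seg in result.get("segments", [])
    ]


def transcribe_file(
    path: str,
    language: Optional[str] = "en",
    model_name: str = "base",
) -> List[Tuple[float, float, str]]:
    """Transcribe a media file locally and return timestamped segments.

    Passing language=None lets Whisper detect the language itself.
    """
    # Imported lazily so this module can be loaded without the model stack.
    import whisper

    model = whisper.load_model(model_name)  # downloads weights on first use
    result = model.transcribe(path, language=language)
    return simplify_segments(result)


# Example (requires openai-whisper and ffmpeg to be installed):
# segments = transcribe_file("video.mp4", language="en")
```

Larger models are generally more accurate but slower, so the model size is a natural setting to expose alongside the language.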

## Tech Stack

- **Backend**: Python, FastAPI (served with uvicorn)
- **Transcription Engine**: OpenAI Whisper
- **Frontend**: HTML5, JavaScript
- **Speaker Diarization**: pyannote.audio
- **Audio Processing**: ffmpeg, pydub
- **Testing**: pytest
- **Packaging**: setuptools

## License

MIT – See [LICENSE](LICENSE) for details.

---

**Want to learn more?** Check out [OVERVIEW.md](OVERVIEW.md) for architecture details or [MONETIZATION.md](MONETIZATION.md) for business model information.
