Metadata-Version: 2.1
Name: improve-diarization-with-llm
Version: 0.0.2
Summary: This tool can take a long script (greater than 10 hours) of diarized content and improve the diarization by prompting an LLM model to look for obviously incorrect attribution and fix it.
Home-page: https://github.com/russedavid/improve-diarization-with-llm
Author: rijimuge
Author-email: david.russell04@gmail.com
License: Apache Software License 2.0
Keywords: nbdev jupyter notebook python
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Natural Language :: English
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: License :: OSI Approved :: Apache Software License
Requires-Python: >=3.7
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: claudette
Provides-Extra: dev

# improve-diarization-with-llm


<!-- WARNING: THIS FILE WAS AUTOGENERATED! DO NOT EDIT! -->

## Install

``` sh
pip install improve_diarization_with_llm
```

## How to use

``` python
import os
from improve_diarization_with_llm import claude_corrector
    
os.environ['ANTHROPIC_API_KEY'] = 'your-api-key'  # Replace with your actual API key
input_file = 'path/to/your/input/transcript.txt'  # Replace with your actual input file path
output_file = 'path/to/your/output/improved_transcript.txt'  # Replace with your desired output file path
    
corrector = claude_corrector.ClaudeDiarizationCorrector(input_file, output_file)

# corrector.process_conversation() this assumes a valid ANTHROPIC_API_KEY environment variable and input path
```
