Metadata-Version: 2.4
Name: tigerflow-ml
Version: 0.1.0a1
Summary: ML task library for TigerFlow
Project-URL: Repository, https://github.com/princeton-ddss/tigerflow-ml
Project-URL: Documentation, https://princeton-ddss.github.io/tigerflow-ml/latest
Author-email: Colin Swaney <colinswaney@princeton.edu>, Sangyoon Park <sp8538@princeton.edu>
License-Expression: MIT
License-File: LICENSE
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Scientific/Engineering :: Image Recognition
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: <3.14,>=3.10
Requires-Dist: opencv-python-headless
Requires-Dist: pillow
Requires-Dist: pymupdf>=1.27.1
Requires-Dist: sentencepiece
Requires-Dist: tigerflow>=0.3.0
Requires-Dist: timm
Requires-Dist: torch>=2.0
Requires-Dist: transformers>=4.45
Description-Content-Type: text/markdown

# tigerflow-ml

[![CI](https://github.com/princeton-ddss/tigerflow-ml/actions/workflows/ci.yml/badge.svg)](https://github.com/princeton-ddss/tigerflow-ml/actions/workflows/ci.yml)
[![PyPI](https://img.shields.io/pypi/v/tigerflow-ml)](https://pypi.org/project/tigerflow-ml/)
[![Docs](https://img.shields.io/badge/docs-latest-blue)](https://princeton-ddss.github.io/tigerflow-ml/)

ML tasks for [TigerFlow](https://github.com/princeton-ddss/tigerflow) — private cloud ML APIs on HPC infrastructure.

## Installation

```bash
pip install tigerflow-ml
```

## Tasks

| Task             | Description                         | Entry Point                       |
|------------------|-------------------------------------|-----------------------------------|
| OCR              | Extract text from images and PDFs   | `ocr` / `ocr-local`               |
| Translation      | Translate text documents            | `translate` / `translate-local`   |
| Transcription    | Transcribe audio to text            | `transcribe` / `transcribe-local` |
| Object Detection | Detect objects in images and videos | `detect` / `detect-local`         |

Each task provides both a Slurm variant (for HPC) and a Local variant (for development).

## Usage

After installation, tasks are automatically discoverable via:

```bash
tigerflow tasks list
```

Run a task directly:

```bash
python -m tigerflow_ml.text.ocr.slurm --help
python -m tigerflow_ml.text.translate.slurm --help
python -m tigerflow_ml.audio.transcribe.slurm --help
python -m tigerflow_ml.image.detect.slurm --help
```

## Development

```bash
uv sync --group dev
uv run pre-commit run --all-files
uv run pytest tests
```
