Metadata-Version: 2.4
Name: openadapt-viewer
Version: 0.2.0
Summary: Standalone HTML viewer generation for OpenAdapt ML dashboards and benchmarks
Project-URL: Homepage, https://github.com/OpenAdaptAI/openadapt-viewer
Project-URL: Repository, https://github.com/OpenAdaptAI/openadapt-viewer
Project-URL: Documentation, https://github.com/OpenAdaptAI/openadapt-viewer#readme
Project-URL: Issues, https://github.com/OpenAdaptAI/openadapt-viewer/issues
Project-URL: Changelog, https://github.com/OpenAdaptAI/openadapt-viewer/releases
Author-email: Richard Abrich <richard@openadapt.ai>
Maintainer: OpenAdaptAI
License-Expression: MIT
License-File: LICENSE
Keywords: benchmarks,dashboard,html,ml,viewer,visualization
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering :: Visualization
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.10
Requires-Dist: imageio>=2.37.2
Requires-Dist: jinja2>=3.1.0
Requires-Dist: pillow>=10.0.0
Requires-Dist: playwright>=1.57.0
Requires-Dist: plotly>=5.18.0
Requires-Dist: pydantic>=2.0.0
Provides-Extra: all
Requires-Dist: playwright>=1.40.0; extra == 'all'
Requires-Dist: pytest>=8.0.0; extra == 'all'
Requires-Dist: ruff>=0.1.0; extra == 'all'
Provides-Extra: dev
Requires-Dist: pytest>=8.0.0; extra == 'dev'
Requires-Dist: ruff>=0.1.0; extra == 'dev'
Provides-Extra: screenshots
Requires-Dist: playwright>=1.40.0; extra == 'screenshots'
Description-Content-Type: text/markdown

# openadapt-viewer

[![Build Status](https://github.com/OpenAdaptAI/openadapt-viewer/actions/workflows/publish.yml/badge.svg)](https://github.com/OpenAdaptAI/openadapt-viewer/actions/workflows/publish.yml)
[![PyPI version](https://img.shields.io/pypi/v/openadapt-viewer.svg)](https://pypi.org/project/openadapt-viewer/)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![Python 3.10+](https://img.shields.io/badge/python-3.10%2B-blue)](https://www.python.org/downloads/)

Reusable component library for OpenAdapt visualization. Build standalone HTML viewers for training dashboards, benchmark results, capture playback, and demo retrieval.

## Features

- **Component-based**: Reusable building blocks (screenshot, playback, metrics, filters)
- **Composable**: Combine components to build custom viewers
- **Standalone HTML**: Generated files work offline, no server required
- **Event transcript**: Real-time audio transcription synchronized with playback
- **Consistent styling**: Shared CSS variables and dark mode support
- **Alpine.js integration**: Lightweight interactivity out of the box

## Installation

```bash
pip install openadapt-viewer
```

Or with uv:
```bash
uv add openadapt-viewer
```

## Quick Start

### Using Components

```python
from openadapt_viewer.components import (
    screenshot_display,
    playback_controls,
    metrics_grid,
    filter_bar,
    badge,
)

# Screenshot with click overlays
html = screenshot_display(
    image_path="screenshot.png",
    overlays=[
        {"type": "click", "x": 0.5, "y": 0.3, "label": "H", "variant": "human"},
        {"type": "click", "x": 0.6, "y": 0.4, "label": "AI", "variant": "predicted"},
    ],
)

# Metrics cards
html = metrics_grid([
    {"label": "Total Tasks", "value": 100},
    {"label": "Passed", "value": 75, "color": "success"},
    {"label": "Failed", "value": 25, "color": "error"},
    {"label": "Success Rate", "value": "75%", "color": "accent"},
])
```

### Using PageBuilder

Build complete pages from components:

```python
from openadapt_viewer.builders import PageBuilder
from openadapt_viewer.components import metrics_grid, screenshot_display

builder = PageBuilder(title="My Viewer", include_alpine=True)

builder.add_header(
    title="Benchmark Results",
    subtitle="Model: gpt-5.1",
    nav_tabs=[
        {"href": "dashboard.html", "label": "Training"},
        {"href": "viewer.html", "label": "Viewer", "active": True},
    ],
)

builder.add_section(
    metrics_grid([
        {"label": "Tasks", "value": 100},
        {"label": "Passed", "value": 75, "color": "success"},
    ]),
    title="Summary",
)

# Render to file
builder.render_to_file("output.html")
```

### Ready-to-Use Viewers

5 production viewers available:

1. **Benchmark Viewer** - Visualize benchmark evaluation results
2. **Capture Viewer** - Playback recorded GUI interactions
3. **Training Dashboard** - Monitor ML training progress (via openadapt-ml)
4. **Retrieval Viewer** - Display demo search results (via openadapt-retrieval)
5. **Segmentation Viewer** - View episode segmentation results

```python
from openadapt_viewer.viewers.benchmark import generate_benchmark_html

# From benchmark results directory
generate_benchmark_html(
    data_path="benchmark_results/run_001/",
    output_path="viewer.html",
)
```

All viewers use the canonical component-based pattern. See `VIEWER_PATTERNS.md` for details.

## CLI Usage

```bash
# Generate demo benchmark viewer
openadapt-viewer demo --tasks 10 --output viewer.html

# Generate from benchmark results
openadapt-viewer benchmark --data results/run_001/ --output viewer.html
```

## Components

All components return HTML strings that can be composed together. Use them with PageBuilder or embed inline.

### Core Components

| Component | Description | Example Use Case |
|-----------|-------------|-----------------|
| `screenshot_display` | Screenshot with click/highlight overlays | Capture frames, demo screenshots |
| `playback_controls` | Play/pause/speed controls for step playback | Video-like playback |
| `timeline` | Progress bar for step navigation | Scrub through recordings |
| `action_display` | Format actions (click, type, scroll, etc.) | Display action details |
| `metrics_card` | Single statistic card | Individual metric display |
| `metrics_grid` | Grid of metric cards | Summary dashboards |
| `filter_bar` | Filter dropdowns with optional search | Filter and search data |
| `filter_dropdown` | Single dropdown filter | Domain/status filters |
| `selectable_list` | List with selection support | Task lists, file lists |
| `list_item` | Individual list item | Custom list entries |
| `badge` | Status badges (pass/fail, etc.) | Status indicators |

### Enhanced Components

| Component | Description | Example Use Case |
|-----------|-------------|-----------------|
| `video_playback` | Video playback from screenshot sequences | Smooth capture playback |
| `video_playback_with_actions` | Video + synchronized action overlay | Capture with action overlay |
| `action_timeline` | Timeline with action markers | Action sequence view |
| `action_timeline_vertical` | Vertical action timeline | Compact action view |
| `comparison_view` | Side-by-side comparison | Before/after, A/B test |
| `overlay_comparison` | Overlay comparison with slider | Image comparison |
| `action_type_filter` | Filter by action type | Filter clicks/types/scrolls |
| `action_type_pills` | Action type pill buttons | Quick action filtering |
| `action_type_dropdown` | Action type dropdown | Compact action filter |
| `failure_analysis_panel` | Failure analysis dashboard | Benchmark failure analysis |
| `failure_summary_card` | Failure summary card | Individual failure details |

**Total: 22 components** available for building viewers.

See `VIEWER_PATTERNS.md` for complete usage examples.

## Project Structure

```
src/openadapt_viewer/
├── components/           # Reusable UI building blocks
│   ├── screenshot.py     # Screenshot with overlays
│   ├── playback.py       # Playback controls
│   ├── timeline.py       # Progress bar
│   ├── action_display.py # Action formatting
│   ├── metrics.py        # Stats cards
│   ├── filters.py        # Filter dropdowns
│   ├── list_view.py      # Selectable lists
│   └── badge.py          # Status badges
├── builders/             # High-level page builders
│   └── page_builder.py   # PageBuilder class
├── styles/               # Shared CSS
│   └── core.css          # CSS variables and base styles
├── core/                 # Core utilities
│   ├── types.py          # Pydantic models
│   └── html_builder.py   # Jinja2 utilities
├── viewers/              # Full viewer implementations
│   └── benchmark/        # Benchmark results viewer
├── examples/             # Reference implementations
│   ├── benchmark_example.py
│   ├── training_example.py
│   ├── capture_example.py
│   └── retrieval_example.py
└── templates/            # Jinja2 templates
```

## Audio Transcript Feature

The viewer includes a powerful **audio transcript** feature that displays real-time transcription of captured audio alongside the visual playback. This is particularly useful for:

- **Debugging workflows**: See what was said at each step
- **Documentation**: Auto-generate narrative descriptions of recorded sessions
- **Analysis**: Correlate verbal instructions with UI actions
- **Training**: Review narrated demonstrations with synchronized visuals

### Key Capabilities

The transcript panel provides:

- **Timestamped transcription**: Each transcript segment is stamped with its time in the recording (e.g., `0:00.00`, `0:05.60`)
- **Synchronized playback**: Transcript automatically highlights and scrolls as the video plays
- **Searchable text**: Find specific moments in long recordings by searching transcript content
- **Copy functionality**: Export transcript text for documentation or analysis

### How It Works

When captures are recorded with audio (using `openadapt-capture`'s audio recording features), the viewer automatically:

1. Displays the transcript in a dedicated panel in the sidebar
2. Timestamps each transcript segment relative to the recording start time
3. Syncs transcript highlighting with the current playback position
4. Updates the displayed transcript as you navigate through events

The transcript appears alongside the event list and event details, providing a complete picture of what happened during the recording.

## Synthetic Demo Viewer

**NEW:** Interactive browser-based viewer for synthetic WAA demonstration data.

### Quick Start

```bash
# Open the synthetic demo viewer
open synthetic_demo_viewer.html
```

### What It Shows

- **82 synthetic demos** across 6 domains (notepad, paint, clock, browser, file_explorer, office)
- **Filter by domain** and select specific tasks
- **View demo content** with syntax-highlighted steps
- **See how demos are used** in actual API prompts
- **Impact comparison**: 33% → 100% accuracy improvement with demo-conditioned prompting
- **Action reference**: All 8 action types (CLICK, TYPE, WAIT, etc.)

### Purpose

Synthetic demos are **AI-generated example trajectories** that show step-by-step how to complete Windows automation tasks. They are included in prompts when calling Claude/GPT APIs during benchmark evaluation - this is called **demo-conditioned prompting**.

**Impact:** Improved first-action accuracy from 33% to 100%!

### Documentation

- **Quick Start**: `QUICK_REFERENCE.md` - One-page overview
- **Complete Guide**: `SYNTHETIC_DEMOS_EXPLAINED.md` - Full explanation
- **Examples**: `DEMO_EXAMPLES_SHOWCASE.md` - 5 diverse demo examples
- **Master Index**: `SYNTHETIC_DEMO_INDEX.md` - Central navigation hub

### Features

- Beautiful dark theme matching OpenAdapt style
- Domain filtering (All, Notepad, Paint, Clock, Browser, File Explorer, Office)
- Task selector with estimated step counts
- Dual-panel display: demo content + prompt usage
- Side-by-side impact comparison (with vs without demos)
- Complete action types reference
- Fully self-contained (no external dependencies)
- Works offline

See `SYNTHETIC_DEMO_INDEX.md` for complete documentation.

---

## Screenshots

### Full Viewer Interface

The viewer provides a complete interface for exploring captured GUI interactions with playback controls, timeline navigation, event details, and **real-time audio transcript**.

![Turn off Night Shift - Full Viewer](docs/images/turn-off-nightshift_full.png)
*Interactive viewer showing the "Turn off Night Shift" workflow with screenshot display (center), event list (right sidebar top), and **audio transcript** (right sidebar bottom)*

### Playback Controls

Step through captures with playback controls, timeline scrubbing, and keyboard shortcuts (Space to play/pause, arrow keys to navigate).

![Playback Controls](docs/images/turn-off-nightshift_controls.png)
*Timeline and playback controls with overlay toggle, plus event details and **synchronized transcript panel***

### Event List, Details, and Transcript

Browse all captured events with detailed information about each action. The **transcript panel** displays timestamped audio transcription that syncs with playback, showing exactly what was said at each moment in the recording.

![Event List](docs/images/turn-off-nightshift_events.png)
*Event list sidebar showing captured actions with timing and type information, plus **live audio transcript with timestamps***

### Demo Workflow

![Demo Workflow](docs/images/demo_new_full.png)
*Example demo workflow viewer*

## Examples

Run the examples to see how different OpenAdapt packages can use the component library:

```bash
# Benchmark results (openadapt-evals)
python -m openadapt_viewer.examples.benchmark_example

# Training dashboard (openadapt-ml)
python -m openadapt_viewer.examples.training_example

# Capture playback (openadapt-capture)
python -m openadapt_viewer.examples.capture_example

# Retrieval results (openadapt-retrieval)
python -m openadapt_viewer.examples.retrieval_example
```

### Generating Screenshots

To regenerate the README screenshots:

```bash
# Install playwright (one-time setup)
uv pip install "openadapt-viewer[screenshots]"
uv run playwright install chromium

# Install openadapt-capture (required)
cd ../openadapt-capture
uv pip install -e .
cd ../openadapt-viewer

# Generate screenshots
uv run python scripts/generate_readme_screenshots.py

# Or with custom options
uv run python scripts/generate_readme_screenshots.py \
  --capture-dir /path/to/openadapt-capture \
  --output-dir docs/images \
  --max-events 50
```

The script will:
1. Load captures from `openadapt-capture` (turn-off-nightshift and demo_new)
2. Generate interactive HTML viewers
3. Take screenshots using Playwright
4. Save screenshots to `docs/images/`

## Development

```bash
# Clone and install
git clone https://github.com/OpenAdaptAI/openadapt-viewer.git
cd openadapt-viewer
uv sync --all-extras

# Run tests
uv run pytest tests/ -v

# Run linter
uv run ruff check .
```

## Integration

Used by other OpenAdapt packages:

- **openadapt-ml**: Training dashboards and model comparison
- **openadapt-evals**: Benchmark result visualization
- **openadapt-capture**: Capture recording playback
- **openadapt-retrieval**: Demo search result display

## Documentation

- **[VIEWER_PATTERNS.md](VIEWER_PATTERNS.md)** - Canonical pattern for building viewers (MUST READ for new viewers)
- **[MIGRATION_GUIDE.md](MIGRATION_GUIDE.md)** - Step-by-step guide for converting inline viewers to component-based
- **[ARCHITECTURE.md](ARCHITECTURE.md)** - System architecture and design patterns
- **[CATALOG_SYSTEM.md](CATALOG_SYSTEM.md)** - Automatic recording discovery and indexing
- **[SEARCH_FUNCTIONALITY.md](SEARCH_FUNCTIONALITY.md)** - Token-based search implementation
- **[EPISODE_TIMELINE_QUICKSTART.md](EPISODE_TIMELINE_QUICKSTART.md)** - Adding episode timelines to viewers

## License

MIT License - see LICENSE file for details.
