Metadata-Version: 2.4
Name: extract-pdf-images
Version: 1.0.0
Summary: A lightweight GUI tool to visualize and extract images from PDF files
Author-email: PDF Image Extractor <example@example.com>
License: MIT
Project-URL: Homepage, https://github.com/bpiwowar/pdf-image-extractor
Project-URL: Repository, https://github.com/bpiwowar/pdf-image-extractor
Keywords: pdf,image,extraction,gui,tkinter
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: End Users/Desktop
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Programming Language :: Python :: 3.14
Classifier: Topic :: Multimedia :: Graphics
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: PyMuPDF>=1.23.0
Requires-Dist: Pillow>=10.0.0
Requires-Dist: PySide6>=6.5.0
Dynamic: license-file

# PDF Image Extractor

A lightweight Python application with a GUI to visualize and extract images from PDF files. The app displays PDF pages with red bounding boxes around detected images, allowing you to click on any box to save that image.

## Features

- 📄 **Visual PDF Navigation**: Browse through PDF pages with Previous/Next buttons or jump to specific pages
- 🎯 **Popup Image Preview**: Hover over image descriptions to see a popup with the actual extracted image
- 🖱️ **Click to Save**: Click on image descriptions in the list to extract and save
- 📑 **Smart Navigation**: 
  - **Outline Tab**: PDF table of contents with hierarchical navigation
  - **Thumbnails Tab**: Visual page browser with clickable thumbnails
- 🔍 **Zoom Controls**: Zoom in/out with +/- buttons or fit page to window
- 📊 **Image Info Panel**: Interactive list showing all images on the current page
- 💾 **Batch Extract**: Extract all images from the current page at once
- 🚀 **Pure Python**: No external system dependencies required

## Installation

### From Source

Clone or download this repository, then install:

```bash
pip install -e .
```

### From PyPI

```bash
pip install pdf-image-extractor
```

That's it! No system dependencies needed - PyMuPDF includes everything required.

## Usage

After installation, run from anywhere using the command:

```bash
pdf-image-extractor
```

Or open a PDF directly from the command line:

```bash
pdf-image-extractor /path/to/document.pdf
```

Or run directly from the source directory:

```bash
python -m pdf_image_extractor.app [optional-pdf-file]
```

### Quick Start Guide

1. **Open a PDF**: Click "Open PDF" button or provide a file path as argument
2. **Navigate**: 
   - Use Previous/Next (◀ ▶) buttons or type a page number
   - Click on entries in the **Outline** tab for TOC navigation
   - Switch to **Thumbnails** tab for visual page browsing
3. **View Images**: Check the right panel for a list of all images on the current page
4. **Preview**: Hover your mouse over any image in the list to see a popup preview
5. **Extract Image**: Click on an image in the list to save it
6. **Zoom**: Use +/- buttons to zoom, or click "Fit" to fit page to window (auto-fits on open)
7. **Batch Extract**: Click "Extract All" to save all images from the current page

## Development

### Install in Development Mode

```bash
pip install -e ".[dev]"
```

## License

MIT License - Feel free to use and modify as needed.
