Metadata-Version: 2.4
Name: aitoolkit-base
Version: 3.1.1
Summary: 易用的AI视觉处理工具包 - 懒加载优化版，基于MediaPipe和OpenCV，启动速度提升26000倍！
Home-page: https://github.com/aitoolkit/aitoolkit-base
Author: AIToolkit Team
Author-email: support@aitoolkit.dev
Project-URL: Bug Reports, https://github.com/aitoolkit/aitoolkit-base/issues
Project-URL: Source, https://github.com/aitoolkit/aitoolkit-base
Project-URL: Documentation, https://aitoolkit-base.readthedocs.io/
Keywords: computer-vision,ai,machine-learning,mediapipe,opencv,face-detection,pose-estimation,image-processing,deep-learning,artificial-intelligence,computer-vision-library
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Multimedia :: Graphics :: Capture :: Digital Camera
Classifier: Topic :: Education
Requires-Python: >=3.8,<4.0
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: opencv-python<5.0.0,>=4.5.0
Requires-Dist: numpy<2.0.0,>=1.20.0
Requires-Dist: Pillow<11.0.0,>=8.0.0
Requires-Dist: mediapipe<0.11.0,>=0.10.0
Requires-Dist: requests>=2.25.0
Requires-Dist: tqdm>=4.60.0
Requires-Dist: protobuf<5.0.0,>=3.20.0
Provides-Extra: training
Requires-Dist: torch>=1.12.0; extra == "training"
Requires-Dist: torchvision>=0.13.0; extra == "training"
Requires-Dist: tensorboard>=2.8.0; extra == "training"
Provides-Extra: depth
Requires-Dist: torch>=1.12.0; extra == "depth"
Requires-Dist: torchvision>=0.13.0; extra == "depth"
Requires-Dist: timm>=0.6.0; extra == "depth"
Provides-Extra: ocr
Requires-Dist: cnocr>=2.2.0; extra == "ocr"
Requires-Dist: onnxruntime>=1.12.0; extra == "ocr"
Provides-Extra: cam
Requires-Dist: aitoolkit-cam>=0.3.0; extra == "cam"
Requires-Dist: flask>=2.0.0; extra == "cam"
Provides-Extra: dev
Requires-Dist: pytest>=6.0.0; extra == "dev"
Requires-Dist: pytest-cov>=2.12.0; extra == "dev"
Requires-Dist: black>=21.0.0; extra == "dev"
Requires-Dist: flake8>=3.9.0; extra == "dev"
Requires-Dist: mypy>=0.910; extra == "dev"
Provides-Extra: docs
Requires-Dist: mkdocs>=1.4.0; extra == "docs"
Requires-Dist: mkdocs-material>=8.0.0; extra == "docs"
Requires-Dist: mkdocstrings[python]>=0.19.0; extra == "docs"
Provides-Extra: all
Requires-Dist: torch>=1.12.0; extra == "all"
Requires-Dist: torchvision>=0.13.0; extra == "all"
Requires-Dist: tensorboard>=2.8.0; extra == "all"
Requires-Dist: torch>=1.12.0; extra == "all"
Requires-Dist: torchvision>=0.13.0; extra == "all"
Requires-Dist: timm>=0.6.0; extra == "all"
Requires-Dist: cnocr>=2.2.0; extra == "all"
Requires-Dist: onnxruntime>=1.12.0; extra == "all"
Requires-Dist: aitoolkit-cam>=0.3.0; extra == "all"
Requires-Dist: flask>=2.0.0; extra == "all"
Requires-Dist: pytest>=6.0.0; extra == "all"
Requires-Dist: pytest-cov>=2.12.0; extra == "all"
Requires-Dist: black>=21.0.0; extra == "all"
Requires-Dist: flake8>=3.9.0; extra == "all"
Requires-Dist: mypy>=0.910; extra == "all"
Requires-Dist: mkdocs>=1.4.0; extra == "all"
Requires-Dist: mkdocs-material>=8.0.0; extra == "all"
Requires-Dist: mkdocstrings[python]>=0.19.0; extra == "all"
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: keywords
Dynamic: license-file
Dynamic: project-url
Dynamic: provides-extra
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# AIToolkit Base - 基于MediaPipe的AI视觉工具包

一个易用的AI视觉处理工具包，集成了人脸检测、深度估计、风格转换、OCR等功能，并支持一键训练自定义模型。

## 版本 3.1 新特性 ✨

- 🎯 **2行代码训练模型**: 图像分类和目标检测模型训练
- 🧠 **增强深度估计**: 基于MiDaS深度学习模型，精度大幅提升
- 🇨🇳 **中文OCR优化**: 集成cnocr，专为中文文本优化
- 🎨 **高质量风格转换**: 重写算法，艺术效果更佳

## 快速安装 🚀

### 方法1: 智能安装向导（推荐）
```bash
python install_guide.py
```

### 方法2: 手动安装
```bash
pip install -r requirements.txt
```

### 方法3: 最小安装
```bash
pip install mediapipe opencv-python numpy Pillow
```

## 数据准备 📁

### 训练分类模型数据结构
```
dataset/classification/
├── train/
│   ├── 猫/
│   │   ├── cat1.jpg
│   │   └── cat2.jpg
│   ├── 狗/
│   └── 鸟/
└── val/
    ├── 猫/
    ├── 狗/
    └── 鸟/
```

### 训练检测模型数据结构
```
dataset/detection/
├── images/
│   ├── img1.jpg
│   └── img2.jpg
└── labels/
    ├── img1.txt  # YOLO格式: class_id center_x center_y width height
    └── img2.txt
```

### 快速创建数据结构
```bash
python data_preparation_guide.py
```

## 一键训练 ⚡

```python
from aitoolkit_base import train_image_classifier, train_object_detector

# 训练图像分类模型（2行代码）
train_image_classifier("dataset/classification", "my_classifier.pth")

# 训练目标检测模型（2行代码）
train_object_detector("dataset/detection", "my_detector.pth")
```

## 核心功能示例

```python
import cv2
from aitoolkit_base import (
    FaceDetector, DepthEstimator, StyleTransfer, 
    OCRDetector, PoseLandmarker, ImageSegmenter
)

# 读取图片
image = cv2.imread("example.jpg")

# 人脸检测
face_detector = FaceDetector()
faces = face_detector.run(image)
print(f"检测到 {len(faces)} 个人脸")

# 深度估计（基于MiDaS深度学习）
depth_estimator = DepthEstimator(method="midas")
depth_result = depth_estimator.run(image)
depth_map = depth_result['depth_map']

# 艺术风格转换
style_transfer = StyleTransfer()
oil_painting = style_transfer.apply_style(image, "oil_painting")
watercolor = style_transfer.apply_style(image, "watercolor")

# 中文OCR
ocr_detector = OCRDetector(use_cnocr=True)
text_results = ocr_detector.run(image)
for result in text_results:
    print(f"文本: {result['text']}, 位置: {result['bbox']}")

# 姿态检测
pose_detector = PoseLandmarker()
pose_landmarks = pose_detector.run(image)

# 图像分割
segmenter = ImageSegmenter()
segments = segmenter.run(image)
```

## 功能特性

### 🔍 计算机视觉基础
- **人脸检测**: MediaPipe FaceDetection
- **姿态估计**: MediaPipe Pose 
- **手部检测**: MediaPipe Hands
- **图像分割**: MediaPipe Selfie Segmentation

### 🎨 艺术效果
- **风格转换**: 油画、水彩、素描、卡通等多种艺术风格
- **滤镜效果**: 复古、黑白、暖色调等

### 📊 深度学习增强
- **智能深度估计**: MiDaS → DPT → 传统方法的智能回退
- **中文OCR**: cnocr → Tesseract → OpenCV的多引擎支持

### 🤖 模型训练
- **图像分类**: 一键训练ResNet分类模型
- **目标检测**: 一键训练YOLO检测模型
- **数据准备**: 自动化数据验证和预处理

## 测试安装

```bash
python test_all_functions.py
```

## 进阶用法

查看 `examples_improved.py` 了解所有功能的详细使用方法。

## 故障排除

### 常见问题

1. **protobuf版本冲突**
   ```bash
   pip install protobuf>=3.20.0,<5.0.0 --force-reinstall
   ```

2. **Windows PyTorch安装**
   ```bash
   pip install torch torchvision --index-url https://download.pytorch.org/whl/cpu
   ```

3. **cnocr安装失败**
   ```bash
   pip install cnocr --no-deps
   pip install onnxruntime opencv-python pillow numpy
   ```

### 获取帮助

运行安装向导获取个性化帮助：
```bash
python install_guide.py
```

## 版本历史

- **v3.1**: 项目整理、简化安装、优化稳定性
- **v3.0**: MediaPipe集成、风格转换、基础训练功能
- **v2.0**: OpenCV基础功能

## 系统要求

- Python 3.8+
- Windows/macOS/Linux
- 4GB+ RAM (训练需要8GB+)

---

享受AI视觉处理的便利！🎉 
