Metadata-Version: 2.4
Name: orionvis
Version: 1.1.2
Summary: A Python package for computer vision tasks with integrated CUDA extensions.
Home-page: https://github.com/bluemoon-o2/orionvis
Author: bluemoon-o2
Author-email: bluemoon-o2 <2095774200@shu.edu.cn>
Project-URL: Homepage, https://github.com/bluemoon-o2/orionvis
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Requires-Python: >=3, <4
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: torch>=2.0.0
Requires-Dist: torchvision>=0.20.0
Requires-Dist: packaging>=21.0
Requires-Dist: ninja>=1.10.0
Requires-Dist: einops>=0.6.0
Requires-Dist: PyYAML>=6.0
Requires-Dist: lazy_object_proxy>=1.12.0
Requires-Dist: gdown>=5.0.0
Requires-Dist: tqdm>=4.0.0
Dynamic: author
Dynamic: home-page
Dynamic: license-file
Dynamic: requires-python

<div align="center">
<img src="./docs/OrionVis_Minimal_Logo.png" alt="OrionVis" width="40%">

# OrionVis

A PyTorch‑friendly Computer Vision toolkit for unified backbone loading, feature extraction, and registered preprocessing.

[![Python](https://img.shields.io/badge/python-3.9%2B-blue.svg)](https://www.python.org/)
[![License](https://img.shields.io/badge/license-Apache--2.0-green.svg)](https://www.apache.org/licenses/LICENSE-2.0)
[![Status](https://img.shields.io/badge/status-alpha-orange.svg)](#)
[![Issues](https://img.shields.io/github/issues/bluemoon-o2/orionvis?label=issues&logo=github)](https://github.com/bluemoon-o2/orionvis/issues)
</div>

## Overview
OrionVis streamlines common CV workflows in PyTorch: load backbones with a single call, manage pretrained weights via YAML with lazy indexing, and apply registered transforms for classification inference. It consolidates model, weight, and preprocessing logic while staying lightweight and production‑ready.

## Features
- Unified backbone loader: `resnet*`, `dino*`, `mobilemamba_*`.
- YAML‑based weights with cached index and lazy loading.
- Transform registry (e.g., `ImageClassification`) for ready‑to‑use preprocessing.
- Feature extraction helpers and utilities.

<img src="./docs/OrionVis_Banner.png" alt="OrionVis_Banner">

## Installation

**Prerequisites**:
- NVIDIA GPU with CUDA (tested on CUDA 11.8+)
- PyTorch >= 2.0
- Microsoft Visual C++ Build Tools (for Windows)

### Install from Pre-built Wheels (Recommended for Users)

If you do not want to compile from source, you can install the pre-built wheel package directly:

```bash
pip install orionvis
```

### Development Installation (Recommended for Developers)

For Windows users, we provide a `build.bat` script that handles environment setup, cleaning, and editable installation:

```bash
git clone https://github.com/bluemoon-o2/orionvis
cd orionvis
# install in editable mode
build.bat
```


## CUDA Extensions

OrionVis comes with integrated high-performance CUDA extensions to accelerate specific operations. These extensions are automatically compiled during installation.

### 1. Selective Scan (Mamba)
Optimized CUDA implementation of the Selective Scan mechanism used in Mamba architectures (`mamba_mobile`, etc.). It provides significant speedups compared to the pure PyTorch implementation.

- **Backend**: `orionvis.extentions.selective_scan`
- **Usage**: Automatically used by `mobilemamba` backbones when available.

### 2. InPlace Activated Batch Normalization (InPlace-ABN)
A memory-efficient implementation of Batch Normalization combined with activation functions. It performs the activation in-place, saving GPU memory during training.

- **Backend**: `orionvis.extentions.inplace_abn`
- **Usage**:
```python
from orionvis.extentions.inplace_abn import InPlaceABN

# Use just like standard BatchNorm2d but with activation included
layer = InPlaceABN(num_features=64, activation="leaky_relu", activation_param=0.01)
```

## Quickstart
```python
import orionvis

# Load a backbone with pretrained weights
model = orionvis.load("mobilemamba_b1s")

# Retrieve weight metadata and preprocessing transform
weight = orionvis.get_weights("mobilemamba_b1s")
transform = orionvis.get_transform("mobilemamba_b1s")

# Example: preprocess an input PIL image
# img = transform(img)
# logits = model(img.unsqueeze(0))
```

## OrionVis Hub

OrionVis Hub allows you to load models and entrypoints directly from GitHub repositories or local directories.

### Load Model

Load a model or entrypoint from a GitHub repository:

```python
import orionvis.hub as hub

# Load from GitHub
model = hub.load("pytorch/vision:v0.10.0", "resnet18")

# Load from a local directory
# model = hub.load("/path/to/local/repo", "custom_model", source="local")
```

### List Entrypoints

List all available entrypoints in a repository:

```python
import orionvis.hub as hub

entrypoints = hub.entrypoints("pytorch/vision:v0.10.0")
print(entrypoints)
```

### View Documentation

View the docstring of a specific entrypoint:

```python
import orionvis.hub as hub

doc = hub.docs("pytorch/vision:v0.10.0", "resnet18")
print(doc)
```

### Load State Dict

Download and load a state dictionary from a URL:

```python
import orionvis.hub as hub

state_dict = hub.load_state_dict_from_url(
    "https://download.pytorch.org/models/resnet18-5c106cde.pth",
    check_hash=True
)
```

## Models
- Programmatic list: `orionvis.list_models()`
- Included backbones: `resnext*`, `wide_resnet*`, `dinov2_*`, `dinov3_*`, `mobilemamba_*`.

## Weights
- Config path: `orionvis/configs/weights/mobilemamba.yaml` (multi‑model YAML; one class per document).
- Download path cache: `~/.cache/orionvis` (configurable via `ORIONVIS_CACHE`).

## License
[Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0)

## Citation
```
@article{mobilemamba,
  title={MobileMamba: Lightweight Multi-Receptive Visual Mamba Network},
  author={Haoyang He and Jiangning Zhang and Yuxuan Cai and Hongxu Chen and Xiaobin Hu and Zhenye Gan and Yabiao Wang and Chengjie Wang and Yunsheng Wu and Lei Xie},
  journal={arXiv preprint arXiv:2411.15941},
  year={2024}
}
```

## References
- [torchvision](https://github.com/pytorch/vision)
- [DINOv2](https://github.com/facebookresearch/dinov2)
- [DINOv3](https://github.com/facebookresearch/dinov3)
- [MobileMamba](https://github.com/lewandofskee/MobileMamba)
