Metadata-Version: 2.4
Name: audim
Version: 0.0.7
Summary: An animation and video rendering engine for audio-based and voice-based podcast videos.
Author-email: Atanu Sarkar <mratanusarkar@gmail.com>
Maintainer-email: Atanu Sarkar <mratanusarkar@gmail.com>
License: Apache 2.0 License
Project-URL: Homepage, https://github.com/mratanusarkar/audim
Project-URL: Documentation, https://mratanusarkar.github.io/audim
Project-URL: Repository, https://github.com/mratanusarkar/audim
Project-URL: Bug Tracker, https://github.com/mratanusarkar/audim/issues
Project-URL: Changelog, https://mratanusarkar.github.io/audim/devblog/index.md#changelog
Keywords: podcast,animation,video,audio,engine,transcription,subtitle,generation
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: End Users/Desktop
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Multimedia :: Video
Classifier: Topic :: Multimedia :: Sound/Audio
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.10
Description-Content-Type: text/markdown
License-File: LICENSE
License-File: NOTICE
Requires-Dist: pysrt>=1.1.2
Requires-Dist: opencv-python>=4.9.0.80
Requires-Dist: numpy==1.26.4
Requires-Dist: moviepy==2.0.0.dev2
Requires-Dist: Pillow>=10.2.0
Requires-Dist: matplotlib>=3.8.0
Requires-Dist: whisperx==3.3.1
Requires-Dist: torch==2.2.0
Requires-Dist: torchaudio==2.2.0
Requires-Dist: pydub==0.25.1
Provides-Extra: dev
Requires-Dist: pip>=24.2; extra == "dev"
Requires-Dist: uv>=0.4.20; extra == "dev"
Requires-Dist: pytest>=8.3.3; extra == "dev"
Requires-Dist: isort>=5.13.2; extra == "dev"
Requires-Dist: black>=24.10.0; extra == "dev"
Requires-Dist: ruff>=0.6.9; extra == "dev"
Provides-Extra: docs
Requires-Dist: mkdocs>=1.6.1; extra == "docs"
Requires-Dist: mkdocstrings>=0.26.1; extra == "docs"
Requires-Dist: mkdocstrings-python>=1.11.1; extra == "docs"
Requires-Dist: mkdocs-material>=9.5.39; extra == "docs"
Requires-Dist: mkdocs-minify-plugin>=0.8.0; extra == "docs"
Requires-Dist: mkdocs-glightbox>=0.4.0; extra == "docs"
Requires-Dist: mkdocs-jupyter>=0.25.0; extra == "docs"
Requires-Dist: jupyter>=1.1.1; extra == "docs"
Dynamic: license-file

<div align='center'>

# Audim ✨

[![Documentation](https://img.shields.io/badge/docs-mkdocs-4baaaa.svg?logo=materialformkdocs&logoColor=white)](https://mratanusarkar.github.io/audim)
[![PyPI version](https://img.shields.io/pypi/v/audim.svg?color=blue&logo=pypi&logoColor=white)](https://pypi.org/project/audim/)
[![Python versions](https://img.shields.io/pypi/pyversions/audim.svg?color=blue&logo=python&logoColor=white)](https://pypi.org/project/audim/)
[![GitHub Actions Workflow Status](https://img.shields.io/github/actions/workflow/status/mratanusarkar/audim/deploy.yml?logo=githubactions&logoColor=white)](https://github.com/mratanusarkar/audim/actions)
<br>
[![License: Apache 2.0](https://img.shields.io/badge/License-Apache%202.0-orange.svg?logo=apache&logoColor=white)](https://github.com/mratanusarkar/audim/blob/main/LICENSE)
[![Author: Atanu Sarkar](https://img.shields.io/badge/Author-Atanu%20Sarkar-708FCC?logo=github&logoColor=white)](https://github.com/mratanusarkar)
[![Citation](https://img.shields.io/badge/Cite%20this-Repository-green?logo=gitextensions&logoColor=white)](https://github.com/mratanusarkar/audim/blob/main/CITATION.cff)

**Au**dio Po**d**cast An**im**ation Engine

> _An animation and video rendering engine for audio-based and voice-based podcast videos._

|
[Documentation](https://mratanusarkar.github.io/audim) |
[Features](#-features) |
[Getting Started](#-getting-started) |
[Quick Links](#-quick-links)
|

</div>

## 🚀 Demo

<div align='center'>

<!-- for html supported places like PyPI -->
<div style="text-align: center; margin: 20px 0;">
  <video width="100%" controls>
    <source src="https://github.com/user-attachments/assets/634df0ca-77ee-448b-ac35-f4eb3e4261b9" type="video/mp4">
    Your browser does not support the video element.
  </video>
</div>

https://github.com/user-attachments/assets/634df0ca-77ee-448b-ac35-f4eb3e4261b9

*A sample podcast video generated with Audim*

</div>

> [!NOTE]
> 
> For this example,
> we have transformed a conversation between Grant Sanderson (from [3Blue1Brown](https://www.3blue1brown.com/)) and Sal Khan (from [Khan Academy](https://www.khanacademy.org/)) from [YouTube](https://www.youtube.com/watch?v=SAhKohb5e_w&t=1179s) into a visually engaging podcast video using Audim.
> 
> See [docs/devblog/v0.0.7](https://mratanusarkar.github.io/audim/devblog/v0.0.7/) for more details on how this video was generated.

## 🔗 Quick Links

1. Getting Started
    - See [Setup](https://mratanusarkar.github.io/audim/setup/installation/) and ensure you have setup correctly before usage.
    - For developers and contributors, see [Development](https://mratanusarkar.github.io/audim/setup/development/).
2. API Documentation
    - See [API Docs](https://mratanusarkar.github.io/audim/audim/) for the `audim` API documentation.
3. Usage and Examples
    - See [Usage](https://mratanusarkar.github.io/audim/usage/) for usage examples.
4. Dev Blog
    - See [Dev Blog](https://mratanusarkar.github.io/audim/devblog/) for the development blog of the project to gain more insights into the project.
    - See [Changelog](https://mratanusarkar.github.io/audim/devblog/#changelog) for the changelog of the project.

## 🎯 Introduction

Audim is an engine for precise programmatic animation and rendering of podcast videos from audio-based and voice-based file recordings.

## ✨ Features

- 💻 Precise programmatic animations.
- 🎬 Rendering of videos with layout based scenes.
- 📝 Generate subtitles and transcripts from audio/video files.
- 🎤 From subtitle and scene elements to podcast video generation.

## 🚀 Getting Started

### Prerequisites

- 🐍 Python ≥ 3.10
- 🖥️ Conda or venv
- 🎥 FFmpeg (recommended, for faster video encoding)

### Installation

#### 1. Install Audim

It is recommended to install Audim in a virtual environment from PyPI or Conda in a [Python=3.10](https://www.python.org/) environment.

Install `audim` package from PyPI:

```bash
pip install audim
```

<details>

<summary>Install from source</summary>

<br>

By installing `audim` from source, you can explore the latest features and enhancements that have not yet been officially released.
Please note that the latest changes may be still in development and may not be stable and may contain bugs.

#### Install from source

```bash
pip install git+https://github.com/mratanusarkar/audim.git
```

OR, you can also clone the repository and install the package from source:

#### Clone the repository

```bash
git clone https://github.com/mratanusarkar/audim.git
```

</details>

#### 2. Install FFmpeg locally (recommended)

Using local FFmpeg is optional but recommended for speeding up the video encoding process.

On Ubuntu, install FFmpeg using:

```bash
sudo apt install ffmpeg libx264-dev
```

On Windows and other platforms, download and install FFmpeg from the official website:

- [Download FFmpeg](https://ffmpeg.org/download.html)
- Ensure FFmpeg is in your system PATH

<details>

<summary>Virtual environment and project setup for development with uv</summary>

<br>

#### Install `uv` and setup project environment:

> **IMPORTANT**
> 
> If you are using conda base environment as the default base environment for your python projects, run the below command to activate the base environment. If not, skip this step and continue with the next step.
>
> ```bash
> conda activate base
> ```

```bash
# Install uv
pip install uv

# Setup project environment
uv venv

source .venv/bin/activate   # on Linux
# .venv\Scripts\activate    # on Windows

uv pip install -e ".[dev,docs]"
```

#### Build and deploy documentation

You can build and serve the documentation by running:

```bash
uv pip install -e .[docs]
mkdocs serve
```

## Code Quality

Before committing, please ensure that the code is formatted and styled correctly.
Run the following commands to check and fix code style issues:

```bash
# Check and fix code style issues
ruff format .
ruff check --fix .
```

See [Development](https://mratanusarkar.github.io/audim/setup/development/) for more details on how to setup the project environment and contribute to the project.

</details>

## ⚖️ License & Attribution

Audim is licensed under **Apache 2.0**. You can use it freely for personal and commercial projects.

**Attribution is encouraged.** If you use Audim, please:

- Keep the default watermark in videos, OR
- Add "Made with Audim" to video descriptions, OR  
- Link to this repo in your project documentation

> See [NOTICE](./NOTICE) file for complete attribution guidelines.

## 📄 Citation

If you use Audim in your project or research, please cite it as follows:

```bibtex
@software{audim,
  title = {Audim: Audio Podcast Animation Engine},
  author = {Sarkar, Atanu},
  year = {2025},
  url = {https://github.com/mratanusarkar/audim},
  version = {0.0.7}
}
```

You can also click the **"Cite this repository"** button on GitHub for other citation formats.

## ⚠️ Disclaimer

> [!WARNING]
> **Early Development Stage**
> 
> - This project is actively under development and may contain bugs or limitations.
> - While stable for basic use cases, the rendering engine requires further development and testing across diverse scenarios.
> - The API is subject to change, so keep an eye at the documentation for the latest updates.

> [!TIP]
> **We encourage you to:**
> 
> - Try Audim for your projects and podcast videos.
> - [Report issues](https://github.com/mratanusarkar/audim/issues) when encountered.
> - Feel free to raise a PR to contribute and improve the project.

_Your feedback and contributions help make Audim better for everyone!_
