Metadata-Version: 2.3
Name: mvid
Version: 0.1.0
Summary: Simple video reading and writing
Author: Adam Alcolado
Author-email: Adam Alcolado <adam.alcolado@mtl.ai>
Requires-Dist: av
Requires-Dist: numpy
Requires-Python: >=3.13
Description-Content-Type: text/markdown

# mvid
mvid is a simple library that treats a video as a sequence (like a list) of NumPy arrays.

```python
from mvid import Video

with Video("myvideo.mp4") as video:
    # get the number of frames
    print(len(video))

    # random access
    frame = video[57]
    
    # iterate over all frames in the video
    for frame in video:
        pass
```
It is built on top of PyAV (with minimal to no overhead) and abstracts away seeking and timing logic.

# Requirements
Requires [PyAV](https://pyav.basswood-io.com/docs/stable/) and NumPy.

# How it works
To look up a frame, we seek to the nearest preceding keyframe and decode forward until we reach the requested index.
We compute the index of each decoded frame from its timestamp together with the stream’s frame rate.
This approach works well for videos with consistent timing metadata, but not all files follow those assumptions.
Some files use variable frame rates or contain incomplete or inconsistent timestamps. In those cases
there is no reliable way to infer a stable frame index without first scanning every frame and assigning
indices explicitly. Rather than performing that preprocessing step, we intentionally fail when we encounter
timing metadata that cannot be interpreted unambiguously.
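
The timestamp-to-index mapping described above can be sketched as follows. This is an illustration of the idea, not mvid's actual code: the function name `frame_index`, the `start_pts` parameter, and the strict "must be an exact integer" check are assumptions made for the example. Exact rational arithmetic avoids floating-point rounding for rates such as 30000/1001.

```python
from fractions import Fraction


def frame_index(
    pts: int,
    time_base: Fraction,
    frame_rate: Fraction,
    start_pts: int = 0,
) -> int:
    """Map a presentation timestamp to a frame index (hypothetical helper).

    pts and start_pts are in units of time_base (seconds per tick);
    frame_rate is in frames per second.
    """
    # Seconds elapsed since the first frame, as an exact fraction.
    seconds = (pts - start_pts) * time_base
    # At a constant frame rate this lands exactly on an integer.
    index = seconds * frame_rate
    if index.denominator != 1:
        # Assumption for this sketch: refuse to guess instead of rounding.
        raise ValueError(
            f"timestamp {pts} does not fall on a frame boundary (index {index})"
        )
    return int(index)


# 30 fps with a 1/15360 time base: each frame spans 512 ticks.
print(frame_index(57 * 512, Fraction(1, 15360), Fraction(30)))

# NTSC-style 30000/1001 fps with a 1/30000 time base: 1001 ticks per frame.
print(frame_index(4 * 1001, Fraction(1, 30000), Fraction(30000, 1001)))
```

A timestamp that falls between frame boundaries, as can happen with variable-frame-rate files, raises `ValueError` in this sketch, mirroring the fail-instead-of-guess behavior described above.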

# Performance
Sequential access adds little on top of PyAV's own decoding. To check this, run `benchmark.py` and compare
with `ffmpeg -i <my_video> -f null -`, which decodes the video and discards the output. The benchmarking script
also tries random access and various thread parameters so you can see what performance to expect.
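
For a quick comparison of sequential and random access on your own files, a rough timing helper might look like the sketch below. It is not `benchmark.py`; the name `time_access` and its parameters are hypothetical. It relies only on `len()`, iteration, and integer indexing, so an open `Video` could be passed in place of the stand-in list used here.

```python
import random
import time


def time_access(frames, n_random: int = 100, seed: int = 0) -> dict[str, float]:
    """Return frames per second for sequential and random access."""
    # Sequential pass: iterate over every frame once.
    start = time.perf_counter()
    count = 0
    for _ in frames:
        count += 1
    sequential_s = max(time.perf_counter() - start, 1e-9)

    # Random pass: index n_random frames chosen with a fixed seed.
    rng = random.Random(seed)
    indices = [rng.randrange(len(frames)) for _ in range(n_random)]
    start = time.perf_counter()
    for i in indices:
        frames[i]
    random_s = max(time.perf_counter() - start, 1e-9)

    return {
        "sequential_fps": count / sequential_s,
        "random_fps": n_random / random_s,
    }


# Stand-in data; with mvid you would pass the opened video object instead.
stats = time_access(list(range(10_000)))
print(stats)
```

Random access is usually the slower of the two on real video, since each lookup may decode several frames after the preceding keyframe.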

There is overhead from conversion to NumPy arrays. We also provide a more "raw" `AVVideo` class that
performs all the bookkeeping without NumPy conversion.

# Related projects
[torchcodec](https://github.com/meta-pytorch/torchcodec) is a heavier-duty library that returns PyTorch tensors.
It also offers index-based access (among other options). It requires you to manage your own FFmpeg installation.