Metadata-Version: 2.4
Name: ionbus-parquet-cache
Version: 1.4.0.0
Summary: Versioned Parquet dataset management with snapshots and multi-cache support
Home-page: https://github.com/ionbus/ionbus_parquet_cache
Author-email: Charles Lewis <dev@ionbus.info>
License-Expression: MIT
Project-URL: Homepage, https://github.com/ionbus/ionbus_parquet_cache
Project-URL: Repository, https://github.com/ionbus/ionbus_parquet_cache
Project-URL: PyPI, https://pypi.org/project/ionbus-parquet-cache/
Classifier: Programming Language :: Python :: 3
Classifier: Operating System :: OS Independent
Requires-Python: >=3.9
Description-Content-Type: text/markdown
Requires-Dist: duckdb>=1.0
Requires-Dist: ionbus-utils>=1.3
Requires-Dist: pandas>=1.3
Requires-Dist: polars
Requires-Dist: pyarrow>=16.0.1
Requires-Dist: pydantic>=2.7.2
Requires-Dist: pyyaml>=6.0.1
Requires-Dist: typing_extensions>=4.7.1
Requires-Dist: backports.strenum>=0.4.1; python_version < "3.11"
Requires-Dist: eval-type-backport; python_version < "3.10"
Dynamic: home-page

# ionbus_parquet_cache

Python tools for managing versioned Parquet datasets with date partitioning,
snapshot versioning, multi-cache lookup, YAML-driven dataset creation, and
CLI workflows for update, cleanup, and synchronization.

## Installation

```bash
pip install ionbus-parquet-cache
```

Or install from source:

```bash
pip install -e .
```

## Includes

- `CacheRegistry` for reading from one or more cache locations
- `DatedParquetDataset` for date-partitioned, incrementally updated datasets
- `NonDatedParquetDataset` for full-refresh reference datasets
- `DataSource` and `DataCleaner` extension points
- YAML configuration helpers for declarative dataset setup
- Snapshot lineage, cache history, YAML annotations, and optional external
  provenance sidecars
- CLI modules for dataset creation, updating, cleanup, cache sync, and
  post-sync hooks

Full [documentation on GitHub](https://github.com/ionbus/ionbus_parquet_cache).

## Requirements

- Python >= 3.9
- See `requirements.txt` for runtime dependencies

## License

MIT License
