Metadata-Version: 2.4
Name: jorvik
Version: 1.1.1
Summary: A set of utilities for creating and managing ETL Pipelines with pyspark.
Home-page: https://github.com/jorvik-io/jorvik
Author: https://github.com/jorvik-io
License: Apache License 2.0
Keywords: ETL,Pyspark,Data Engineering,Data Pipelines
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: buildtools
Requires-Dist: delta-spark
Requires-Dist: pandas
Requires-Dist: pyarrow
Requires-Dist: pyspark
Provides-Extra: tests
Requires-Dist: pytest; extra == "tests"
Requires-Dist: flake8; extra == "tests"
Requires-Dist: pytest-mock; extra == "tests"
Requires-Dist: numpy<2.0.0; extra == "tests"
Dynamic: author
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: keywords
Dynamic: license
Dynamic: license-file
Dynamic: provides-extra
Dynamic: requires-dist
Dynamic: summary

# Jorvik
[![Build](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml/badge.svg)](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml)
[![Coverage Status](https://coveralls.io/repos/github/jorvik-io/jorvik/badge.svg?branch=main)](https://coveralls.io/github/jorvik-io/jorvik?branch=main)

Jorvik is a collection of utilities for creating and managing ETL pipeline in Pyspark. Build from Data Engineers for Data Engineers.

## Contribute
The Jorvik project welcomes your expertise and enthusiasm!

Writing code isn’t the only way to contribute. You can also:

- review pull requests
- suggest improvements through issues
- let us know your painpoints and repeatitive tasks
- help us stay on top of new and old issues
- develop tutorials, videos, presentations, and other educational materials

See [How to Contribute](https://github.com/jorvik-io/jorvik/blob/main/CONTRIBUTING.md) for instructions on setting up your local machine and opening your first Pull Request.

## Getting Started.
Jorvik is available in Pypi and can be installed with pip

```bash
pip install jorvik
```

Packages:
- [Storage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/storage/README.md): Interact with the storage layer
- [Pipelines](https://github.com/jorvik-io/jorvik/blob/main/jorvik/pipelines/README.md): Build and test etl pipelines with ease
- [Data Lineage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/data_lineage/README.md): Track data lineage
