Metadata-Version: 2.4
Name: datablob
Version: 0.3.0
Summary: Client for Updating a Simple Data Warehouse on Blob Storage
Home-page: https://github.com/gocarta/datablob
Download-URL: https://github.com/gocarta/datablob/tarball/download
Author: Daniel J. Dufour
Author-email: daniel.j.dufour@gmail.com
Keywords: data,python
Classifier: Development Status :: 4 - Beta
Classifier: Programming Language :: Python :: 3
Classifier: License :: CC0 1.0 Universal (CC0 1.0) Public Domain Dedication
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: boto3
Requires-Dist: geopandas
Requires-Dist: pandas
Requires-Dist: pyarrow
Requires-Dist: shapely
Requires-Dist: tzdata
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: download-url
Dynamic: home-page
Dynamic: keywords
Dynamic: license-file
Dynamic: requires-dist
Dynamic: summary

# datablob
Client for Updating a Simple Data Warehouse on Blob Storage

## design philosophy
- optimize for simplicity and user friendliness
- storage is cheap (compared to compute)
- pre-compute as much as possible
- should work out of the box
- advanced configuration should be opt-in
- explicit is better than implicit
- straightforwardness over magic

## install
```sh
pip install datablob
```

## supported formats
- csv
- [geojson points](https://geojson.org/)
- json
- [json lines](https://jsonlines.org/)
- [parquet](https://parquet.apache.org/), including [geoparquet](https://geoparquet.org/)

## usage
More examples coming soon
```py
from datablob import DataBlobClient

client = DataBlobClient(bucket_name="example-test-bucket-123", bucket_path="prefix/to/dataportal")

client.update_dataset(name="fleet", version="2", data=rows)
# automatically creates the following files
# s3://example-test-bucket-123/prefix/to/dataportal/fleet/v2/meta.json
# s3://example-test-bucket-123/prefix/to/dataportal/fleet/v2/data.csv
# s3://example-test-bucket-123/prefix/to/dataportal/fleet/v2/data.json
# s3://example-test-bucket-123/prefix/to/dataportal/fleet/v2/data.jsonl
# s3://example-test-bucket-123/prefix/to/dataportal/fleet/v2/data.parquet
```
