Metadata-Version: 2.4
Name: clearspark
Version: 0.0.3
Summary: A curated collection of essential PySpark functions for daily data engineering.
Project-URL: Repository, https://github.com/v-skolder/clearspark
Project-URL: owner, https://github.com/v-skolder
Author-email: Vinicius <vinnyuniverso3@gmail.com>
License: MIT
License-File: LICENSE
Keywords: bucketing,categorization,data-engineering,dataframe,etl,pyspark,spark-utils
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.8
Requires-Dist: pydantic>=2.0
Requires-Dist: pyspark>=3.5.0
Provides-Extra: dev
Requires-Dist: pytest; extra == 'dev'
Requires-Dist: pytest-cov; extra == 'dev'
Description-Content-Type: text/markdown

<img src="https://raw.githubusercontent.com/v-skolder/clearspark/main/docs/assets/images/readme-logo.png" alt="clearspark logo"/>

# clearspark

**clearspark** is a lightweight PySpark utility library that makes common data transformation patterns cleaner, faster to write, and easier to read. Instead of rewriting the same boilerplate `when`/`otherwise` chains, you call expressive, validated one-liner functions.

📖 **[Function reference →](https://github.com/v-skolder/clearspark/blob/main/docs/functions.md)**

---

## Installation

```bash
pip install clearspark
```
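
clearspark requires Python 3.8 or newer, and pip installs `pyspark>=3.5.0` and `pydantic>=2.0` alongside it. To confirm which version ended up in your environment, a minimal sketch using only the standard library might look like the following. The helper name `installed_version` is hypothetical and not part of clearspark.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed.

    Hypothetical helper for illustration; it is not provided by clearspark.
    """
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("clearspark")
    print(f"clearspark {found}" if found else "clearspark is not installed")
```

The lookup reads the distribution metadata rather than importing the package, so it does not start Spark.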

---

## Importing

```python
import clearspark.functions as cf
```