Metadata-Version: 2.4
Name: jellyjoin
Version: 0.1.2
Summary: Utilities for dataset similarity and joins
Author-email: Oran Looney <olooney@gmail.com>
License: MIT
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE.txt
Requires-Dist: jellyfish>=1.2.0
Requires-Dist: numpy>=1.26.4
Requires-Dist: pandas>=2.2.2
Requires-Dist: scipy>=1.13.0
Provides-Extra: dev
Requires-Dist: python-dotenv; extra == "dev"
Requires-Dist: openai; extra == "dev"
Requires-Dist: build; extra == "dev"
Requires-Dist: twine; extra == "dev"
Requires-Dist: black; extra == "dev"
Requires-Dist: pytest; extra == "dev"
Requires-Dist: matplotlib>=3.8.4; extra == "dev"
Dynamic: license-file

# Jellyjoin Python Package

"Jellyjoin: the softest of joins."

Join dataframes or lists based on semantic similarity.

Author: Oran Looney
License: MIT
Year: 2025

---

## Installation

```bash
pip install jellyjoin
```

---

## Usage

```python
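import jellyjoin

# left_df and right_df are DataFrames you have already loaded;
# replace the column names below with your own.
jelly_df = jellyjoin.jellyjoin(
    left_df,
    right_df,
    left_column="Column Name",
    right_column="Other Column Name",
    threshold=0.7,
    similarity_strategy=jellyjoin.PairwiseSimilarity(),
)

print(jelly_df)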
```
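
To give a feel for what a similarity-based join with a threshold does, here is a minimal, self-contained sketch. It is not jellyjoin's implementation: the `similarity` and `soft_join` functions are hypothetical names used only for illustration, the metric is `difflib.SequenceMatcher` from the standard library as a stand-in, and the greedy one-to-one pairing is an assumption rather than the library's actual matching rule.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a case-insensitive similarity score in [0, 1] (stand-in metric)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def soft_join(left, right, threshold=0.7):
    """Greedily pair items one-to-one, best score first, keeping scores >= threshold."""
    # Score every (left, right) pair and sort from most to least similar.
    scored = sorted(
        (
            (similarity(l, r), i, j)
            for i, l in enumerate(left)
            for j, r in enumerate(right)
        ),
        reverse=True,
    )
    used_left, used_right, pairs = set(), set(), []
    for score, i, j in scored:
        if score < threshold:
            break  # all remaining pairs score lower still
        if i in used_left or j in used_right:
            continue  # each item may appear in at most one match
        used_left.add(i)
        used_right.add(j)
        pairs.append((left[i], right[j], score))
    return pairs


companies = ["Apple Inc.", "Microsoft Corp", "Alphabet"]
vendors = ["microsoft corporation", "apple inc", "Tesla"]

for left_value, right_value, score in soft_join(companies, vendors):
    print(f"{left_value!r} ~ {right_value!r} ({score:.2f})")
```

Raising the threshold keeps only near-exact matches; lowering it admits looser pairs at the cost of more false positives. Items with no partner above the threshold, such as "Alphabet" and "Tesla" here, are simply left unmatched.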

---

## Development

To set up a development environment:

```bash
git clone https://github.com/<your-username>/jellyjoin.git
cd jellyjoin
pip install -e ".[dev]"
```

Run tests:

```bash
pytest
```
