Metadata-Version: 2.4
Name: timefence
Version: 0.9.1
Summary: Temporal correctness layer for ML training data
Project-URL: Homepage, https://timefence.dev
Project-URL: Documentation, https://timefence.dev/documentation
Project-URL: Repository, https://github.com/gauthierpiarrette/timefence
Project-URL: Issues, https://github.com/gauthierpiarrette/timefence/issues
Project-URL: Changelog, https://github.com/gauthierpiarrette/timefence/blob/main/CHANGELOG.md
Author: Gauthier Piarrette
License-Expression: MIT
License-File: LICENSE
Keywords: audit,data-leakage,data-quality,feature-engineering,machine-learning,ml-training,mlops,point-in-time,temporal-correctness,time-series
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development :: Quality Assurance
Classifier: Typing :: Typed
Requires-Python: >=3.9
Requires-Dist: click>=8.0.0
Requires-Dist: duckdb>=1.0.0
Requires-Dist: pandas>=1.5.0
Requires-Dist: pyyaml>=6.0
Requires-Dist: rich>=13.0.0
Provides-Extra: all
Requires-Dist: dbt-core>=1.0.0; extra == 'all'
Requires-Dist: ipython>=7.0.0; extra == 'all'
Provides-Extra: dbt
Requires-Dist: dbt-core>=1.0.0; extra == 'dbt'
Provides-Extra: dev
Requires-Dist: hypothesis>=6.0.0; extra == 'dev'
Requires-Dist: pre-commit>=3.0.0; extra == 'dev'
Requires-Dist: pytest-cov>=4.0.0; extra == 'dev'
Requires-Dist: pytest>=7.0.0; extra == 'dev'
Requires-Dist: ruff>=0.4.0; extra == 'dev'
Requires-Dist: taskipy>=1.12.0; extra == 'dev'
Provides-Extra: notebook
Requires-Dist: ipython>=7.0.0; extra == 'notebook'
Description-Content-Type: text/markdown

<p align="center">
  <img src="docs/logo.png" alt="Timefence" width="80">
</p>

<h1 align="center">Timefence</h1>

<p align="center">
  <strong>Your ML model may be trained on the future. Find out in one command.</strong>
</p>

<p align="center">
  <a href="https://github.com/gauthierpiarrette/timefence/actions/workflows/ci.yml"><img src="https://github.com/gauthierpiarrette/timefence/actions/workflows/ci.yml/badge.svg" alt="CI"></a>
  <a href="https://codecov.io/gh/gauthierpiarrette/timefence"><img src="https://codecov.io/gh/gauthierpiarrette/timefence/graph/badge.svg" alt="codecov"></a>
  <a href="https://pypi.org/project/timefence/"><img src="https://img.shields.io/pypi/v/timefence" alt="PyPI"></a>
  <a href="https://pypi.org/project/timefence/"><img src="https://img.shields.io/pypi/pyversions/timefence" alt="Python"></a>
  <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-green.svg" alt="License: MIT"></a>
</p>

<p align="center">
  <a href="https://timefence.dev">Website</a> &middot;
  <a href="https://timefence.dev/documentation">Docs</a> &middot;
  <a href="CHANGELOG.md">Changelog</a> &middot;
  <a href="CONTRIBUTING.md">Contributing</a>
</p>

---

Timefence finds and fixes temporal data leakage in ML training sets. No infrastructure required — runs locally, reads Parquet/CSV, and finishes in seconds.

If you build training data by joining features to labels, your model may be training on the future. A `LEFT JOIN` or `merge_asof` gives each label the latest feature row — including data from *after* the event you're predicting. The model trains on the future. Offline metrics look great. Production doesn't match. No error, no warning, no way to tell from the output alone.

```bash
pip install timefence
```

## Try It in 60 Seconds

```bash
timefence quickstart churn-example && cd churn-example
timefence audit data/train_LEAKY.parquet
```

```
TEMPORAL AUDIT REPORT
Scanned 5,000 rows

WARNING  LEAKAGE DETECTED in 3 of 4 features

  LEAK  rolling_spend_30d
        1,520 rows (30.4%) use feature data from the future
        Severity: HIGH

  LEAK  days_since_login
        4,909 rows (98.2%) use feature data from the future
        Severity: HIGH

  OK    user_country - clean (5,000 rows)
  OK    account_age_days - clean (5,000 rows)
```

Rebuild it with temporal correctness:

```bash
timefence build --labels data/labels.parquet --features features.py --output train_CLEAN.parquet
```

```
Building training set...

  Labels     5,000 rows from data/labels.parquet
  Features   4 features

  Joining with point-in-time correctness (feature_time < label_time):

  OK  user_country         5,000 / 5,000 matched
  OK  account_age_days     5,000 / 5,000 matched
  OK  rolling_spend_30d    5,000 / 5,000 matched
  OK  days_since_login     5,000 / 5,000 matched

  Written   train_CLEAN.parquet (5,000 rows, 7 cols)
```

Verify:

```bash
timefence audit train_CLEAN.parquet
# ALL CLEAN - no temporal leakage detected
```

## Audit Your Existing Data

You don't need to change your pipeline. Point Timefence at any training set you already have:

```bash
timefence audit your_training_set.parquet --features features.py --keys user_id --label-time label_time
```

If it's clean, you'll know. If it's not, you'll see exactly which features leak, how many rows, and the severity. Takes seconds.

## Python API

Audit any existing dataset — no sources or feature definitions needed:

```python
import timefence

report = timefence.audit("train.parquet", keys=["user_id"], label_time="label_time")
report.assert_clean()  # raises if leakage found
```

Or define sources and features to build a correct dataset from scratch:

```python
users = timefence.Source(path="data/users.parquet", keys=["user_id"], timestamp="updated_at")
txns  = timefence.Source(path="data/txns.parquet", keys=["user_id"], timestamp="created_at")

country = timefence.Feature(source=users, columns=["country"])
spend   = timefence.Feature(source=txns, embargo="1d", name="spend_30d", sql="""
    SELECT user_id, created_at AS feature_time,
           SUM(amount) OVER (PARTITION BY user_id ORDER BY created_at
               RANGE BETWEEN INTERVAL 30 DAY PRECEDING AND CURRENT ROW) AS spend_30d
    FROM {source}
""")

labels = timefence.Labels(
    path="data/labels.parquet", keys=["user_id"],
    label_time="label_time", target=["churned"],
)

result = timefence.build(labels=labels, features=[country, spend], output="train.parquet")
```

## Add to CI

Stop leakage before it reaches production:

```yaml
- run: pip install timefence && timefence audit data/train.parquet --features features.py --strict
```

`--strict` exits with code 1 on leakage. Your pipeline fails before a leaky model ever trains.

## Performance

Built on DuckDB's columnar engine. Median of 3 runs after warmup (Intel i7, 16 GB):

| Scenario | Labels | Features | Build | Audit |
|----------|--------|----------|-------|-------|
| Small project | 100K | 1 | **0.5s** | 0.3s |
| Typical project | 100K | 10 | **1.9s** | 1.7s |
| Large project | 1M | 1 | **3.0s** | 2.0s |
| Large + many features | 1M | 10 | **12s** | 8.5s |

Adding embargo, staleness, and splits costs seconds, not minutes.

<details>
<summary>Run benchmarks yourself</summary>

```bash
uv run python benchmarks/bench.py --quick
uv run python benchmarks/bench.py --quick --include-pandas
```

</details>

## How It Works

Timefence generates SQL (ASOF JOIN or ROW_NUMBER) and runs it in an embedded DuckDB. No server, no JVM, no Spark. It enforces one rule — `feature_time < label_time - embargo` — for every row, every feature, every build. Every query is inspectable via `timefence -v build` or `timefence explain`.

## All Features

| | |
|---|---|
| **Joins** | Point-in-time correct. ASOF JOIN fast path, ROW_NUMBER fallback |
| **Guardrails** | Embargo, max lookback, max staleness — all configurable |
| **Inputs** | Parquet, CSV, SQL query, DataFrame |
| **Feature modes** | Column selection, SQL, Python transform |
| **Splitting** | Time-based train / validation / test splits |
| **Caching** | Feature-level cache with content-hash keys |
| **Audit** | Full rebuild-and-compare or lightweight temporal check |
| **Reports** | Severity classification. JSON manifest, HTML report, Rich terminal |
| **CLI** | `quickstart` `build` `audit` `explain` `diff` `inspect` `catalog` `doctor` |
| **Flags** | `-v` verbose · `--debug` · `--strict` CI gate · `--json` · `--html` |

## What Timefence Is NOT

| Not This | Why | Use Instead |
|----------|-----|-------------|
| Feature store | No server, no online serving | Tecton, Feast |
| Data orchestrator | No scheduling, no DAGs | Airflow, Dagster |
| Data quality framework | Temporal correctness only | Great Expectations |
| ML pipeline framework | Produces training data only | MLflow, Metaflow |

One tool. One job. Temporal correctness for ML training data.

---

<p align="center">
  <a href="https://timefence.dev/documentation">Documentation</a> &middot;
  <a href="CONTRIBUTING.md">Contributing</a> &middot;
  <a href="CHANGELOG.md">Changelog</a>
</p>

<p align="center">MIT License</p>
