Metadata-Version: 2.4
Name: project-lighthouse-anonymize
Version: 1.0.0
Summary: Privacy-preserving data anonymization using k-anonymity and related algorithms
Project-URL: Homepage, https://github.com/airbnb/project-lighthouse-anonymize
Project-URL: Documentation, https://project-lighthouse-anonymize.readthedocs.io
Project-URL: Repository, https://github.com/airbnb/project-lighthouse-anonymize
Project-URL: Bug Tracker, https://github.com/airbnb/project-lighthouse-anonymize/issues
Author-email: Adam Bloomston <adam.bloomston@airbnb.com>
License: MIT
License-File: LICENSE
Keywords: anonymization,data-privacy,differential-privacy,k-anonymity,privacy
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Security
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: <3.13,>=3.9
Requires-Dist: fastparquet==2024.11.0
Requires-Dist: first<3.0.0,>=2.0.0
Requires-Dist: numba<1.0.0,>=0.56.0
Requires-Dist: numpy<2.0.0,>=1.20.0
Requires-Dist: pandas<3.0.0,>=1.3.0
Requires-Dist: scikit-learn<2.0.0,>=1.0.0
Requires-Dist: scipy<2.0.0,>=1.7.0
Requires-Dist: treelib<2.0.0,>=1.6.0
Requires-Dist: ucimlrepo==0.0.7
Provides-Extra: dev
Requires-Dist: pyright>=1.1.0; extra == 'dev'
Requires-Dist: pytest-cov>=4.0.0; extra == 'dev'
Requires-Dist: pytest-profiling>=1.0.0; extra == 'dev'
Requires-Dist: pytest-randomly>=3.0.0; extra == 'dev'
Requires-Dist: pytest-xdist>=3.0.0; extra == 'dev'
Requires-Dist: pytest>=8.0.0; extra == 'dev'
Requires-Dist: ruff>=0.1.0; extra == 'dev'
Provides-Extra: docs
Requires-Dist: myst-parser>=2.0.0; extra == 'docs'
Requires-Dist: sphinx-autodoc-typehints; extra == 'docs'
Requires-Dist: sphinx-rtd-theme>=2.0.0; extra == 'docs'
Requires-Dist: sphinx>=7.0.0; extra == 'docs'
Description-Content-Type: text/markdown

# Project Lighthouse Anonymize

[![PyPI version](https://badge.fury.io/py/project-lighthouse-anonymize.svg)](https://pypi.org/project/project-lighthouse-anonymize/)
[![Python versions](https://img.shields.io/pypi/pyversions/project-lighthouse-anonymize.svg)](https://pypi.org/project/project-lighthouse-anonymize/)
[![Documentation Status](https://readthedocs.org/projects/project-lighthouse-anonymize/badge/?version=latest)](https://project-lighthouse-anonymize.readthedocs.io/en/latest/)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

Privacy-preserving data anonymization using k-anonymity and related algorithms.

## Installation

```bash
pip install project-lighthouse-anonymize
```

## Documentation

Full documentation: https://project-lighthouse-anonymize.readthedocs.io

- [Getting Started Guide](docs/getting_started.md)
- [API Reference](https://project-lighthouse-anonymize.readthedocs.io/en/latest/api/)

## Publications

This work builds on research into privacy-preserving data analysis:

- [Measuring Discrepancies in Airbnb Guest Acceptance Rates Using Anonymized Demographic Data](https://arxiv.org/abs/2204.12001) - The foundational paper for Project Lighthouse
- [Core Mondrian: Scalable Mondrian for Partition-Based Anonymization](https://arxiv.org/abs/2510.09661) - Covers the anonymization algorithm
- [Measuring data quality for Project Lighthouse](https://arxiv.org/abs/2510.06121) - Covers the way we measure the impact of anonymization

## Contributing

We welcome contributions! Please see [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.

## Releasing

To create a new release, use the `/release` command in Claude Code. This will:
1. Analyze changes since the last release
2. Propose an appropriate version bump following semantic versioning
3. Update CHANGELOG.md
4. Create and push a git tag
5. Trigger automated PyPI publishing via GitHub Actions

## License

This project is licensed under the MIT License - see [LICENSE](LICENSE) file for details.

## Acknowledgments

Developed by the Airbnb Anti-Discrimination & Equity team.

## Citation

If you use this software in your research, please cite:

```
Bloomston, A., & Airbnb Anti-Discrimination & Equity Engineering Team. (2026).
Project Lighthouse Anonymize. https://github.com/airbnb/project-lighthouse-anonymize
```

BibTeX:
```bibtex
@software{bloomston2025lighthouse,
  author = {Bloomston, Adam and {Airbnb Anti-Discrimination \& Equity Engineering Team}},
  title = {Project Lighthouse Anonymize},
  year = {2026},
  url = {https://github.com/airbnb/project-lighthouse-anonymize},
  license = {MIT}
}
```
