Metadata-Version: 2.1
Name: dataops-testgen
Version: 5.33.3
Summary: DataKitchen's Data Quality DataOps TestGen
Author-email: "DataKitchen, Inc." <info@datakitchen.io>
Maintainer-email: "DataKitchen, Inc." <info@datakitchen.io>
Project-URL: Source Code, https://github.com/DataKitchen/dataops-testgen
Project-URL: Bug Tracker, https://github.com/DataKitchen/dataops-testgen/issues
Project-URL: Documentation, https://docs.datakitchen.io/testgen/what-is-testgen/
Project-URL: Release Notes, https://docs.datakitchen.io/testgen/release-notes/
Project-URL: Slack, https://data-observability-slack.datakitchen.io/join
Project-URL: Homepage, https://example.com
Keywords: dataops,data,quality,testing,database,profiling
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Development Status :: 5 - Production/Stable
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: System :: Monitoring
Requires-Python: >=3.11
Description-Content-Type: text/markdown
License-File: LICENSE
License-File: NOTICE
Requires-Dist: PyYAML ==6.0.3
Requires-Dist: click ==8.3.1
Requires-Dist: sqlalchemy ==2.0.48
Requires-Dist: databricks-sql-connector ==4.2.5
Requires-Dist: databricks-sqlalchemy ==2.0.9
Requires-Dist: databricks-sdk >=0.20.0
Requires-Dist: snowflake-sqlalchemy ==1.9.0
Requires-Dist: sqlalchemy-bigquery ==1.16.0
Requires-Dist: oracledb ==3.4.0
Requires-Dist: hdbcli ==2.25.31
Requires-Dist: sqlalchemy-hana ==4.4.0
Requires-Dist: pyodbc ==5.2.0
Requires-Dist: psycopg2-binary ==2.9.11
Requires-Dist: pycryptodome ==3.21
Requires-Dist: prettytable ==3.7.0
Requires-Dist: requests-extensions ==1.1.3
Requires-Dist: numpy ==2.1.3
Requires-Dist: pandas ==2.2.3
Requires-Dist: streamlit ==1.55.0
Requires-Dist: streamlit-extras ==0.3.0
Requires-Dist: streamlit-aggrid ==0.3.4.post3
Requires-Dist: plotly-express ==0.4.1
Requires-Dist: streamlit-authenticator ==0.2.3
Requires-Dist: streamlit-javascript ==0.1.5
Requires-Dist: progress ==1.6
Requires-Dist: beautifulsoup4 ==4.12.3
Requires-Dist: trino ==0.327.0
Requires-Dist: xlsxwriter ==3.2.0
Requires-Dist: psutil ==5.9.8
Requires-Dist: concurrent-log-handler ==0.9.25
Requires-Dist: cryptography ==46.0.6
Requires-Dist: validators ==0.33.0
Requires-Dist: reportlab ==4.2.2
Requires-Dist: cron-converter ==1.2.1
Requires-Dist: cron-descriptor ==2.0.5
Requires-Dist: pybars3 ==0.9.7
Requires-Dist: azure-identity ==1.25.1
Requires-Dist: statsmodels ==0.14.6
Requires-Dist: holidays ~=0.89
Requires-Dist: pyarrow ==21.0.0
Requires-Dist: matplotlib ==3.9.2
Requires-Dist: scipy ==1.14.1
Requires-Dist: jinja2 ==3.1.6
Requires-Dist: pillow ==12.2.0
Requires-Dist: protobuf ==6.33.5
Requires-Dist: mcp[cli] ==1.26.0
Requires-Dist: uvicorn ==0.41.0
Requires-Dist: PyJWT ==2.12.0
Requires-Dist: bcrypt ==5.0.0
Requires-Dist: authlib ~=1.6.6
Requires-Dist: fastapi ==0.135.1
Provides-Extra: dev
Requires-Dist: invoke ==2.2.0 ; extra == 'dev'
Requires-Dist: ruff ==0.4.1 ; extra == 'dev'
Requires-Dist: mypy ==1.9.0 ; extra == 'dev'
Requires-Dist: pre-commit ==3.6.2 ; extra == 'dev'
Requires-Dist: types-PyYAML ==6.0.12.20240311 ; extra == 'dev'
Requires-Dist: types-setuptools ==69.2.0.20240317 ; extra == 'dev'
Requires-Dist: semver ==3.0.2 ; extra == 'dev'
Requires-Dist: pytest ==8.1.1 ; extra == 'dev'
Requires-Dist: pytest-cov ==4.1.0 ; extra == 'dev'
Requires-Dist: tomli ==2.0.1 ; extra == 'dev'
Provides-Extra: release
Requires-Dist: build ==1.2.1 ; extra == 'release'
Requires-Dist: bumpver ==2023.1129 ; extra == 'release'
Requires-Dist: twine ==5.1.1 ; extra == 'release'
Provides-Extra: standalone
Requires-Dist: pixeltable-pgserver >=0.5.1 ; extra == 'standalone'

# DataOps Data Quality TestGen
![apache 2.0 license Badge](https://img.shields.io/badge/License%20-%20Apache%202.0%20-%20blue) ![PRs Badge](https://img.shields.io/badge/PRs%20-%20Welcome%20-%20green) [![Latest Version](https://img.shields.io/badge/dynamic/json?url=https%3A%2F%2Fhub.docker.com%2Fv2%2Frepositories%2Fdatakitchen%2Fdataops-testgen%2Ftags%2F&query=results%5B0%5D.name&label=latest%20version&color=06A04A)](https://hub.docker.com/r/datakitchen/dataops-testgen) [![Docker Pulls](https://img.shields.io/badge/dynamic/json?url=https%3A%2F%2Fhub.docker.com%2Fv2%2Frepositories%2Fdatakitchen%2Fdataops-testgen%2F&query=pull_count&style=flat&label=docker%20pulls&color=06A04A)](https://hub.docker.com/r/datakitchen/dataops-testgen) [![Documentation](https://img.shields.io/badge/docs-On%20datakitchen.io-06A04A?style=flat)](https://docs.datakitchen.io/testgen/what-is-testgen/) [![Static Badge](https://img.shields.io/badge/Slack-Join%20Discussion-blue?style=flat&logo=slack)](https://data-observability-slack.datakitchen.io/join)

*<p style="text-align: center;">DataOps Data Quality TestGen, or "TestGen" for short, can help you find data issues so you can alert your users and notify your suppliers. It does this by delivering simple, fast data quality test generation and execution by data profiling, new dataset screening and hygiene review, algorithmic generation of data quality validation tests, ongoing production testing of new data refreshes, and continuous anomaly monitoring of datasets. TestGen is part of DataKitchen's Open Source Data Observability.</p>*

## Documentation

[DataOps TestGen Overview](https://datakitchen.io/dataops-testgen-product/)

[DataOps TestGen Documentation](https://docs.datakitchen.io/testgen/what-is-testgen/)


## Features

[Interactive Product Tour](https://datakitchen.storylane.io/share/byag8vimd5tn)

What does DataKitchen's DataOps Data Quality TestGen do? It helps you understand and <b>find data issues in new data</b>.
<p align="center">
<img alt="DatKitchen Open Source Data Quality TestGen Features - New Data" src="https://datakitchen.io/wp-content/uploads/2024/07/Screenshot-2024-07-23-at-2.22.57 PM.png" width="70%">
</p>
It constantly <b>watches your data for data quality anomalies</b> and lets you drill into problems.
<br></br>
<p align="center">
<img alt="DataKitchen Open Source Data Quality TestGen Features - Data Ingestion and Quality Testing" src="https://datakitchen.io/wp-content/uploads/2024/07/Screenshot-2024-07-23-at-2.23.07 PM.png" width="70%">
</p>
A <b>single place to manage Data Quality</b> across data sets, locations, and teams.
<br></br>
<p align="center">
<img alt="DataKitchen Open Source Data Quality TestGen Features - Single Place" src="https://datakitchen.io/wp-content/uploads/2024/07/Screenshot-dataops-testgen-centralize.png" width="70%">
</p>

## Installation

The [dk-installer](https://github.com/DataKitchen/data-observability-installer/?tab=readme-ov-file#install-the-testgen-application) program installs TestGen in either Docker or pip mode. For complete instructions, see the documentation:

* [Install on Mac/Linux](https://docs.datakitchen.io/testgen/get-started/install-on-mac-linux/)
* [Install on Windows](https://docs.datakitchen.io/testgen/get-started/install-on-windows/)

## What Next?

### Getting started guide
We recommend you start by going through the [Data Observability Overview Demo](https://docs.datakitchen.io/tutorials/quickstart-demo/).

### Support
For support requests, [join the Data Observability Slack](https://data-observability-slack.datakitchen.io/join) 👋 and post on the `#support` channel.

### Connect to your database
Follow [these instructions](https://docs.datakitchen.io/testgen/connect-your-database/) to improve the quality of data in your database.

### Community
Talk and learn with other data practitioners who are building with DataKitchen. Share knowledge, get help, and contribute to our open-source project.

Join our community here:

* 👋 [Join us on Slack](https://data-observability-slack.datakitchen.io/join), this is also how you get support (see above)

* 🌟 [Star us on GitHub](https://github.com/DataKitchen/data-observability-installer)

* 🐦 [Follow us on Twitter](https://twitter.com/i/flow/login?redirect_after_login=%2Fdatakitchen_io)

* 🕴️ [Follow us on LinkedIn](https://www.linkedin.com/company/datakitchen)

* 📺 [Get Free DataOps Fundamentals Certification](https://info.datakitchen.io/training-certification-dataops-fundamentals)

* 📚 [Read our blog posts](https://datakitchen.io/blog/)

* 🗃 [Sign The DataOps Manifesto](https://DataOpsManifesto.org)

* 🗃 [Sign The Data Journey Manifesto](https://DataJourneyManifesto.org)


### Contributing
For details on contributing or running the project for development, check out our [contributing guide](CONTRIBUTING.md).

### License
DataKitchen's DataOps Data Quality TestGen is Apache 2.0 licensed.
