Metadata-Version: 2.4
Name: vision_data_curation
Version: 0.0.1.dev1
Summary: A pipeline for curating and sanitizing large-scale image datasets.
Author: Ofer Hasson
License-Expression: Apache-2.0
Project-URL: Homepage, https://gitlab.com/birder/vision-data-curation
Project-URL: Issues, https://gitlab.com/birder/vision-data-curation/-/issues
Keywords: image-processing,data-curation,computer-vision,pytorch,deep-learning
Classifier: Development Status :: 2 - Pre-Alpha
Classifier: Intended Audience :: Science/Research
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.11
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: birder
Requires-Dist: matplotlib>=3.9.0
Requires-Dist: numpy>=2.2.0
Requires-Dist: Pillow>=11.0.0
Requires-Dist: polars>=1.31.0
Requires-Dist: pt-kmeans
Requires-Dist: tqdm>=4.67.0
Requires-Dist: webdataset>=0.2.111
Dynamic: license-file

# Vision Data Curation

This repository is dedicated to building a robust and efficient pipeline for curating large-scale image datasets,
preparing them for various machine learning and computer vision tasks.

## License

This project is licensed under the Apache-2.0 License - see the [LICENSE](https://gitlab.com/birder/vision-data-curation/blob/main/LICENSE) file for details.
