Metadata-Version: 2.1
Name: tracebloc_ingestor
Version: 0.1.0
Summary: A flexible data ingestion library for various file formats
Home-page: https://github.com/tracebloc/data-ingestors
Author: Tracebloc
Author-email: support@tracebloc.com
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE

# Data Ingestors
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

## 📄 Description
A robust data ingestion framework for machine learning pipelines. This repository provides tools and utilities for managing, processing, and validating training/test datasets. It supports various data sources, formats, and processing pipelines, making it easier to create and maintain ML datasets.

## 🛠️ Tech Stack
- Python 3.8 or later
- Docker (for containerization)
- Data processing libraries (Pandas, NumPy)

## 🚀 Installation & Usage Instructions
1. Clone the repository: https://github.com/tracebloc/data-ingestors
2. Install dependencies:
   ```bash
   pip install -r src/requirements.txt
   ```
3. Configure your environment
4. Follow the documentation guide to [Create Your Training/Test Dataset](https://traceblocdocsdev.azureedge.net/environment-setup/create-your-dataset)
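
Before installing dependencies, it can help to confirm that the interpreter meets the package's declared requirement (`Requires-Python: >=3.8`). The snippet below is a minimal sketch of such a check; the helper name `python_is_supported` is hypothetical and is not part of this package.

```python
import sys

# Minimum version taken from the package metadata (Requires-Python: >=3.8).
MIN_PYTHON = (3, 8)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True when the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum
```

Calling `python_is_supported()` with no arguments checks the running interpreter; passing a tuple such as `(3, 7, 9)` checks an arbitrary version.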

## 📦 Features
- Multi-source data ingestion
- Data validation and preprocessing
- Database integration
- API endpoints for data management
- Containerized deployment
- Kubernetes support
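
As a rough illustration of what "data validation and preprocessing" can look like in practice, here is a minimal, standard-library-only sketch that checks CSV rows against a schema and separates valid rows from rejected ones. It does not use or describe this package's API: the `SCHEMA` mapping, the `validate_rows` function, and the sample data are all assumptions made for the example.

```python
import csv
import io

# Hypothetical schema for illustration: column name -> type converter.
SCHEMA = {"id": int, "label": str, "value": float}


def validate_rows(text, schema=SCHEMA):
    """Parse CSV text and return (valid_rows, errors)."""
    reader = csv.DictReader(io.StringIO(text))
    missing = set(schema) - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")

    valid, errors = [], []
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(reader, start=2):
        try:
            # Preprocess (strip whitespace), then convert to the declared type.
            valid.append(
                {name: cast(row[name].strip()) for name, cast in schema.items()}
            )
        except (ValueError, AttributeError) as exc:
            # Record the line number and reason instead of aborting the run.
            errors.append((line_no, str(exc)))
    return valid, errors


sample = "id,label,value\n1,cat,0.5\n2,dog,not-a-number\n3, bird ,1.25\n"
rows, problems = validate_rows(sample)
```

Collecting errors per line rather than failing on the first bad row lets an ingestion run report every problem in a file at once; a missing column is treated as fatal because no row could pass.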


## 📜 License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 📞 Support
For additional support or questions, please refer to our documentation or contact the Tracebloc support team at `support@tracebloc.io`.
