Metadata-Version: 2.2
Name: prepup_linux
Version: 0.2.1
Summary: Prepup is a free, open-source package for data preprocessing in terminal
Home-page: https://github.com/sudhanshumukherjeexx/prepup-linux
Author: Sudhanshu Mukherjee
Author-email: sudhanshumukherjeexx@gmail.com
Project-URL: Bug Reports, https://github.com/sudhanshumukherjeexx/prepup-linux/issues
Project-URL: Source, https://github.com/sudhanshumukherjeexx/prepup-linux
Keywords: data-preprocessing machine-learning data-science
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.7
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: blessed==1.19.1
Requires-Dist: imbalanced_learn==0.10.1
Requires-Dist: imblearn==0.0
Requires-Dist: joblib==1.2.0
Requires-Dist: matplotlib==3.10.1
Requires-Dist: numpy==2.2.3
Requires-Dist: pandas==2.2.3
Requires-Dist: plotext==5.2.8
Requires-Dist: pydantic==1.10.4
Requires-Dist: pyfiglet==0.8.post1
Requires-Dist: pytest==7.3.1
Requires-Dist: scikit_learn==1.6.1
Requires-Dist: scipy==1.15.2
Requires-Dist: termcolor==2.3.0
Requires-Dist: pyarrow==19.0.1
Requires-Dist: seaborn==0.11.2
Requires-Dist: tqdm==4.66.1
Requires-Dist: lightgbm==4.6.0
Requires-Dist: xgboost==2.1.4
Provides-Extra: dev
Requires-Dist: pytest; extra == "dev"
Requires-Dist: twine; extra == "dev"
Requires-Dist: wheel; extra == "dev"
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: keywords
Dynamic: project-url
Dynamic: provides-extra
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# 💻 Prepup: Interactive Data Preprocessing Toolkit

![Static Badge](https://img.shields.io/badge/Built_with_%E2%99%A5%EF%B8%8F-Sudhanshu_Mukherjee-black?link=https%3A%2F%2Fwww.linkedin.com%2Fin%2Fsudhanshumukherjeexx%2F)

![Python Versions](https://img.shields.io/badge/python-3.7+-blue.svg)
[![image](https://img.shields.io/pypi/v/prepup-linux.svg)](https://pypi.python.org/pypi/prepup-linux)
![Static Badge](https://img.shields.io/badge/Linux-Supported-green)
![Static Badge](https://img.shields.io/badge/macOS-Supported-blue)
![Static Badge](https://img.shields.io/badge/Ubuntu-Supported-red)
![License](https://img.shields.io/badge/license-MIT-green.svg)



## 🚀 Quick Overview

Prepup is a terminal-based data preprocessing tool that lets you explore, clean, and prepare datasets without leaving the command line. It is aimed at data scientists, analysts, and researchers who want an interactive, menu-driven workflow.

## ✨ Features

### Interactive Mode
- 📊 Load datasets from various formats (CSV, Excel, Parquet)
- 🔍 Comprehensive data inspection
- 📈 Advanced data exploration
- 🧹 Missing value handling
- 📊 Feature visualization
- 🤖 Automated Machine Learning (AutoML) model selection

### Key Functionalities
- Data Loading
- Feature Inspection
- Correlation Analysis
- Distribution Checking
- Outlier Detection
- Missing Value Imputation
- Feature Standardization
- Automatic Model Training

## 🛠 Installation

### Using pip
```bash
pip install prepup-linux
```

### From Source
```bash
git clone https://github.com/sudhanshumukherjeexx/prepup-linux.git
cd prepup-linux
pip install .
```

## 💻 Usage

### Interactive Mode
```bash
prepup
```

### Loading a Specific Dataset
```bash
prepup path/to/your/dataset.csv
```
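
The formats listed under Features (CSV, Excel, Parquet) each have a matching pandas reader. As a rough sketch of how extension-based loading could work, the snippet below dispatches on the file suffix. The `load_dataset` helper and the `READERS` table are hypothetical names used for illustration and are not taken from Prepup's source.

```python
from pathlib import Path

import pandas as pd

# Hypothetical mapping from file extension to a pandas reader.
# Prepup's own loader may be organised differently.
READERS = {
    ".csv": pd.read_csv,
    ".xlsx": pd.read_excel,
    ".xls": pd.read_excel,
    ".parquet": pd.read_parquet,
}


def load_dataset(path):
    """Load a tabular file into a DataFrame based on its extension."""
    suffix = Path(path).suffix.lower()
    try:
        reader = READERS[suffix]
    except KeyError:
        raise ValueError(f"Unsupported file type: {suffix!r}") from None
    return reader(path)
```

Calling `load_dataset("path/to/your/dataset.csv")` returns a DataFrame, while an unsupported extension raises `ValueError` instead of failing later in the workflow.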

### Main Menu Options
1. Load Dataset
2. Inspect Data
3. Explore Data
4. Visualize Data
5. Impute Missing Values
6. Standardize Features
7. Export Data
8. AutoML (Train & Evaluate Models)

## 🎮 Interactive Workflow Example

1. **Launch Prepup:** Run `prepup`

2. **Load Your Dataset:** Choose option 1 and enter your dataset path

3. **Inspect Data:** Use option 2 to explore features, data types, and missing values

4. **Preprocess:** Impute missing values and standardize features

5. **Analyze:** Visualize data distributions, perform correlation analysis, and run AutoML for model selection
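
For readers who want to see what the preprocessing step amounts to outside the interactive menu, here is a minimal pandas sketch of median imputation followed by z-score standardization. The `impute_and_standardize` function is a hypothetical illustration. Prepup may offer other imputation strategies and is not guaranteed to use this exact approach.

```python
import pandas as pd


def impute_and_standardize(df):
    """Fill missing numeric values with the column median, then z-score them."""
    out = df.copy()
    numeric_cols = out.select_dtypes(include="number").columns

    # Median imputation is robust to outliers; mean or mode are alternatives.
    out[numeric_cols] = out[numeric_cols].fillna(out[numeric_cols].median())

    # Standardize: subtract the mean and divide by the population standard
    # deviation (ddof=0). Constant columns get a divisor of 1 to avoid
    # division by zero.
    means = out[numeric_cols].mean()
    stds = out[numeric_cols].std(ddof=0).replace(0, 1.0)
    out[numeric_cols] = (out[numeric_cols] - means) / stds
    return out


# Small example frame (made-up values) with one missing numeric entry.
df = pd.DataFrame({"age": [20.0, None, 40.0], "city": ["A", "B", "C"]})
clean = impute_and_standardize(df)
```

Non-numeric columns such as `city` pass through unchanged, so categorical features can be handled separately.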

## 🤖 AutoML Capabilities
- Supports both Classification and Regression tasks
- Evaluates multiple machine learning algorithms
- Provides performance metrics
- Saves results to CSV
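
The general idea behind this kind of model comparison can be sketched with scikit-learn: score several candidate estimators with cross-validation, rank them, and write the table to CSV. Everything named below (the `compare_models` helper, the candidate list, the synthetic data, and the `automl_results.csv` file name) is an assumption made for illustration. The models and metrics Prepup actually uses may differ.

```python
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier


def compare_models(X, y, candidates, cv=5):
    """Cross-validate each candidate and return a table ranked by accuracy."""
    rows = []
    for name, model in candidates.items():
        scores = cross_val_score(model, X, y, cv=cv, scoring="accuracy")
        rows.append(
            {
                "model": name,
                "mean_accuracy": scores.mean(),
                "std_accuracy": scores.std(),
            }
        )
    ranked = pd.DataFrame(rows).sort_values("mean_accuracy", ascending=False)
    return ranked.reset_index(drop=True)


# Synthetic data stands in for a preprocessed dataset.
X, y = make_classification(n_samples=200, n_features=8, random_state=0)

# Illustrative candidates only.
candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "decision_tree": DecisionTreeClassifier(random_state=0),
    "random_forest": RandomForestClassifier(n_estimators=50, random_state=0),
}

results = compare_models(X, y, candidates)
results.to_csv("automl_results.csv", index=False)  # hypothetical file name
```

For a regression task the same loop applies with regressors and a scoring string such as `"r2"` or `"neg_mean_squared_error"`.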

## 📦 Dependencies
- NumPy
- Pandas
- Scikit-learn
- Matplotlib
- And more (see `requirements.txt`)

## 🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request.

1. Fork the repository
2. Create your feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request

## 📋 License
Distributed under the MIT License. See `LICENSE` for more information.


## 🔗 Package Links

- GitHub: https://github.com/sudhanshumukherjeexx/prepup-linux
- Documentation: https://sudhanshumukherjeexx.github.io/prepup-linux



