Metadata-Version: 2.4
Name: pandas-dupcol
Version: 0.1.2
Summary: Find and remove duplicate columns in pandas DataFrames
Author: Sushil Poudel Chhetri
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas
Dynamic: license-file

# pandas-dupcol

A lightweight Python utility for finding and removing duplicate columns in pandas DataFrames.

## Installation

```bash
pip install pandas-dupcol
```

## Features

- Detect duplicate columns in pandas DataFrames
- Remove duplicate columns efficiently
- Uses hashing to shortlist candidate columns, then confirms each match with an equality check
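
The package's internals are not reproduced here. As a rough illustration of the last point, the sketch below shows one way hash-based detection with equality verification could work. The function name `find_duplicate_columns_sketch` is hypothetical and is not part of the pandas-dupcol API; the hashing relies on `pandas.util.hash_pandas_object`, and the "keep the first occurrence" behaviour is an assumption.

```python
import pandas as pd
from pandas.util import hash_pandas_object


def find_duplicate_columns_sketch(df: pd.DataFrame) -> list:
    """Hypothetical helper: labels of columns whose values repeat an earlier column."""
    seen = {}  # fingerprint -> positions of distinct columns sharing that fingerprint
    duplicates = []
    for pos in range(df.shape[1]):
        col = df.iloc[:, pos]
        # Cheap fingerprint built from per-row hashes of the values (index excluded).
        key = hash_pandas_object(col, index=False).to_numpy().tobytes()
        candidates = seen.setdefault(key, [])
        # Equal fingerprints only suggest a match; confirm with an exact comparison.
        if any(col.equals(df.iloc[:, prev]) for prev in candidates):
            duplicates.append(df.columns[pos])
        else:
            candidates.append(pos)
    return duplicates
```

Hashing keeps the number of full column comparisons small, while the `equals` check guards against hash collisions reporting a false duplicate.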

## Usage

```python
import pandas as pd
import pandas_dupcol as pdc

df = pd.DataFrame({
    "A": [1, 2, 3],
    "B": [4, 5, 6],
    "C": [1, 2, 3]
})

duplicates = pdc.find_duplicate_columns(df)

print(duplicates)
```

Output:

```python
['C']
```

## Remove Duplicate Columns

```python
cleaned_df = pdc.drop_duplicate_columns(df)

print(cleaned_df)
```

Output:

```text
   A  B
0  1  4
1  2  5
2  3  6
```
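
For comparison, a similar result can be obtained with plain pandas by transposing the frame and using `DataFrame.duplicated`. This is a well-known idiom, not the method pandas-dupcol uses, and it is shown only as a way to cross-check results on small frames; the variable name `deduped` is illustrative.

```python
import pandas as pd

df = pd.DataFrame({
    "A": [1, 2, 3],
    "B": [4, 5, 6],
    "C": [1, 2, 3]
})

# Transpose so each column becomes a row, flag rows that repeat an
# earlier row, then keep only the columns that were not flagged.
deduped = df.loc[:, ~df.T.duplicated()]

print(list(deduped.columns))
```

Transposing copies the data and can coerce mixed column types to `object`, so this approach tends to be slow on wide or large frames.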

## Author

Sushil Poudel Chhetri
