Metadata-Version: 2.4
Name: banglish-stopwords
Version: 0.1.0
Summary: A high-performance library to filter Banglish stopwords from text.
Home-page: https://github.com/b-a-sabbir/banglish-stopwords.git
Author: Benjir Ahammed Sabbir
Author-email: bengirahammedsabbir123@gmail.com
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Topic :: Text Processing :: Linguistic
Classifier: Intended Audience :: Developers
Requires-Python: >=3.6
Description-Content-Type: text/markdown
License-File: LICENSE.txt
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: license-file
Dynamic: requires-python
Dynamic: summary

# Banglish Stopwords

A high-performance, lightweight Python library to filter Banglish (Bengali written in Latin/English script) stopwords. It helps in cleaning text data for NLP, sentiment analysis, and machine learning projects.

## Features
- **Comprehensive Dataset:** Includes 350+ Bengali words and their popular Banglish variations.
- **Lazy Typing Support:** Handles repeated characters automatically (e.g., `naaaa` -> `na`, `hbeee` -> `hbe`).
- **High Performance:** Uses optimized sets for O(1) lookup speed.
- **Punctuation Aware:** Cleans text while respecting common punctuation.

## Installation

You can install the library using pip (after you publish it):
```bash
pip install banglish-stopwords
