Metadata-Version: 2.2
Name: gllm-misc-binary
Version: 0.8.8
Summary: A library containing miscellaneous components for Gen AI applications.
Author-email: Dimitrij Ray <dimitrij.ray@gdplabs.id>, Henry Wicaksono <henry.wicaksono@gdplabs.id>, Resti Febriana <resti.febriana@gdplabs.id>, Kadek Denaya <kadek.d.r.diana@gdplabs.id>
Requires-Python: <3.13,>=3.11
Description-Content-Type: text/markdown
Requires-Dist: poetry<3.0.0,>=2.1.3
Requires-Dist: gllm-core-binary<0.4.0,>=0.3.0
Requires-Dist: gllm-inference-binary[google]<0.6.0,>=0.5.0
Requires-Dist: gllm-datastore-binary[chroma]<0.6.0,>=0.5.0
Requires-Dist: azure-search-documents<12.0.0,>=11.5.1
Requires-Dist: langchain<0.4.0,>=0.3.0
Requires-Dist: pycountry<25.0.0,>=24.6.1
Requires-Dist: python-magic<0.5.0,>=0.4.27; sys_platform != "win32"
Requires-Dist: python-magic-bin<0.5.0,>=0.4.14; sys_platform == "win32"
Requires-Dist: semantic-router<0.2.0,>=0.1.0
Requires-Dist: transformers<5.0.0,>=4.52.4
Provides-Extra: dev
Requires-Dist: coverage<8.0.0,>=7.4.4; extra == "dev"
Requires-Dist: mypy<2.0.0,>=1.15.0; extra == "dev"
Requires-Dist: pre-commit<4.0.0,>=3.7.0; extra == "dev"
Requires-Dist: pytest<9.0.0,>=8.1.1; extra == "dev"
Requires-Dist: pytest-asyncio<0.24.0,>=0.23.6; extra == "dev"
Requires-Dist: pytest-cov<6.0.0,>=5.0.0; extra == "dev"
Requires-Dist: ruff<0.7.0,>=0.6.7; extra == "dev"
Provides-Extra: audio
Requires-Dist: google-cloud-speech<3.0.0,>=2.28.0; extra == "audio"
Requires-Dist: google-cloud-storage<3.0.0,>=2.18.2; extra == "audio"
Requires-Dist: langcodes<4.0.0,>=3.4.1; extra == "audio"
Requires-Dist: openai<2.0.0,>=1.98.0; extra == "audio"
Requires-Dist: soundfile<0.14.0,>=0.13.1; extra == "audio"
Requires-Dist: youtube-transcript-api<2.0.0,>=1.1.0; extra == "audio"
Requires-Dist: yt-dlp>=2025.6.9; extra == "audio"
Provides-Extra: google-translate
Requires-Dist: googletrans<5.0.0,>=4.0.2; extra == "google-translate"
Provides-Extra: image
Requires-Dist: boto3<2.0.0,>=1.38.10; extra == "image"
Requires-Dist: pillow<12.0.0,>=11.2.1; extra == "image"
Requires-Dist: aioresponses<0.8.0,>=0.7.0; extra == "image"
Provides-Extra: json-repair
Requires-Dist: json-repair<0.48.0,>=0.47.6; extra == "json-repair"
Provides-Extra: langdetect
Requires-Dist: langdetect<2.0.0,>=1.0.9; extra == "langdetect"
Provides-Extra: language
Provides-Extra: llmlingua
Requires-Dist: llmlingua<0.3.0,>=0.2.2; extra == "llmlingua"
Requires-Dist: torch==2.2.2; (sys_platform == "darwin" and platform_machine == "x86_64") and extra == "llmlingua"
Requires-Dist: torch<3.0.0,>=2.2.2; (sys_platform == "darwin" and platform_machine == "arm64") and extra == "llmlingua"
Requires-Dist: torch<3.0.0,>=2.2.2; sys_platform != "darwin" and extra == "llmlingua"
Provides-Extra: kg

# GLLM Misc

## Description

A library containing miscellaneous utilities and helper functions for Generative AI applications.

## Installation

### Prerequisites
1. Python 3.11+ - [Install here](https://www.python.org/downloads/)
2. Pip (if using Pip) - [Install here](https://pip.pypa.io/en/stable/installation/)
3. Poetry (automatically installed via Makefile) - [Install here](https://python-poetry.org/docs/#installation)
4. Git (if using Git) - [Install here](https://git-scm.com/downloads)
5. For artifact registry installation:
   - Google Cloud CLI - [[installed](https://cloud.google.com/sdk/docs/) and [authenticated](https://cloud.google.com/docs/authentication/gcloud)]
   - Access to the [GDP Labs SDK artifact registry](https://console.cloud.google.com/artifacts/python/gdp-labs/asia-southeast2/gen-ai?project=gdp-labs)
6. gcloud CLI (for authentication) - [Install here](https://cloud.google.com/sdk/docs/install)
7. For git installation, access to the [GDP Labs SDK github repository](https://github.com/GDP-ADMIN/gl-sdk)

### 1. Installation from Artifact Registry
Choose one of the following methods to install the package:

#### Using pip
```bash
pip install keyring keyrings.gl-artifactregistry-auth
pip install gllm-misc-binary --index-url https://glsdk.gdplabs.id/gen-ai/simple
```

#### Using Poetry
```bash
poetry source add --priority=explicit gen-ai https://glsdk.gdplabs.id/gen-ai/simple
poetry config http-basic.gen-ai oauth2accesstoken "$(gcloud auth print-access-token)"
poetry add --source gen-ai gllm-misc-binary
```

### 2. Development Installation (Git)
For development purposes, you can install directly from the Git repository:
```bash
git clone git@github.com:GDP-ADMIN/gl-sdk.git
cd gl-sdk/libs/gllm-misc
```

## Local Development Setup

### Quick Setup (Recommended)
For local development with editable gllm packages, use the provided Makefile:

```bash
# Complete setup: installs Poetry, configures auth, installs packages, sets up pre-commit
make setup
```

The following are the available Makefile targets:

1. `make setup` - Complete development setup (recommended for new developers)
2. `make install-poetry` - Install or upgrade Poetry to the latest version
3. `make auth` - Configure authentication for internal repositories
4. `make install` - Install all dependencies
5. `make install-pre-commit` - Set up pre-commit hooks
6. `make update` - Update dependencies

### Manual Development Setup (Legacy)
If you prefer to manage dependencies manually:

1. Go to root folder of `gllm-misc` module, e.g. `cd libs/gllm-misc`.
2. Run `poetry shell` to create a virtual environment.
3. Run `poetry lock` to create a lock file if you haven't done it yet.
4. Run `poetry install` to install the `gllm-misc` requirements for the first time.
5. Run `poetry update` if you update any dependency module version at `pyproject.toml`.


## Contributing
Please refer to this [Python Style Guide](https://docs.google.com/document/d/1uRggCrHnVfDPBnG641FyQBwUwLoFw0kTzNqRm92vUwM/edit?usp=sharing)
to get information about code style, documentation standard, and SCA that you need to use when contributing to this project

### Getting Started with Development
1. Clone the repository and navigate to the gllm-misc directory
2. Run `make setup` to set up your development environment
3. Run `which python` to get the path to be referenced at Visual Studio Code interpreter path (`Ctrl`+`Shift`+`P` or `Cmd`+`Shift`+`P`)
4. Try running the unit test to see if it's working:
```bash
poetry run pytest -s tests/unit_tests/
```
5. When you want to update the dependencies, run `make update`
