Metadata-Version: 2.4
Name: chartoken-vp
Version: 2.0.1
Summary: Character-level tokenizer and typed morphological feature vocabulary for multilingual NLP pipelines.
Author: F000NK, Voluntas Progressus
License-Expression: MIT
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.14
Classifier: Operating System :: OS Independent
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: >=3.14
Description-Content-Type: text/markdown
Requires-Dist: torch>=2.0.0

# chartoken-vp

`chartoken-vp` is a typed character-level tokenization package for multilingual morphology pipelines.

It provides:

- `CharVocab` for character vocabularies and text encoding
- `FeatureVocab` for morphological feature encoding
- serialization helpers for checkpoints and reproducible training
- `py.typed` support for downstream type checkers

Install:

```bash
pip install chartoken-vp
```
